Commit
Update README.md
LauferVA authored Apr 23, 2024
1 parent 1cfffd1 commit 31355cd
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
@@ -1,7 +1,7 @@
# CKB-JAX-HTML-Parser
This repository contains a very short workflow that parses the CKB CORE data housed at https://ckb.jax.org/

-##Notes:
+**Notes**:

These data were pulled using the attached parse script, which was run in Python 3.11. An env file is provided and can be used to install the necessary modules, as long as pip3 is present and the user has appropriate permissions.
Briefly, the source HTML in the main gene page is used to build the first table (housed in the CKB_CORE tab). The links in this page are used to identify the individual gene pages. The remaining tabs are populated by parsing each single-gene page.
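The two-step approach described above (build a table from the main page, then collect the links that lead to the per-gene pages) can be sketched as follows. This is only an illustration under stated assumptions, not the repository's parse script: the class `GeneTableParser`, the helper `parse_gene_table`, and the sample HTML with its link paths are all hypothetical, and only the Python standard library is used.

```python
from html.parser import HTMLParser


class GeneTableParser(HTMLParser):
    """Collect table rows (as lists of cell text) and hrefs found inside cells."""

    def __init__(self):
        super().__init__()
        self.rows = []    # one list of cell strings per <tr>
        self.links = []   # hrefs of <a> tags seen inside table cells
        self._row = None  # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []
        elif tag == "a" and self._cell is not None:
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Hypothetical stand-in for the main gene page; the real markup may differ.
SAMPLE_HTML = """
<table>
  <tr><th>Gene</th><th>Variants</th></tr>
  <tr><td><a href="/example/gene/1">GENE_A</a></td><td>12</td></tr>
  <tr><td><a href="/example/gene/2">GENE_B</a></td><td>3</td></tr>
</table>
"""


def parse_gene_table(html):
    """Return (rows, links) extracted from an HTML string."""
    parser = GeneTableParser()
    parser.feed(html)
    parser.close()
    return parser.rows, parser.links


rows, links = parse_gene_table(SAMPLE_HTML)
```

In this sketch `rows` would feed the first tab, and each entry in `links` would be fetched and parsed in turn to fill the remaining tabs.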
@@ -12,7 +12,7 @@ For this spreadsheet: Date of Access 4-22-2024; 18:30. The database pull and cur
For terms, glossary, curation information, etc. please see: https://ckb.jax.org/about/curationMethodology
Contact: For comments, questions, etc., please use either [email protected] or [email protected]

-##Known_Issues:
+**Known_Issues**:

The data provided differ somewhat from the raw output of the Python functions provided. Specifically, about five regular expressions were used to clean the raw output (e.g., multi-line fields). No changes were made to the content itself.
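The regular expressions actually used are not shown here. As a hedged illustration of the kind of cleanup described (collapsing a multi-line field onto one line without altering its content), a minimal sketch might look like this; the function name `collapse_multiline` is hypothetical.

```python
import re


def collapse_multiline(field):
    """Join a multi-line field into one line and squeeze repeated spaces."""
    # Replace each line break, plus any whitespace around it, with one space.
    joined = re.sub(r"\s*[\r\n]+\s*", " ", field)
    # Squeeze remaining runs of two or more spaces/tabs, then trim the ends.
    return re.sub(r"[ \t]{2,}", " ", joined).strip()


cleaned = collapse_multiline("first line\n   second line\r\n\tthird")
```

Only whitespace is touched, which is consistent with the statement that the content itself is unchanged.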
Manual inspection indicates that some of the data could be regarded as suspect, or the definitions used as somewhat arbitrary. As an example, consider the information contained in the EfficacyEvidence tab related to the N549K variant in FGFR2.