-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
2 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,7 +1,7 @@ | ||
# CKB-JAX-HTML-Parser | ||
This repository contains a very short workflow that parses the CKB CORE data housed at https://ckb.jax.org/ | ||
|
||
##Notes: | ||
**Notes**: | ||
|
||
These data were pulled using the attached parse script, which was run in Python 3.11. An env file is provided and can install necessary modules provided that pip3 is present and the user has appropriate permissions. | ||
Briefly, the source HTML in the main gene page is used to build the first table (housed in the CKB_CORE tab). The links in this page are used to identify the individual gene pages. The remaining tabs are populated by parsing each single-gene page. | ||
|
@@ -12,7 +12,7 @@ For this spreadsheet: Date of Access 4-22-2024; 18:30. The database pull and cur | |
For terms, glossary, curation information, etc. please see: https://ckb.jax.org/about/curationMethodology | ||
Contact: For comments, questions, etc., please use either [email protected] or [email protected] | ||
|
||
##Known_Issues: | ||
**Known_Issues**: | ||
|
||
The data provided differ somewhat from the raw output of the python functions provided. Specifically, 5 or so regular expressions were used to clean raw output (e.g., multi-line fields). Changes to content are not made. | ||
Manual inspection indicate that some of the data itself could be regarded as suspect or definitions used as somewhat arbitrary. As an example, consider the information contained in the EfficacyEvidence tab related to the N549K variant in FGFR2. | ||
|