Load additional ConceptWiki linksets #22

Open
stain opened this issue Oct 14, 2015 · 9 comments

@stain
Contributor

stain commented Oct 14, 2015

.. but which ones? Stian to provide list for further discussion.

@stain added this to the 2.1 milestone Oct 14, 2015
@stain self-assigned this Oct 14, 2015
@stain
Contributor Author

stain commented Oct 14, 2015

@ianwdunlop
Member

Any chance we can avoid 'hacks'? If they are valid, then they should become part of the actual VoID file.

@stain
Contributor Author

stain commented Oct 14, 2015

ConceptWiki linksets that were loaded in 1.5:

So which ones should also go in?

@stain assigned danidi and unassigned stain Oct 14, 2015
@danidi
Contributor

danidi commented Oct 19, 2015

We'll definitely need a linkset from the ConceptWiki gene IDs to something else; otherwise the text search with gene identifiers will not work. Currently, the only linkset available maps ConceptWiki genes to UniProt proteins, which prevents us from getting any gene data back (that would need an additional mapping from UniProt proteins to a gene identifier, which @AlasdairGray mentioned the IMS would not do in one step).

Enzyme and GO would be useful for text search over the classes, but users might find concepts that are not exactly the same (see #18).

@AlasdairGray
Member

Is there another set of gene identifiers in the system that ConceptWiki could map to with the justification that they are the same gene?
ConceptWiki gene to UniProt protein is a cross-boundary mapping, which means that it can only be traversed in one direction and only once.
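
To make the traversal rule concrete, here is a minimal Python sketch of one possible reading of it. This is not the IMS implementation: the link table, the `cross_boundary` flag, the `expand` function, and all identifiers are made up for illustration. The assumption is that a cross-boundary link may only be followed from source to target and at most once per path, while same-kind links can be followed in both directions.

```python
from collections import deque

# Hypothetical toy linksets: (source, target, cross_boundary).
LINKS = [
    ("cw:gene1", "uniprot:P1", True),     # ConceptWiki gene -> UniProt protein
    ("ensembl:G1", "uniprot:P1", True),   # Ensembl gene -> UniProt protein
    ("uniprot:P1", "chembl:T1", False),   # same-kind mapping, usable both ways
]

def expand(start, links=LINKS):
    """Return every identifier reachable from start under the toy rule."""
    # Each state records whether a cross-boundary link was already used.
    seen = {(start, False)}
    queue = deque(seen)
    while queue:
        node, crossed = queue.popleft()
        for src, dst, cross in links:
            steps = []
            # Forward: allowed unless this would be a second cross-boundary hop.
            if src == node and not (cross and crossed):
                steps.append((dst, crossed or cross))
            # Backward: only allowed for same-kind (non cross-boundary) links.
            if dst == node and not cross:
                steps.append((src, crossed))
            for step in steps:
                if step not in seen:
                    seen.add(step)
                    queue.append(step)
    return {node for node, _ in seen}

print(sorted(expand("cw:gene1")))
```

Under these assumptions, starting from `cw:gene1` reaches the protein and anything mapped from it, but never `ensembl:G1`, because getting there would mean walking a cross-boundary link backwards. That is the gene → protein → gene path the text search would need.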

@danidi
Contributor

danidi commented Oct 20, 2015

Could UniGene be an option here? I wouldn't go for Ensembl, as we only have the human ones from Jonathan. The question is whether all genes in the CW to UniProt linkset are also available there, and whether it would cause one-to-many issues.

@AlasdairGray
Member

  1. Do we already have UniGene in the system, and if so, does it have a link to UniProt proteins?
  2. We would only have a one-to-many issue if there is a one-to-many relationship between ConceptWiki and UniGene. Do we know whether such a linkset exists?
  3. If we have UniGene, can we use that for gene labels and do away with ConceptWiki for genes?
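
If such a linkset exists, the one-to-many question in point 2 could be checked with something like the sketch below. It assumes the linkset can be exported as plain (source, target) pairs; the `one_to_many` helper and the identifiers are hypothetical.

```python
from collections import defaultdict

def one_to_many(pairs):
    """Return the sources that map to more than one target, with their targets."""
    targets = defaultdict(set)
    for source, target in pairs:
        targets[source].add(target)
    return {s: sorted(t) for s, t in targets.items() if len(t) > 1}

# Made-up pairs for illustration only.
pairs = [
    ("cw:gene1", "unigene:Hs.1"),
    ("cw:gene1", "unigene:Hs.2"),
    ("cw:gene2", "unigene:Hs.3"),
]
print(one_to_many(pairs))
```

An empty result would mean every ConceptWiki gene maps to at most one UniGene entry.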

@danidi
Contributor

danidi commented Oct 20, 2015

We have a linkset, http://ops2.few.vu.nl/QueryExpander/mappingSet/29, but I'm not sure whether it comes with any additional data (or labels).
But looking at the gene linksets we have so far, it seems that many of them are connected to UniProt only. So we will only have gene-to-gene mappings for the ones @JonathanMELIUS included in the human and mouse Ensembl linksets. I'm therefore no longer sure that a direct CW to UniGene linkset would give us an advantage.

@stain
Contributor Author

stain commented Nov 5, 2015

Add the CW linksets that were part of the 1.5 default lens, so that labels and text->concept lookup work before IRS2.
