-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No hits from DIAMOND #170
Comments
Hi, I'm still having this issue. |
Diamond version here is 2.1.9. I just create an new environment and install the dbcan according to our document.
Following is the overview result
Can you have the same overview result like mine? |
Thanks for your reply! I updated my Diamon version to 2.1.9 and tried the above and I'm still having the same problem! It runs as expected, and comes up with no errors, but the diamond.out files and diamond column of the overview.txt file are empty still. Any other thoughts??? |
I've just tried from scratch again, setting up a new environment and installing everything again from scratch and still having the same issue :( looking like this data will just be missing from my dissertation! (Which is due next week) |
I need run_dbcan version too (can't use online) because it's a fungal genome |
Can you provide the data you are using? That does not make sense diamond no hits |
Thanks for replying. I've been using the example data to try to get it to work. Have tried both nucelotide and aa sequences, using And running my query data gave the same issue |
Sounds like your diamond might have not actually worked. If you run "diamond help", do you see the help information?
If you do, you can try to run diamond on your protein file directly on the command line (i.e., not using run_dbcan), like "diamond blastp -d {cazy_indexfile} -e {dia_eval} -q {yourfaafile} -k 1 -o diamond.out -f 6". Let me know if you see any output in diamond.out.
Yanbin
…________________________________
From: JosieMainwaring ***@***.***>
Sent: Tuesday, May 21, 2024 11:39 PM
To: linnabrown/run_dbcan ***@***.***>
Cc: Subscribed ***@***.***>
Subject: Re: [linnabrown/run_dbcan] No hits from DIAMOND (Issue #170)
Caution: Non-NU Email
Thanks for replying. I've been using the example data to try to get it to work. Have tried both nucelotide and aa sequences, using
"run_dbcan EscheriaColiK12MG1655.fna prok --out_dir output_EscheriaColiK12MG1655"
as well as the code you provided above:
"run_dbcan EscheriaColiK12MG1655.faa protein --out_dir output_233 -t diamond"
And running my query data gave the same issue
—
Reply to this email directly, view it on GitHub<#170 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AEXNKZXUJOJZYODAGOPCQXLZDQOORAVCNFSM6AAAAABEYSOU6KVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMRTHA2TCNBXGY>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
Yes, I see the following: Syntax: diamond COMMAND [OPTIONS] Commands: Possible [OPTIONS] for COMMAND can be seen with syntax: diamond COMMAND Online documentation at http://www.diamondsearch.org |
What do I input for cazy_indexfile and dia_eval ? |
Can you install the docker version? This is the fastest way. |
I haven't tried the docker version yet - not familiar with Docker at all. But I'll give it a try Edit: Will it be fastest for a noob who doesn't yet have Docker installed? |
I don't have space on my computer to pull the haidyi/run_dbcan image for Docker setup - I'll have to try through my university HPC tomorrow! Thanks for help so far guys. It's the last piece of data I need - all just to write a couple of numbers into a table! Will be back tomorrow |
So you do have a working diamond installed, but you don't have a protein fasta file. This is likely the reason that you don't have any result in diamond.out (I also have doubt that you will have meaningful result in hmmer.out). For eukaryotic genomes, we suggested that you use protein instead of nucleotide input. This is from the help page of dbCAN web server:
[cid:e0856152-b189-4e22-a5d7-1a8cae1c6fab]
Within run_dbcan, we call prodigal to predict protein coding genes if users input the genome nt fasta. But prodigal is for prokaryote/phage genomes but not designed for eukaryotes, so we do not recommend users use nt input for run_dbcan. Instead you should predict proteins outside of run_dbcan.
Yanbin
…________________________________
From: JosieMainwaring ***@***.***>
Sent: Wednesday, May 22, 2024 12:02 AM
To: linnabrown/run_dbcan ***@***.***>
Cc: Yanbin Yin ***@***.***>; Comment ***@***.***>
Subject: Re: [linnabrown/run_dbcan] No hits from DIAMOND (Issue #170)
Caution: Non-NU Email
Yes, I see the following:
"
(dbcan3) ***@***.***:~$ diamond help
diamond v2.1.9.163 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of Tuebingen
Documentation, support and updates available at http://www.diamondsearch.org
Please cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)
Syntax: diamond COMMAND [OPTIONS]
Commands:
makedb Build DIAMOND database from a FASTA file
prepdb Prepare BLAST database for use with Diamond
blastp Align amino acid query sequences against a protein reference database
blastx Align DNA query sequences against a protein reference database
cluster Cluster protein sequences
linclust Cluster protein sequences in linear time
realign Realign clustered sequences against their centroids
recluster Recompute clustering to fix errors
reassign Reassign clustered sequences to the closest centroid
view View DIAMOND alignment archive (DAA) formatted file
merge-daa Merge DAA files
help Produce help message
version Display version information
getseq Retrieve sequences from a DIAMOND database file
dbinfo Print information about a DIAMOND database file
test Run regression tests
makeidx Make database index
greedy-vertex-cover Compute greedy vertex cover
Possible [OPTIONS] for COMMAND can be seen with syntax: diamond COMMAND
Online documentation at http://www.diamondsearch.org
"
I'll try this for the example data, but for my query sequence I don't have an amino acid file unfortunately!
—
Reply to this email directly, view it on GitHub<#170 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AEXNKZTLYQYDN3RGDUKFFNLZDQRHDAVCNFSM6AAAAABEYSOU6KVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMRTHA3TCMZVG4>.
You are receiving this because you commented.
|
That makes sense for the query sequence, but why would the example E. coli data not work either? Including with the amino acid file? If I can get the example data working, then I still have hope for my query sequence. I'd just have to translate it to .faa by other means, right? |
E.coli data should work. Yes, you can check if using ecoli protein file would work. Commands are at here https://dbcan.readthedocs.io/en/latest/user_guide/index.html. There are example files here https://bcb.unl.edu/dbCAN2/download/Samples/.
…________________________________
From: JosieMainwaring ***@***.***>
Sent: Wednesday, May 22, 2024 2:58 PM
To: linnabrown/run_dbcan ***@***.***>
Cc: Yanbin Yin ***@***.***>; Comment ***@***.***>
Subject: Re: [linnabrown/run_dbcan] No hits from DIAMOND (Issue #170)
Caution: Non-NU Email
That makes sense for the query sequence, but why would the example E. coli data not work either? Including with the amino acid file? If I can get the example data working, then I still have hope for my query sequence. I'd just have to translate it to .faa by other means, right?
—
Reply to this email directly, view it on GitHub<#170 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AEXNKZXP3BDB3TLR44VCYWDZDT2HRAVCNFSM6AAAAABEYSOU6KVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMRVGY2DGOBRGY>.
You are receiving this because you commented.Message ID: ***@***.***>
|
Thanks everyone for your help. I got everything (example data & query) working just by running all the same steps on my HPC. For whatever reason Diamond was just determined to be broken on my linux. So, not solved but worked around. |
Again, highly recommend to use docker image when you confront this issue next time. Each person might change the configuration of his/her system which might ruin the installation for other software. Since docker won't ruin your linux system and it created its own linux system already @JosieMainwaring |
Hi all,
I have annotated the example E. coli K12 genome & my genome of interest using run_dbcan on a virtualbox linux system and I had no issues with errors in the code, and the output data files were produced as expected. However, in both cases, there are zero hits in the column for DIAMOND (all just '-' entries, and no hits with all 3 tools), which is unexpected for both genomes.
Does anyone know what might be causing this?
For reference, the diamond version I'm running is 2.0.11
Any help appreciated!
The text was updated successfully, but these errors were encountered: