Minimum contig length for alternate code prediction #2

Open
snayfach opened this issue Aug 6, 2022 · 0 comments

snayfach commented Aug 6, 2022

A feature request to specify the minimum contig length for which prodigal-gv should try to predict a non-standard genetic code. I suggest using 10 kb as the default. I believe this was your suggestion from our discussion, but I wanted to create an issue to track it.
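To make the request concrete, here is a rough sketch of the behaviour I have in mind. It is not prodigal-gv code: `select_genetic_code`, `DEFAULT_MIN_LENGTH`, and the parameter names are all hypothetical, and I am assuming the fallback for short contigs would be translation table 11 (Prodigal's default).

```python
# Hypothetical sketch of the requested length gate; not prodigal-gv's actual code.
STANDARD_CODE = 11           # Prodigal's default translation table
DEFAULT_MIN_LENGTH = 10_000  # suggested default of 10 kb, in bp


def select_genetic_code(contig_length, predicted_code,
                        min_length=DEFAULT_MIN_LENGTH):
    """Return the genetic code to use for a contig.

    contig_length  -- contig length in bp
    predicted_code -- code the alternative-code prediction would pick
    min_length     -- contigs shorter than this keep the standard code
    """
    if contig_length < min_length:
        # Too short to trust an alternative-code call: keep the standard code.
        return STANDARD_CODE
    return predicted_code
```

With this gate, a 5 kb contig would always be called with the standard code, while a contig of 10 kb or longer would keep whatever code was predicted for it. The threshold would ideally be exposed as a user-settable option.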

I looked at the rate at which prodigal-gv predicts alternative codes in IMG/VR data. In large contigs (> 20 kb), prodigal-gv predicts alternative codes for ~1.5% of viral contigs. This increases to 2.5% for contigs < 10 kb, 3.3% for < 5 kb, and 5.4% for < 2.5 kb. My hunch is that most of the alternative code predictions for short contigs are false positives (FPs).
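For anyone who wants to repeat this kind of check on their own data, the tabulation can be sketched roughly as below. The function name `alt_code_rate` and the `(length, code)` input format are assumptions for illustration, and `toy_contigs` is invented purely to show the call; it is not IMG/VR data.

```python
# Hypothetical sketch: fraction of contigs assigned a non-standard code
# within a length range. Input is an iterable of (length_bp, predicted_code).
def alt_code_rate(contigs, min_length=None, max_length=None, standard_code=11):
    """Return the fraction of contigs with a non-standard predicted code.

    Only contigs with min_length < length < max_length are counted
    (either bound may be None). Returns None if no contig qualifies.
    """
    selected = [
        code
        for length, code in contigs
        if (min_length is None or length > min_length)
        and (max_length is None or length < max_length)
    ]
    if not selected:
        return None
    return sum(code != standard_code for code in selected) / len(selected)


# Invented toy input, only to demonstrate usage.
toy_contigs = [
    (25_000, 11), (30_000, 11),
    (8_000, 15), (4_000, 11),
    (2_000, 4), (1_500, 11),
]

for label, kwargs in [
    ("> 20 kb", {"min_length": 20_000}),
    ("< 10 kb", {"max_length": 10_000}),
    ("< 5 kb", {"max_length": 5_000}),
    ("< 2.5 kb", {"max_length": 2_500}),
]:
    print(label, alt_code_rate(toy_contigs, **kwargs))
```

Note that the "<" bins are cumulative, matching how the numbers above are reported, so a 2 kb contig is counted in all three of the short-contig bins.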
