Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IndexError #5

Open
khyati-sb opened this issue Sep 8, 2021 · 2 comments
Open

IndexError #5

khyati-sb opened this issue Sep 8, 2021 · 2 comments

Comments

@khyati-sb
Copy link

Hi

I am trying to use your script VCF2fasta.py, and as per the requirements I have used indexed and compressed files for reference genome as well as vcf file. My vcf file is multi-sample file. Also, Gff file used here is from NCBI Refseq database. I also removed comments from gif file as described in Issue3 Yet, somehow I am getting the following error while running:

python /home/bcgrc/bin/vcf2fasta/vcf2fasta.py -f Ref_70-15.fa -g Ref_70-15.gff -e gene -v all_snps.vcf.gz

Reading GFF file [Ref_70-15.gff] ... Traceback (most recent call last):
File "/home/bcgrc/bin/vcf2fasta/vcf2fasta.py", line 470, in
main()
File "/home/bcgrc/bin/vcf2fasta/vcf2fasta.py", line 78, in main
gff = filterFeatureInGFF(ReadGFF(args.gff), args.feat)
File "/home/bcgrc/bin/vcf2fasta/vcf2fasta.py", line 405, in ReadGFF
geneNames = getGeneNames(file)
File "/home/bcgrc/bin/vcf2fasta/vcf2fasta.py", line 378, in getGeneNames
last = processGeneName(fields[8])
IndexError: list index out of range

Also, I am trying to convert vcf to Fasta for all the region, Not just restricted to gene or CDS.
Your help is much appreciated.

@skaul31
Copy link

skaul31 commented May 19, 2022

I have been having this same issue and have not been able to solve it. Were you able to figure out what the problem was?

@vkulaganathan
Copy link

vkulaganathan commented Jul 8, 2022

Check if there are mismatches among reference.fa, the gff file & your input vcf.gz files in terms of Chromosome position entries.
In all the files, there should be consistency in the nomenclature of CHR position entries and gene id nomenclature.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants