Creating necessary files for Fasta Reference

When writing a genomics pipeline, it is very useful to have downsampled test data so that changes to the pipeline can be made and tested quickly.

See "Generating test data" for how to get the initial FASTQs.

Next, follow these steps to create the supplementary files required by GATK, Picard, and similar tools.

Create sequence dictionary

java -jar ~/Desktop/vendor_tools/picard-2.8.1.jar CreateSequenceDictionary R=Homo_sapiens_assembly19-14-105258897-105259017.fasta O=Homo_sapiens_assembly19-14-105258897-105259017.fasta.dict

Create BWA index (Burrows-Wheeler transform & other required files)

bwa index Homo_sapiens_assembly19-14-105258897-105259017.fasta

Create fasta index

samtools faidx Homo_sapiens_assembly19-14-105258897-105259017.fasta
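
Once the three commands above have run, it can be handy to confirm that every companion file landed next to the reference before pointing a pipeline at it. The following is a minimal sketch, not part of the original workflow: the function name check_reference_files is hypothetical, and it assumes the dictionary was written as <fasta>.dict (as in the Picard command above), the index as <fasta>.fai, and the BWA index as the usual .amb, .ann, .bwt, .pac and .sa files.

```shell
# Hypothetical helper (not part of Picard, BWA or samtools):
# report which companion files are missing for a reference FASTA.
# Assumes every companion file is named <fasta>.<extension>.
check_reference_files() {
    rc=0
    # dict: Picard CreateSequenceDictionary (as named in the command above)
    # fai: samtools faidx
    # amb ann bwt pac sa: bwa index
    for ext in dict fai amb ann bwt pac sa; do
        # -s is true only if the file exists and is not empty
        if [ ! -s "$1.$ext" ]; then
            echo "missing: $1.$ext"
            rc=1
        fi
    done
    return "$rc"
}
```

For example, check_reference_files Homo_sapiens_assembly19-14-105258897-105259017.fasta prints one line per missing file and returns a non-zero status if any are absent, so it can be chained with && in front of a pipeline run.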
