NestLink-pipeline processes NestLink libraries sequenced by nanopore sequencing. Reads are binned according to their flycodes (UMIs), and accurate consensus sequences are calculated using Medaka. The pipeline then calls variants and produces a flycode assignment table that links each protein variant to its set of flycodes.
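To make the binning and assignment steps concrete, here is a minimal Python sketch of the idea. It is not the pipeline's implementation: the flanking sequences `LEFT_FLANK` and `RIGHT_FLANK` and the function names `bin_reads_by_flycode` and `assignment_table` are hypothetical, and it assumes the flycode sits between two fixed flanks and can be matched exactly, which noisy nanopore reads often will not allow.

```python
import re
from collections import defaultdict

# Hypothetical flanking sequences; the real ones are library-specific.
LEFT_FLANK = "GGTAGT"
RIGHT_FLANK = "TGGCGG"
FLYCODE_RE = re.compile(f"{LEFT_FLANK}([ACGT]+?){RIGHT_FLANK}")


def bin_reads_by_flycode(reads):
    """Group read sequences by the flycode found between the two flanks."""
    bins = defaultdict(list)
    for read in reads:
        match = FLYCODE_RE.search(read)
        if match:  # reads without a recognisable flycode are skipped
            bins[match.group(1)].append(read)
    return dict(bins)


def assignment_table(flycode_to_variant):
    """Invert a flycode -> variant mapping into variant -> sorted flycodes."""
    table = defaultdict(list)
    for flycode, variant in flycode_to_variant.items():
        table[variant].append(flycode)
    return {variant: sorted(codes) for variant, codes in table.items()}
```

In this sketch, each bin returned by `bin_reads_by_flycode` would be the input for one consensus calculation, and `assignment_table` shows the shape of the final result: one entry per protein variant, listing every flycode linked to it.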
Warning
NestLink-pipeline is still in development. Certain library-specific strings are still hard-coded in main.nf and have to be edited before running the pipeline.
- Mamba/Conda (https://conda-forge.org/)
- Nextflow (Installation guide)
- mini_align (mini_align.sh placed in projectDir/bin/)
- Slurm workload manager
- Singularity
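Before launching a run, it can help to confirm that the prerequisites listed above are actually available. The following is a rough sketch and not part of the repository: the executable names (`mamba`, `conda`, `nextflow`, `sbatch`, `singularity`) are assumptions about a typical installation, and `find_missing` is a hypothetical helper. It looks for mini_align.sh in the bin/ folder of the directory passed as `project_dir`.

```python
import shutil
from pathlib import Path

# Executable names assumed for each prerequisite; adjust to your installation.
REQUIRED_TOOLS = {
    "Mamba/Conda": ("mamba", "conda"),  # either one is enough
    "Nextflow": ("nextflow",),
    "Slurm": ("sbatch",),
    "Singularity": ("singularity",),
}


def find_missing(project_dir=".", tools=REQUIRED_TOOLS):
    """Return the names of prerequisites that could not be found."""
    missing = [
        name
        for name, executables in tools.items()
        if not any(shutil.which(exe) for exe in executables)
    ]
    # mini_align.sh is expected in the bin/ folder of the project directory.
    if not (Path(project_dir) / "bin" / "mini_align.sh").is_file():
        missing.append("mini_align.sh")
    return missing


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All prerequisites found.")
```

Run from the repository root, the script prints either the missing prerequisites or a confirmation that everything was found.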
- Clone the repository.
- Edit the params.json file to specify the nanopore reads (BAM) and the reference sequence.
- Run the pipeline:
sbatch run_NL-pipeline.slurm
Note
Consensus sequence generation with Medaka has to be run manually.
- Prepare the pipeline as described above.
- Run the pipeline:
bash run_NL-pipeline.sh