Skip to content

Latest commit

 

History

History
198 lines (186 loc) · 5.39 KB

performance_somatic.md

File metadata and controls

198 lines (186 loc) · 5.39 KB

megSAP benchmarks

tumor-normal short-read pipline

All performance benchmarks are performed on the GIAB reference samples NA12878 using the gold-standard variant list v4.2.1 and the NA12877 using the PlatimunGenomes variant list. The two samples were mixed to prepare a in-silico tumor sample with a specified tumor content of 5, 10, 20 or 40 %. The samples were mapped with the short-read single sample pipeline on the GRCh38 reference genome with masked false duplications and the calling was done using short-read tumor normal pipeline.

Sensitivity and positive predictive value (PPV) were measured using our somatic validation tool.

Whole exome sequencing

The WES samples were processed with a custom exome kit based on a Twist enrichment (Core, RefSeq, Mito and custom content) and sequenced on NovaSeq X Plus using 151PE. The normal samples had a depth of 110x while the mixed 'tumor' samples had a depth of 220x. The benchmarks were performed on GIAB / PlatinumGenomes high-confidence regions with at least 60x coverage.

Strelka2 calling

Test - BWA-MEM2 + Strelka2 calling SNVs InDels SNVs+InDels
sensitivity PPV sensitivity PPV sensitivity PPV
Variants >= 5% allele freq 90.37% 99.21% 45.58% 89.10% 88.32% 98.95%
Variants >= 10% allele freq 96.82% 99.79% 69.42% 97.83% 95.57% 99.73%
Variants >= 20% allele freq 98.55% 99.98% 77.12% 99.50% 97.57% 99.96%
Test - Dragen mapping + Strelka2 calling SNVs InDels SNVs+InDels
sensitivity PPV sensitivity PPV sensitivity PPV
Variants >= 5% allele freq 90.12% 99.49% 45.77% 90.15% 88.09% 99.25%
Variants >= 10% allele freq 96.61% 99.84% 69.42% 97.57% 95.37% 99.76%
Variants >= 20% allele freq 98.34% 99.98% 76.92% 99.50% 97.36% 99.96%

Dragen calling

Test - BWA-MEM2 + Dragen calling SNVs InDels SNVs+InDels
sensitivity PPV sensitivity PPV sensitivity PPV
Variants >= 5% allele freq 93.78% 98.61% 60.39% 95.44% 92.26% 98.51%
Variants >= 10% allele freq 98.29% 99.66% 75.19% 98.24% 97.23% 99.61%
Variants >= 20% allele freq 99.07% 99.99% 88.27% 99.78% 98.58% 99.98%
Test - Dragen mapping + Dragen calling SNVs InDels SNVs+InDels
sensitivity PPV sensitivity PPV sensitivity PPV
Variants >= 5% allele freq 94.14% 98.84% 63.01% 92.14% 92.72% 98.62%
Variants >= 10% allele freq 98.32% 99.74% 79.04% 97.39% 97.44% 99.65%
Variants >= 20% allele freq 99.05% 99.99% 91.73% 99.38% 98.72% 99.96%

Conclusion

While the mapping has only a small influence on the sensitivity and precision. The dragen calling improves the sensitivity especially for variants with low allel frequency. The sensitivity for InDel variants with Dragen calling improves over Strelka2 by over 10% and for SNVs by ~2%.