Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling
- PMID:27194763
- PMCID: PMC4944299
- DOI: 10.1128/JVI.00667-16
Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling
Abstract
With next-generation sequencing technologies, it is now feasible to efficiently sequence patient-derived virus populations at a depth of coverage sufficient to detect rare variants. However, each sequencing platform has characteristic error profiles, and sample collection, target amplification, and library preparation are additional processes whereby errors are introduced and propagated. Many studies account for these errors by using ad hoc quality thresholds and/or previously published statistical algorithms. Despite common usage, the majority of these approaches have not been validated under conditions that characterize many studies of intrahost diversity. Here, we use defined populations of influenza virus to mimic the diversity and titer typically found in patient-derived samples. We identified single-nucleotide variants using two commonly employed variant callers, DeepSNV and LoFreq. We found that the accuracy of these variant callers was lower than expected and exquisitely sensitive to the input titer. Small reductions in specificity had a significant impact on the number of minority variants identified and subsequent measures of diversity. We were able to increase the specificity of DeepSNV to >99.95% by applying an empirically validated set of quality thresholds. When applied to a set of influenza virus samples from a household-based cohort study, these changes resulted in a 10-fold reduction in measurements of viral diversity. We have made our sequence data and analysis code available so that others may improve on our work and use our data set to benchmark their own bioinformatics pipelines. Our work demonstrates that inadequate quality control and validation can lead to significant overestimation of intrahost diversity.
Importance: Advances in sequencing technology have made it feasible to sequence patient-derived viral samples at a level sufficient for detection of rare mutations. These high-throughput, cost-effective methods are revolutionizing the study of within-host viral diversity. However, the techniques are error prone, and the methods commonly used to control for these errors have not been validated under the conditions that characterize patient-derived samples. Here, we show that these conditions affect measurements of viral diversity. We found that the accuracy of previously benchmarked analysis pipelines was greatly reduced under patient-derived conditions. By carefully validating our sequencing analysis using known control samples, we were able to identify biases in our method and to improve our accuracy to acceptable levels. Application of our modified pipeline to a set of influenza virus samples from a cohort study provided a realistic picture of intrahost diversity and suggested the need for rigorous quality control in such studies.
Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Figures








Similar articles
- Vaccination has minimal impact on the intrahost diversity of H3N2 influenza viruses.Debbink K, McCrone JT, Petrie JG, Truscon R, Johnson E, Mantlo EK, Monto AS, Lauring AS.Debbink K, et al.PLoS Pathog. 2017 Jan 31;13(1):e1006194. doi: 10.1371/journal.ppat.1006194. eCollection 2017 Jan.PLoS Pathog. 2017.PMID:28141862Free PMC article.
- Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing.Van den Hoecke S, Verhelst J, Vuylsteke M, Saelens X.Van den Hoecke S, et al.BMC Genomics. 2015 Feb 14;16(1):79. doi: 10.1186/s12864-015-1284-z.BMC Genomics. 2015.PMID:25758772Free PMC article.
- Deep Sequencing of Influenza A Virus from a Human Challenge Study Reveals a Selective Bottleneck and Only Limited Intrahost Genetic Diversification.Sobel Leonard A, McClain MT, Smith GJ, Wentworth DE, Halpin RA, Lin X, Ransier A, Stockwell TB, Das SR, Gilbert AS, Lambkin-Williams R, Ginsburg GS, Woods CW, Koelle K.Sobel Leonard A, et al.J Virol. 2016 Nov 28;90(24):11247-11258. doi: 10.1128/JVI.01657-16. Print 2016 Dec 15.J Virol. 2016.PMID:27707932Free PMC article.
- Applying next-generation sequencing to unravel the mutational landscape in viral quasispecies.Lu IN, Muller CP, He FQ.Lu IN, et al.Virus Res. 2020 Jul 2;283:197963. doi: 10.1016/j.virusres.2020.197963. Epub 2020 Apr 9.Virus Res. 2020.PMID:32278821Free PMC article.Review.
- Within-Host Evolution of Human Influenza Virus.Xue KS, Moncla LH, Bedford T, Bloom JD.Xue KS, et al.Trends Microbiol. 2018 Sep;26(9):781-793. doi: 10.1016/j.tim.2018.02.007. Epub 2018 Mar 10.Trends Microbiol. 2018.PMID:29534854Free PMC article.Review.
Cited by
- An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar.Grubaugh ND, Gangavarapu K, Quick J, Matteson NL, De Jesus JG, Main BJ, Tan AL, Paul LM, Brackney DE, Grewal S, Gurfield N, Van Rompay KKA, Isern S, Michael SF, Coffey LL, Loman NJ, Andersen KG.Grubaugh ND, et al.Genome Biol. 2019 Jan 8;20(1):8. doi: 10.1186/s13059-018-1618-7.Genome Biol. 2019.PMID:30621750Free PMC article.
- Chikungunya virus populations experience diversity- dependent attenuation and purifying intra-vector selection in Californian Aedes aegypti mosquitoes.Riemersma KK, Coffey LL.Riemersma KK, et al.PLoS Negl Trop Dis. 2019 Nov 21;13(11):e0007853. doi: 10.1371/journal.pntd.0007853. eCollection 2019 Nov.PLoS Negl Trop Dis. 2019.PMID:31751338Free PMC article.
- Within-Host Viral Diversity: A Window into Viral Evolution.Lauring AS.Lauring AS.Annu Rev Virol. 2020 Sep 29;7(1):63-81. doi: 10.1146/annurev-virology-010320-061642. Epub 2020 Jun 8.Annu Rev Virol. 2020.PMID:32511081Free PMC article.Review.
- On the effective depth of viral sequence data.Illingworth CJR, Roy S, Beale MA, Tutill H, Williams R, Breuer J.Illingworth CJR, et al.Virus Evol. 2017 Nov 14;3(2):vex030. doi: 10.1093/ve/vex030. eCollection 2017 Jul.Virus Evol. 2017.PMID:29250429Free PMC article.
- Application of Next-Generation Sequencing to Reveal How Evolutionary Dynamics of Viral Population Shape Dengue Epidemiology.Ko HY, Salem GM, Chang GJ, Chao DY.Ko HY, et al.Front Microbiol. 2020 Jun 19;11:1371. doi: 10.3389/fmicb.2020.01371. eCollection 2020.Front Microbiol. 2020.PMID:32636827Free PMC article.Review.
References
- Andersen KG, Shapiro BJ, Matranga CB, Sealfon R, Lin AE, Moses LM, Folarin OA, Goba A, Odia I, Ehiane PE, Momoh M, England EM, Winnicki S, Branco LM, Gire SK, Phelan E, Tariyal R, Tewhey R, Omoniwa O, Fullah M, Fonnie R, Fonnie M, Kanneh L, Jalloh S, Gbakie M, Saffa S, Karbo K, Gladden AD, Qu J, Stremlau M, Nekoui M, Finucane HK, Tabrizi S, Vitti JJ, Birren B, Fitzgerald M, McCowan C, Ireland A, Berlin AM, Bochicchio J, Tazon-Vega B, Lennon NJ, Ryan EM, Bjornson Z, Milner DA Jr, Lukens AK, Broodie N, Rowland M, Heinrich M, Akdag M, Schieffelin JS, Levy D, Akpan H, Bausch DG, Rubins K, McCormick JB, Lander ES, Günther S, Hensley L, Okogbenin S, Viral Hemorrhagic Fever Consortium, Schaffner SF, Okokhere PO, Khan SH, Grant DS, Akpede GO, Asogun DA, Gnirke A, Levin JZ, Happi CT, Garry RF, Sabeti PC. 2015. Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus. Cell 162:738–750. doi:10.1016/j.cell.2015.07.020. - DOI - PMC - PubMed
- Grubaugh ND, Smith DR, Brackney DE, Bosco-Lauth AM, Fauver JR, Campbell CL, Felix TA, Romo H, Duggal NK, Dietrich EA, Eike T, Beane JE, Bowen RA, Black WC, Brault AC, Ebel GD. 2015. Experimental evolution of an RNA virus in wild birds: evidence for host-dependent impacts on population structure and competitive fitness. PLoS Pathog 11:e1004874. doi:10.1371/journal.ppat.1004874. - DOI - PMC - PubMed
- Rogers MB, Song T, Sebra R, Greenbaum BD, Hamelin M-E, Fitch A, Twaddle A, Cui L, Holmes EC, Boivin G, Ghedin E. 2015. Intrahost dynamics of antiviral resistance in influenza A virus reflect complex patterns of segment linkage, reassortment, and natural selection. mBio 6:e02464–14. doi:10.1128/mBio.02464-14. - DOI - PMC - PubMed
- Poon LLM, Song T, Rosenfeld R, Lin X, Rogers MB, Zhou B, Sebra R, Halpin RA, Guan Y, Twaddle A, DePasse JV, Stockwell TB, Wentworth DE, Holmes EC, Greenbaum B, Peiris JSM, Cowling BJ, Ghedin E. 2016. Quantifying influenza virus diversity and transmission in humans. Nat Genet 48:195–200. doi:10.1038/ng.3479. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
Related information
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical