DNA damage is a pervasive cause of sequencing errors, directly confounding variant identification

被引:157
作者
Chen, Lixin [1 ]
Liu, Pingfang [1 ]
Evans, Thomas C [1 ]
Ettwiller, Laurence M. [1 ]
机构
[1] New England Biolabs Inc, 240 Cty Rd, Ipswich, MA 01938 USA
关键词
SOMATIC MUTATION; CANCER; DISCOVERY; BASE;
D O I
10.1126/science.aai8690
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Mutations in somatic cells generate a heterogeneous genomic population and may result in serious medical conditions. Although cancer is typically associated with somatic variations, advances in DNA sequencing indicate that cell-specific variants affect a number of phenotypes and pathologies. Here, we show that mutagenic damage accounts for the majority of the erroneous identification of variants with low to moderate (1 to 5%) frequency. More important, we found signatures of damage in most sequencing data sets in widely used resources, including the 1000 Genomes Project and The Cancer Genome Atlas, establishing damage as a pervasive cause of sequencing errors. The extent of this damage directly confounds the determination of somatic variants in these data sets.
引用
收藏
页码:752 / +
页数:5
相关论文
共 16 条
[1]   Clock-like mutational processes in human somatic cells [J].
Alexandrov, Ludmil B. ;
Jones, Philip H. ;
Wedge, David C. ;
Sale, Julian E. ;
Campbell, Peter J. ;
Nik-Zainal, Serena ;
Stratton, Michael R. .
NATURE GENETICS, 2015, 47 (12) :1402-+
[2]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[3]   Genetic variegation of clonal architecture and propagating cells in leukaemia [J].
Anderson, Kristina ;
Lutz, Christoph ;
van Delft, Frederik W. ;
Bateman, Caroline M. ;
Guo, Yanping ;
Colman, Susan M. ;
Kempski, Helena ;
Moorman, Anthony V. ;
Titley, Ian ;
Swansbury, John ;
Kearney, Lyndal ;
Enver, Tariq ;
Greaves, Mel .
NATURE, 2011, 469 (7330) :356-+
[4]   PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions [J].
Bendl, Jaroslav ;
Musil, Milos ;
Stourac, Jan ;
Zendulka, Jaroslav ;
Damborsky, Jiri ;
Brezovsky, Jan .
PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (05)
[5]   Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA [J].
Briggs, Adrian W. ;
Stenzel, Udo ;
Meyer, Matthias ;
Krause, Johannes ;
Kircher, Martin ;
Paeaebo, Svante .
NUCLEIC ACIDS RESEARCH, 2010, 38 (06) :e87.1-e87.12
[6]  
CHENG KC, 1992, J BIOL CHEM, V267, P166
[7]   Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples [J].
Cibulskis, Kristian ;
Lawrence, Michael S. ;
Carter, Scott L. ;
Sivachenko, Andrey ;
Jaffe, David ;
Sougnez, Carrie ;
Gabriel, Stacey ;
Meyerson, Matthew ;
Lander, Eric S. ;
Getz, Gad .
NATURE BIOTECHNOLOGY, 2013, 31 (03) :213-219
[8]   Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation [J].
Costello, Maura ;
Pugh, Trevor J. ;
Fennell, Timothy J. ;
Stewart, Chip ;
Lichtenstein, Lee ;
Meldrim, James C. ;
Fostel, Jennifer L. ;
Friedrich, Dennis C. ;
Perrin, Danielle ;
Dionne, Danielle ;
Kim, Sharon ;
Gabriel, Stacey B. ;
Lander, Eric S. ;
Fisher, Sheila ;
Getz, Gad .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e67
[9]   Sequence Artifacts in DNA from Formalin-Fixed Tissues: Causes and Strategies for Minimization [J].
Do, Hongdo ;
Dobrovic, Alexander .
CLINICAL CHEMISTRY, 2015, 61 (01) :64-71
[10]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194