AbsCN-seq: a statistical method to estimate tumor purity, ploidy and absolute copy numbers from next-generation sequencing data

被引:47
作者
Bao, Lei [1 ]
Pu, Minya [1 ]
Messer, Karen [1 ]
机构
[1] Univ Calif San Diego, Div Biostat, Moores Canc Ctr, La Jolla, CA 92093 USA
关键词
CANCER; DISCOVERY; FRAMEWORK; MUTATION; SAMPLES;
D O I
10.1093/bioinformatics/btt759
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Detection and quantification of the absolute DNA copy number alterations in tumor cells is challenging because the DNA specimen is extracted from a mixture of tumor and normal stromal cells. Estimates of tumor purity and ploidy are necessary to correctly infer copy number, and ploidy may itself be a prognostic factor in cancer progression. As deep sequencing of the exome or genome has become routine for characterization of tumor samples, in this work, we aim to develop a simple and robust algorithm to infer purity, ploidy and absolute copy numbers in whole numbers for tumor cells from sequencing data. Results: A simulation study shows that estimates have reasonable accuracy, and that the algorithm is robust against the presence of segmentation errors and subclonal populations. We validated our algorithm against a panel of cell lines with experimentally determined ploidy. We also compared our algorithm with the well-established single-nucleotide polymorphism array-based method called ABSOLUTE on three sets of tumors of different types. Our method had good performance on these four benchmark datasets for both purity and ploidy estimates, and may offer a simple solution to copy number alteration quantification for cancer sequencing projects.
引用
收藏
页码:1056 / 1063
页数:8
相关论文
共 27 条
[1]   The Exomes of the NCI-60 Panel: A Genomic Resource for Cancer Biology and Systems Pharmacology [J].
Abaan, Ogan D. ;
Polley, Eric C. ;
Davis, Sean R. ;
Zhu, Yuelin J. ;
Bilke, Sven ;
Walker, Robert L. ;
Pineda, Marbin ;
Gindin, Yevgeniy ;
Jiang, Yuan ;
Reinhold, William C. ;
Holbeck, Susan L. ;
Simon, Richard M. ;
Doroshow, James H. ;
Pommier, Yves ;
Meltzer, Paul S. .
CANCER RESEARCH, 2013, 73 (14) :4372-4382
[2]   Genomic copy number determination in cancer cells from single nucleotide polymorphism microarrays based on quantitative genotyping corrected for aneuploidy [J].
Attiyeh, Edward F. ;
Diskin, Sharon J. ;
Attiyeh, Marc A. ;
Mosse, Yael P. ;
Hou, Cuiping ;
Jackson, Eric M. ;
Kim, Cecilia ;
Glessner, Joseph ;
Hakonarson, Hakon ;
Biegel, Jaclyn A. ;
Maris, John M. .
GENOME RESEARCH, 2009, 19 (02) :276-283
[3]   Sequence analysis of mutations and translocations across breast cancer subtypes [J].
Banerji, Shantanu ;
Cibulskis, Kristian ;
Rangel-Escareno, Claudia ;
Brown, Kristin K. ;
Carter, Scott L. ;
Frederick, Abbie M. ;
Lawrence, Michael S. ;
Sivachenko, Andrey Y. ;
Sougnez, Carrie ;
Zou, Lihua ;
Cortes, Maria L. ;
Fernandez-Lopez, Juan C. ;
Peng, Shouyong ;
Ardlie, Kristin G. ;
Auclair, Daniel ;
Bautista-Pina, Veronica ;
Duke, Fujiko ;
Francis, Joshua ;
Jung, Joonil ;
Maffuz-Aziz, Antonio ;
Onofrio, Robert C. ;
Parkin, Melissa ;
Pho, Nam H. ;
Quintanar-Jurado, Valeria ;
Ramos, Alex H. ;
Rebollar-Vega, Rosa ;
Rodriguez-Cuevas, Sergio ;
Romero-Cordoba, Sandra L. ;
Schumacher, Steven E. ;
Stransky, Nicolas ;
Thompson, Kristin M. ;
Uribe-Figueroa, Laura ;
Baselga, Jose ;
Beroukhim, Rameen ;
Polyak, Kornelia ;
Sgroi, Dennis C. ;
Richardson, Andrea L. ;
Jimenez-Sanchez, Gerardo ;
Lander, Eric S. ;
Gabriel, Stacey B. ;
Garraway, Levi A. ;
Golub, Todd R. ;
Melendez-Zajgla, Jorge ;
Toker, Alex ;
Getz, Gad ;
Hidalgo-Miranda, Alfredo ;
Meyerson, Matthew .
NATURE, 2012, 486 (7403) :405-409
[4]   TumorBoost: Normalization of allele-specific tumor copy numbers from a single pair of tumor-normal genotyping microarrays [J].
Bengtsson, Henrik ;
Neuvial, Pierre ;
Speed, Terence P. .
BMC BIOINFORMATICS, 2010, 11
[5]   The landscape of somatic copy-number alteration across human cancers [J].
Beroukhim, Rameen ;
Mermel, Craig H. ;
Porter, Dale ;
Wei, Guo ;
Raychaudhuri, Soumya ;
Donovan, Jerry ;
Barretina, Jordi ;
Boehm, Jesse S. ;
Dobson, Jennifer ;
Urashima, Mitsuyoshi ;
Mc Henry, Kevin T. ;
Pinchback, Reid M. ;
Ligon, Azra H. ;
Cho, Yoon-Jae ;
Haery, Leila ;
Greulich, Heidi ;
Reich, Michael ;
Winckler, Wendy ;
Lawrence, Michael S. ;
Weir, Barbara A. ;
Tanaka, Kumiko E. ;
Chiang, Derek Y. ;
Bass, Adam J. ;
Loo, Alice ;
Hoffman, Carter ;
Prensner, John ;
Liefeld, Ted ;
Gao, Qing ;
Yecies, Derek ;
Signoretti, Sabina ;
Maher, Elizabeth ;
Kaye, Frederic J. ;
Sasaki, Hidefumi ;
Tepper, Joel E. ;
Fletcher, Jonathan A. ;
Tabernero, Josep ;
Baselga, Jose ;
Tsao, Ming-Sound ;
Demichelis, Francesca ;
Rubin, Mark A. ;
Janne, Pasi A. ;
Daly, Mark J. ;
Nucera, Carmelo ;
Levine, Ross L. ;
Ebert, Benjamin L. ;
Gabriel, Stacey ;
Rustgi, Anil K. ;
Antonescu, Cristina R. ;
Ladanyi, Marc ;
Letai, Anthony .
NATURE, 2010, 463 (7283) :899-905
[6]   Absolute quantification of somatic DNA alterations in human cancer [J].
Carter, Scott L. ;
Cibulskis, Kristian ;
Helman, Elena ;
McKenna, Aaron ;
Shen, Hui ;
Zack, Travis ;
Laird, Peter W. ;
Onofrio, Robert C. ;
Winckler, Wendy ;
Weir, Barbara A. ;
Beroukhim, Rameen ;
Pellman, David ;
Levine, Douglas A. ;
Lander, Eric S. ;
Meyerson, Matthew ;
Getz, Gad .
NATURE BIOTECHNOLOGY, 2012, 30 (05) :413-+
[7]   Comprehensive genomic characterization defines human glioblastoma genes and core pathways [J].
Chin, L. ;
Meyerson, M. ;
Aldape, K. ;
Bigner, D. ;
Mikkelsen, T. ;
VandenBerg, S. ;
Kahn, A. ;
Penny, R. ;
Ferguson, M. L. ;
Gerhard, D. S. ;
Getz, G. ;
Brennan, C. ;
Taylor, B. S. ;
Winckler, W. ;
Park, P. ;
Ladanyi, M. ;
Hoadley, K. A. ;
Verhaak, R. G. W. ;
Hayes, D. N. ;
Spellman, Paul T. ;
Absher, D. ;
Weir, B. A. ;
Ding, L. ;
Wheeler, D. ;
Lawrence, M. S. ;
Cibulskis, K. ;
Mardis, E. ;
Zhang, Jinghui ;
Wilson, R. K. ;
Donehower, L. ;
Wheeler, D. A. ;
Purdom, E. ;
Wallis, J. ;
Laird, P. W. ;
Herman, J. G. ;
Schuebel, K. E. ;
Weisenberger, D. J. ;
Baylin, S. B. ;
Schultz, N. ;
Yao, Jun ;
Wiedemeyer, R. ;
Weinstein, J. ;
Sander, C. ;
Gibbs, R. A. ;
Gray, J. ;
Kucherlapati, R. ;
Lander, E. S. ;
Myers, R. M. ;
Perou, C. M. ;
McLendon, Roger .
NATURE, 2008, 455 (7216) :1061-1068
[8]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[9]   PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data [J].
Greenman, Chris D. ;
Bignell, Graham ;
Butler, Adam ;
Edkins, Sarah ;
Hinton, Jon ;
Beare, Dave ;
Swamy, Sajani ;
Santarius, Thomas ;
Chen, Lina ;
Widaa, Sara ;
Futreal, P. Andy ;
Stratton, Michael R. .
BIOSTATISTICS, 2010, 11 (01) :164-175
[10]   Correcting for cancer genome size and tumour cell content enables better estimation of copy number alterations from next-generation sequence data [J].
Gusnanto, Arief ;
Wood, Henry M. ;
Pawitan, Yudi ;
Rabbitts, Pamela ;
Berri, Stefano .
BIOINFORMATICS, 2012, 28 (01) :40-47