ContEst: estimating cross-contamination of human samples in next-generation sequencing data

被引:184
作者
Cibulskis, Kristian [1 ]
McKenna, Aaron [1 ]
Fennell, Tim [1 ]
Banks, Eric [1 ]
DePristo, Mark [1 ]
Getz, Gad [1 ]
机构
[1] Broad Inst, Genome Sequencing Anal Program & Platform, Cambridge, MA 02142 USA
关键词
D O I
10.1093/bioinformatics/btr446
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Here, we present ContEst, a tool for estimating the level of cross-individual contamination in next-generation sequencing data. We demonstrate the accuracy of ContEst across a range of contamination levels, sources and read depths using sequencing data mixed in silico at known concentrations. We applied our tool to published cancer sequencing datasets and report their estimated contamination levels.
引用
收藏
页码:2601 / 2602
页数:2
相关论文
共 7 条
  • [1] Integrated genomic analyses of ovarian carcinoma
    Bell, D.
    Berchuck, A.
    Birrer, M.
    Chien, J.
    Cramer, D. W.
    Dao, F.
    Dhir, R.
    DiSaia, P.
    Gabra, H.
    Glenn, P.
    Godwin, A. K.
    Gross, J.
    Hartmann, L.
    Huang, M.
    Huntsman, D. G.
    Iacocca, M.
    Imielinski, M.
    Kalloger, S.
    Karlan, B. Y.
    Levine, D. A.
    Mills, G. B.
    Morrison, C.
    Mutch, D.
    Olvera, N.
    Orsulic, S.
    Park, K.
    Petrelli, N.
    Rabeno, B.
    Rader, J. S.
    Sikic, B. I.
    Smith-McCune, K.
    Sood, A. K.
    Bowtell, D.
    Penny, R.
    Testa, J. R.
    Chang, K.
    Dinh, H. H.
    Drummond, J. A.
    Fowler, G.
    Gunaratne, P.
    Hawes, A. C.
    Kovar, C. L.
    Lewis, L. R.
    Morgan, M. B.
    Newsham, I. F.
    Santibanez, J.
    Reid, J. G.
    Trevino, L. R.
    Wu, Y. -Q.
    Wang, M.
    [J]. NATURE, 2011, 474 (7353) : 609 - 615
  • [2] The genomic complexity of primary human prostate cancer
    Berger, Michael F.
    Lawrence, Michael S.
    Demichelis, Francesca
    Drier, Yotam
    Cibulskis, Kristian
    Sivachenko, Andrey Y.
    Sboner, Andrea
    Esgueva, Raquel
    Pflueger, Dorothee
    Sougnez, Carrie
    Onofrio, Robert
    Carter, Scott L.
    Park, Kyung
    Habegger, Lukas
    Ambrogio, Lauren
    Fennell, Timothy
    Parkin, Melissa
    Saksena, Gordon
    Voet, Douglas
    Ramos, Alex H.
    Pugh, Trevor J.
    Wilkinson, Jane
    Fisher, Sheila
    Winckler, Wendy
    Mahan, Scott
    Ardlie, Kristin
    Baldwin, Jennifer
    Simons, Jonathan W.
    Kitabayashi, Naoki
    MacDonald, Theresa Y.
    Kantoff, Philip W.
    Chin, Lynda
    Gabriel, Stacey B.
    Gerstein, Mark B.
    Golub, Todd R.
    Meyerson, Matthew
    Tewari, Ashutosh
    Lander, Eric S.
    Getz, Gad
    Rubin, Mark A.
    Garraway, Levi A.
    [J]. NATURE, 2011, 470 (7333) : 214 - 220
  • [3] Initial genome sequencing and analysis of multiple myeloma
    Chapman, Michael A.
    Lawrence, Michael S.
    Keats, Jonathan J.
    Cibulskis, Kristian
    Sougnez, Carrie
    Schinzel, Anna C.
    Harview, Christina L.
    Brunet, Jean-Philippe
    Ahmann, Gregory J.
    Adli, Mazhar
    Anderson, Kenneth C.
    Ardlie, Kristin G.
    Auclair, Daniel
    Baker, Angela
    Bergsagel, P. Leif
    Bernstein, Bradley E.
    Drier, Yotam
    Fonseca, Rafael
    Gabriel, Stacey B.
    Hofmeister, Craig C.
    Jagannath, Sundar
    Jakubowiak, Andrzej J.
    Krishnan, Amrita
    Levy, Joan
    Liefeld, Ted
    Lonial, Sagar
    Mahan, Scott
    Mfuko, Bunmi
    Monti, Stefano
    Perkins, Louise M.
    Onofrio, Robb
    Pugh, Trevor J.
    Rajkumar, S. Vincent
    Ramos, Alex H.
    Siegel, David S.
    Sivachenko, Andrey
    Stewart, A. Keith
    Trudel, Suzanne
    Vij, Ravi
    Voet, Douglas
    Winckler, Wendy
    Zimmerman, Todd
    Carpten, John
    Trent, Jeff
    Hahn, William C.
    Garraway, Levi A.
    Meyerson, Matthew
    Lander, Eric S.
    Getz, Gad
    Golub, Todd R.
    [J]. NATURE, 2011, 471 (7339) : 467 - 472
  • [4] Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing
    Gnirke, Andreas
    Melnikov, Alexandre
    Maguire, Jared
    Rogov, Peter
    LeProust, Emily M.
    Brockman, William
    Fennell, Timothy
    Giannoukos, Georgia
    Fisher, Sheila
    Russ, Carsten
    Gabriel, Stacey
    Jaffe, David B.
    Lander, Eric S.
    Nusbaum, Chad
    [J]. NATURE BIOTECHNOLOGY, 2009, 27 (02) : 182 - 189
  • [5] The Sequence Alignment/Map format and SAMtools
    Li, Heng
    Handsaker, Bob
    Wysoker, Alec
    Fennell, Tim
    Ruan, Jue
    Homer, Nils
    Marth, Gabor
    Abecasis, Goncalo
    Durbin, Richard
    [J]. BIOINFORMATICS, 2009, 25 (16) : 2078 - 2079
  • [6] The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data
    McKenna, Aaron
    Hanna, Matthew
    Banks, Eric
    Sivachenko, Andrey
    Cibulskis, Kristian
    Kernytsky, Andrew
    Garimella, Kiran
    Altshuler, David
    Gabriel, Stacey
    Daly, Mark
    DePristo, Mark A.
    [J]. GENOME RESEARCH, 2010, 20 (09) : 1297 - 1303
  • [7] How to map billions of short reads onto genomes
    Trapnell, Cole
    Salzberg, Steven L.
    [J]. NATURE BIOTECHNOLOGY, 2009, 27 (05) : 455 - 457