CANOES: detecting rare copy number variants from whole exome sequencing data

被引:110
作者
Backenroth, Daniel [1 ,2 ,3 ]
Homsy, Jason [4 ]
Murillo, Laura R. [5 ,6 ,7 ]
Glessner, Joe [8 ]
Lin, Edwin [1 ,2 ,9 ,10 ]
Brueckner, Martina [11 ]
Lifton, Richard [11 ,12 ]
Goldmuntz, Elizabeth [13 ]
Chung, Wendy K. [9 ,10 ]
Shen, Yufeng [1 ,2 ,3 ]
机构
[1] Columbia Univ, Med Ctr, Dept Syst Biol, New York, NY 10032 USA
[2] Columbia Univ, Med Ctr, Dept Biomed Informat, New York, NY 10032 USA
[3] Columbia Univ, Med Ctr, JP Sulzberger Columbia Genome Ctr, New York, NY 10032 USA
[4] Massachusetts Gen Hosp, Cardiovascular Res Ctr, Boston, MA 02114 USA
[5] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
[6] Icahn Sch Med Mt Sinai, Dept Pediat & Genet, Boston, MA 02115 USA
[7] Icahn Sch Med Mt Sinai, Dept Gen Sci, Boston, MA 02115 USA
[8] Childrens Hosp Philadelphia, Ctr Appl Gen, Philadelphia, PA 19104 USA
[9] Columbia Univ, Med Ctr, Dept Pediat, New York, NY 10032 USA
[10] Columbia Univ, Med Ctr, Dept Med, New York, NY 10032 USA
[11] Yale Univ, Sch Med, Dept Genet, New Haven, CT 06510 USA
[12] Yale Univ, Howard Hughes Med Inst, New Haven, CT 06510 USA
[13] Univ Penn, Perelman Sch Med, Dept Pediat, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
GENOME-WIDE; MODEL; DISCOVERY; SNP;
D O I
10.1093/nar/gku345
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
We present CANOES, an algorithm for the detection of rare copy number variants from exome sequencing data. CANOES models read counts using a negative binomial distribution and estimates variance of the read counts using a regression-based approach based on selected reference samples in a given dataset. We test CANOES on a family-based exome sequencing dataset, and show that its sensitivity and specificity is comparable to that of XHMM. Moreover, the method is complementary to Gaussian approximation-based methods (e.g. XHMM or CoNIFER). When CANOES is used in combination with these methods, it will be possible to produce high accuracy calls, as demonstrated by a much reduced and more realistic de novo rate in results from trio data.
引用
收藏
页数:9
相关论文
共 22 条
[1]
Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]
Brennecke P, 2013, NAT METHODS, V10, P1093, DOI [10.1038/NMETH.2645, 10.1038/nmeth.2645]
[3]
The Autism Sequencing Consortium: Large-Scale, High-Throughput Sequencing in Autism Spectrum Disorders [J].
Buxbaum, Joseph D. ;
Daly, Mark J. ;
Devlin, Bernie ;
Lehner, Thomas ;
Roeder, Kathryn ;
State, Matthew W. .
NEURON, 2012, 76 (06) :1052-1056
[4]
Performance comparison of exome DNA sequencing technologies [J].
Clark, Michael J. ;
Chen, Rui ;
Lam, Hugo Y. K. ;
Karczewski, Konrad J. ;
Chen, Rong ;
Euskirchen, Ghia ;
Butte, Atul J. ;
Snyder, Michael .
NATURE BIOTECHNOLOGY, 2011, 29 (10) :908-U206
[5]
A genetic model for neurodevelopmental disease [J].
Coe, Bradley P. ;
Girirajan, Santhosh ;
Eichler, Evan E. .
CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (05) :829-836
[6]
Discovery and Statistical Genotyping of Copy-Number Variation from Whole-Exome Sequencing Depth [J].
Fromer, Menachem ;
Moran, Jennifer L. ;
Chambert, Kimberly ;
Banks, Eric ;
Bergen, Sarah E. ;
Ruderfer, Douglas M. ;
Handsaker, Robert E. ;
McCarroll, Steven A. ;
O'Donovan, Michael C. ;
Owen, Michael J. ;
Kirov, George ;
Sullivan, Patrick F. ;
Hultman, Christina M. ;
Sklar, Pamela ;
Purcell, Shaun M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (04) :597-607
[7]
Refinement and Discovery of New Hotspots of Copy-Number Variation Associated with Autism Spectrum Disorder [J].
Girirajan, Santhosh ;
Dennis, Megan Y. ;
Baker, Carl ;
Malig, Maika ;
Coe, Bradley P. ;
Campbell, Catarina D. ;
Mark, Kenneth ;
Vu, Tiffany H. ;
Alkan, Can ;
Cheng, Ze ;
Biesecker, Leslie G. ;
Bernier, Raphael ;
Eichler, Evan E. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 92 (02) :221-237
[8]
Human Copy Number Variation and Complex Genetic Disease [J].
Girirajan, Santhosh ;
Campbell, Catarina D. ;
Eichler, Evan E. .
ANNUAL REVIEW OF GENETICS, VOL 45, 2011, 45 :203-226
[9]
Copy number variation detection and genotyping from exome sequence data [J].
Krumm, Niklas ;
Sudmant, Peter H. ;
Ko, Arthur ;
O'Roak, Brian J. ;
Malig, Maika ;
Coe, Bradley P. ;
Quinlan, Aaron R. ;
Nickerson, Deborah A. ;
Eichler, Evan E. .
GENOME RESEARCH, 2012, 22 (08) :1525-1532
[10]
Modeling Read Counts for CNV Detection in Exome Sequencing Data [J].
Love, Michael I. ;
Mysickova, Alena ;
Sun, Ruping ;
Kalscheuer, Vera ;
Vingron, Martin ;
Haas, Stefan A. .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)