CNV-seq, a new method to detect copy number variation using high-throughput sequencing

被引:436
作者
Xie, Chao [1 ]
Tammi, Martti T. [1 ,2 ,3 ]
机构
[1] Natl Univ Singapore, Dept Biol Sci, Singapore 117548, Singapore
[2] Karolinska Inst, Dept Microbiol Tumor & Cell Biol, Stockholm, Sweden
[3] Natl Univ Singapore, Dept Biochem, Singapore 117548, Singapore
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
COMPARATIVE GENOMIC HYBRIDIZATION; HIGH-RESOLUTION; DNA; DISCOVERY; REVEALS; RATIO;
D O I
10.1186/1471-2105-10-80
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: DNA copy number variation (CNV) has been recognized as an important source of genetic variation. Array comparative genomic hybridization (aCGH) is commonly used for CNV detection, but the microarray platform has a number of inherent limitations. Results: Here, we describe a method to detect copy number variation using shotgun sequencing, CNV-seq. The method is based on a robust statistical model that describes the complete analysis procedure and allows the computation of essential confidence values for detection of CNV. Our results show that the number of reads, not the length of the reads is the key factor determining the resolution of detection. This favors the next-generation sequencing methods that rapidly produce large amount of short reads. Conclusion: Simulation of various sequencing methods with coverage between 0.1x to 8x show overall specificity between 91.7 - 99.9%, and sensitivity between 72.2 - 96.5%. We also show the results for assessment of CNV between two individual human genomes.
引用
收藏
页数:9
相关论文
共 36 条
[1]   Whole-genome re-sequencing [J].
Bentley, David R. .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2006, 16 (06) :545-552
[2]   An overview of ensembl [J].
Birney, E ;
Andrews, TD ;
Bevan, P ;
Caccamo, M ;
Chen, Y ;
Clarke, L ;
Coates, G ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Down, T ;
Eyras, E ;
Fernandez-Suarez, XM ;
Gane, P ;
Gibbins, B ;
Gilbert, J ;
Hammond, M ;
Hotz, HR ;
Iyer, V ;
Jekosch, K ;
Kahari, A ;
Kasprzyk, A ;
Keefe, D ;
Keenan, S ;
Lehvaslaiho, H ;
McVicker, G ;
Melsopp, C ;
Meidl, P ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Proctor, G ;
Rae, M ;
Searle, S ;
Slater, G ;
Smedley, D ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Storey, R ;
Ureta-Vidal, A ;
Woodwark, KC ;
Cameron, G ;
Durbin, R ;
Cox, A ;
Hubbard, T ;
Clamp, M .
GENOME RESEARCH, 2004, 14 (05) :925-928
[3]   Methods and strategies for analyzing copy number variation using DNA microarrays [J].
Carter, Nigel P. .
NATURE GENETICS, 2007, 39 (Suppl 7) :S16-S21
[4]   Mapping translocation breakpoints by next-generation sequencing [J].
Chen, Wei ;
Kalscheuer, Vera ;
Tzschach, Andreas ;
Menzel, Corinna ;
Ullmann, Reinhard ;
Schulz, Marcel Holger ;
Erdogan, Fikret ;
Li, Na ;
Kijas, Zofia ;
Arkesteijn, Ger ;
Pajares, Isidora Lopez ;
Goetz-Sothmann, Margret ;
Heinrich, Uwe ;
Rost, Imma ;
Dufke, Andreas ;
Grasshoff, Ute ;
Glaeser, Birgitta ;
Vingron, Martin ;
Ropers, H. Hilger .
GENOME RESEARCH, 2008, 18 (07) :1143-1149
[5]   Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning [J].
Cokus, Shawn J. ;
Feng, Suhua ;
Zhang, Xiaoyu ;
Chen, Zugen ;
Merriman, Barry ;
Haudenschild, Christian D. ;
Pradhan, Sriharsa ;
Nelson, Stanley F. ;
Pellegrini, Matteo ;
Jacobsen, Steven E. .
NATURE, 2008, 452 (7184) :215-219
[6]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[7]   RATIO OF 2 NORMALLY DISTRIBUTED VARIABLES [J].
HAYYA, J ;
ARMSTRONG, D ;
GRESSIS, N .
MANAGEMENT SCIENCE SERIES A-THEORY, 1975, 21 (11) :1338-1341
[8]   Whole-genome sequencing and variant discovery in C-elegans [J].
Hillier, LaDeana W. ;
Marth, Gabor T. ;
Quinlan, Aaron R. ;
Dooling, David ;
Fewell, Ginger ;
Barnett, Derek ;
Fox, Paul ;
Glasscock, Jarret I. ;
Hickenbotham, Matthew ;
Huang, Weichun ;
Magrini, Vincent J. ;
Richt, Ryan J. ;
Sander, Sacha N. ;
Stewart, Donald A. ;
Stromberg, Michael ;
Tsung, Eric F. ;
Wylie, Todd ;
Schedl, Tim ;
Wilson, Richard K. ;
Mardis, Elaine R. .
NATURE METHODS, 2008, 5 (02) :183-188
[9]   ON RATIO OF 2 CORRELATED NORMAL RANDOM VARIABLES [J].
HINKLEY, DV .
BIOMETRIKA, 1969, 56 (03) :635-&
[10]   Detection of large-scale variation in the human genome [J].
Iafrate, AJ ;
Feuk, L ;
Rivera, MN ;
Listewnik, ML ;
Donahoe, PK ;
Qi, Y ;
Scherer, SW ;
Lee, C .
NATURE GENETICS, 2004, 36 (09) :949-951