Improved detection of global copy number variation using high density, non-polymorphic oligonucleotide probes

被引:15
作者
Shen, Fan [1 ]
Huang, Jing [1 ]
Fitch, Karen R. [1 ]
Truong, Vivi B. [1 ]
Kirby, Andrew [2 ]
Chen, Wenwei [1 ]
Zhang, Jane [1 ]
Liu, Guoying [1 ]
McCarroll, Steven A. [3 ,4 ]
Jones, Keith W. [1 ]
Shapero, Michael H. [1 ]
机构
[1] Affymetrix Inc, Santa Clara, CA 95051 USA
[2] Massachusetts Gen Hosp, Ctr Human Genet Res, Boston, MA 02114 USA
[3] MIT, Broad Inst, Program Med & Populat Genet, Cambridge, MA 02142 USA
[4] Harvard Univ, Cambridge, MA 02138 USA
关键词
D O I
10.1186/1471-2156-9-27
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: DNA sequence diversity within the human genome may be more greatly affected by copy number variations (CNVs) than single nucleotide polymorphisms (SNPs). Although the importance of CNVs in genome wide association studies (GWAS) is becoming widely accepted, the optimal methods for identifying these variants are still under evaluation. We have previously reported a comprehensive view of CNVs in the HapMap DNA collection using high density 500 K EA (Early Access) SNP genotyping arrays which revealed greater than 1,000 CNVs ranging in size from 1 kb to over 3 Mb. Although the arrays used most commonly for GWAS predominantly interrogate SNPs, CNV identification and detection does not necessarily require the use of DNA probes centered on polymorphic nucleotides and may even be hindered by the dependence on a successful SNP genotyping assay. Results: In this study, we have designed and evaluated a high density array predicated on the use of nonpolymorphic oligonucleotide probes for CNV detection. This approach effectively uncouples copy number detection from SNP genotyping and thus has the potential to significantly improve probe coverage for genome-wide CNV identification. This array, in conjunction with PCR-based, complexity-reduced DNA target, queries over 1.3 M independent NspI restriction enzyme fragments in the 200 bp to 1100 bp size range, which is a several fold increase in marker density as compared to the 500 K EA array. In addition, a novel algorithm was developed and validated to extract CNV regions and boundaries. Conclusion: Using a well-characterized pair of DNA samples, close to 200 CNVs were identified, of which nearly 50% appear novel yet were independently validated using quantitative PCR. The results indicate that non-polymorphic probes provide a robust approach for CNV identification, and the increasing precision of CNV boundary delineation should allow a more complete analysis of their genomic organization.
引用
收藏
页数:18
相关论文
共 72 条
[1]   Human Genome Variation 2006: emerging views on structural variation and large-scale SNP analysis [J].
Gonçalo Abecasis ;
Paul Kwong-Hang Tam ;
Carlos D Bustamante ;
Elaine A Ostrander ;
Stephen W Scherer ;
Stephen J Chanock ;
Pui-Yan Kwok ;
Anthony J Brookes .
Nature Genetics, 2007, 39 (2) :153-155
[2]   Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans [J].
Aitman, TJ ;
Dong, R ;
Vyse, TJ ;
Norsworthy, PJ ;
Johnson, MD ;
Smith, J ;
Mangion, J ;
Roberton-Lowe, C ;
Marshall, AJ ;
Petretto, E ;
Hodges, MD ;
Bhangal, G ;
Patel, SG ;
Sheehan-Rooney, K ;
Duda, M ;
Cook, PR ;
Evans, DJ ;
Domin, J ;
Flint, J ;
Boyle, JJ ;
Pusey, CD ;
Cook, HT .
NATURE, 2006, 439 (7078) :851-855
[3]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[4]   Copy number variants and genetic traits: closer to the resolution of phenotypic to genotypic variability [J].
Beckmann, Jacques S. ;
Estivill, Xavier ;
Antonarakis, Stylianos E. .
NATURE REVIEWS GENETICS, 2007, 8 (08) :639-646
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]  
Carson Andrew R., 2006, Human Genomics, V2, P403
[7]   Methods and strategies for analyzing copy number variation using DNA microarrays [J].
Carter, Nigel P. .
NATURE GENETICS, 2007, 39 (Suppl 7) :S16-S21
[8]   Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data [J].
Carvalho, Benilton ;
Bengtsson, Henrik ;
Speed, Terence P. ;
Irizarry, Rafael A. .
BIOSTATISTICS, 2007, 8 (02) :485-499
[9]   A high-resolution survey of deletion polymorphism in the human genome [J].
Conrad, DF ;
Andrews, TD ;
Carter, NP ;
Hurles, ME ;
Pritchard, JK .
NATURE GENETICS, 2006, 38 (01) :75-81
[10]   Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarrays [J].
Di, XJ ;
Matsuzaki, H ;
Webster, TA ;
Hubbell, E ;
Liu, GY ;
Dong, SL ;
Bartell, D ;
Huang, J ;
Chiles, R ;
Yang, G ;
Shen, MM ;
Kulp, D ;
Kennedy, GC ;
Mei, R ;
Jones, KW ;
Cawley, S .
BIOINFORMATICS, 2005, 21 (09) :1958-1963