CNstream: A method for the identification and genotyping of copy number polymorphisms using Illumina microarrays

被引:14
作者
Alonso, Arnald [1 ]
Julia, Antonio [1 ]
Tortosa, Rauel [1 ]
Canaleta, Cristina [1 ]
Canete, Juan D. [2 ]
Ballina, Javier [3 ]
Balsa, Alejandro [4 ]
Tornero, Jesus [5 ]
Marsal, Sara [1 ]
机构
[1] Hosp Univ Vall Hebron UAB, Grp Recerca Reumatol, Inst Recerca, Barcelona, Spain
[2] Hosp Clin Barcelona, Barcelona, Spain
[3] Hosp Univ Cent Asturias, Oviedo, Asturias, Spain
[4] Hosp Univ La Paz, Madrid, Spain
[5] Hosp Univ Guadalajara, Castilla La Mancha, Spain
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
STRUCTURAL VARIATION; FINE-SCALE; GENOME; ASSOCIATION; SEGMENTATION; ALGORITHM;
D O I
10.1186/1471-2105-11-264
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Understanding the genetic basis of disease risk in depth requires an exhaustive knowledge of the types of genetic variation. Very recently, Copy Number Variants (CNVs) have received much attention because of their potential implication in common disease susceptibility. Copy Number Polymorphisms (CNPs) are of interest as they segregate at an appreciable frequency in the general population (i.e. > 1%) and are potentially implicated in the genetic basis of common diseases. Results: This paper concerns CNstream, a method for whole-genome CNV discovery and genotyping, using Illumina Beadchip arrays. Compared with other methods, a high level of accuracy was achieved by analyzing the measures of each intensity channel separately and combining information from multiple samples. The CNstream method uses heuristics and parametrical statistics to assign a confidence score to each sample at each probe; the sensitivity of the analysis is increased by jointly calling the copy number state over a set of nearby and consecutive probes. The present method has been tested on a real dataset of 575 samples genotyped using Illumina HumanHap 300 Beadchip, and demonstrates a high correlation with the Database of Genomic Variants (DGV). The same set of samples was analyzed with PennCNV, one of the most frequently used copy number inference methods for Illumina platforms. CNstream was able to identify CNP loci that are not detected by PennCNV and it increased the sensitivity over multiple other loci in the genome. Conclusions: CNstream is a useful method for the identification and characterization of CNPs using Illumina genotyping microarrays. Compared to the PennCNV method, it has greater sensitivity over multiple CNP loci and allows more powerful statistical analysis in these regions. Therefore, CNstream is a robust CNP analysis tool of use to researchers performing genome-wide association studies (GWAS) on Illumina platforms and aiming to identify CNVs associated with the variables of interest. CNstream has been implemented as an R statistical software package that can work directly from raw intensity files generated from Illumina GWAS projects. The method is available at http://www.urr.cat/cnv/cnstream.html.
引用
收藏
页数:18
相关论文
共 37 条
[11]   A defect in the TUSC3 gene is associated with autosomal recessive mental retardation [J].
Garshasbi, Masoud ;
Hadavi, Valeh ;
Habibi, Haleh ;
Kahrizi, Kimia ;
Kariminejad, Roxana ;
Behjati, Farkhondeh ;
Tzschach, Andreas ;
Najmabadi, Hossein ;
Ropers, Hans Hilger ;
Kuss, Andreas Walter .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (05) :1158-1164
[12]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[13]   Autism genome-wide copy number variation reveals ubiquitin and neuronal genes [J].
Glessner, Joseph T. ;
Wang, Kai ;
Cai, Guiqing ;
Korvatska, Olena ;
Kim, Cecilia E. ;
Wood, Shawn ;
Zhang, Haitao ;
Estes, Annette ;
Brune, Camille W. ;
Bradfield, Jonathan P. ;
Imielinski, Marcin ;
Frackelton, Edward C. ;
Reichert, Jennifer ;
Crawford, Emily L. ;
Munson, Jeffrey ;
Sleiman, Patrick M. A. ;
Chiavacci, Rosetta ;
Annaiah, Kiran ;
Thomas, Kelly ;
Hou, Cuiping ;
Glaberson, Wendy ;
Flory, James ;
Otieno, Frederick ;
Garris, Maria ;
Soorya, Latha ;
Klei, Lambertus ;
Piven, Joseph ;
Meyer, Kacie J. ;
Anagnostou, Evdokia ;
Sakurai, Takeshi ;
Game, Rachel M. ;
Rudd, Danielle S. ;
Zurawiecki, Danielle ;
McDougle, Christopher J. ;
Davis, Lea K. ;
Miller, Judith ;
Posey, David J. ;
Michaels, Shana ;
Kolevzon, Alexander ;
Silverman, Jeremy M. ;
Bernier, Raphael ;
Levy, Susan E. ;
Schultz, Robert T. ;
Dawson, Geraldine ;
Owley, Thomas ;
McMahon, William M. ;
Wassink, Thomas H. ;
Sweeney, John A. ;
Nurnberger, John I., Jr. ;
Coon, Hilary .
NATURE, 2009, 459 (7246) :569-573
[14]   Genomewide Association Studies - Illuminating Biologic Pathways [J].
Hirschhorn, Joel N. .
NEW ENGLAND JOURNAL OF MEDICINE, 2009, 360 (17) :1699-1701
[15]   Detection of large-scale variation in the human genome [J].
Iafrate, AJ ;
Feuk, L ;
Rivera, MN ;
Listewnik, ML ;
Donahoe, PK ;
Qi, Y ;
Scherer, SW ;
Lee, C .
NATURE GENETICS, 2004, 36 (09) :949-951
[16]  
Ihaka R., 1996, J. Comput. Graph. Stat., V5, P299, DOI [10.2307/1390807, 10.1080/10618600.1996.10474713, DOI 10.1080/10618600.1996.10474713]
[17]   Genetic association analysis of copy-number variation (CNV) in human disease pathogenesis [J].
Ionita-Laza, Iuliana ;
Rogers, Angela J. ;
Lange, Christoph ;
Raby, Benjamin A. ;
Lee, Charles .
GENOMICS, 2009, 93 (01) :22-26
[18]   Population Analysis of Large Copy Number Variants and Hotspots of Human Genetic Disease (vol 84, pg 148, 2009) [J].
Itsara, Andy ;
Cooper, Gregory M. ;
Baker, Carl ;
Girirajan, Santhosh ;
Li, Jun ;
Absher, Devin ;
Krauss, Ronald M. ;
Myers, Richard M. ;
Ridker, Paul M. ;
Chasman, Daniel I. ;
Mefford, Heather C. ;
Ying, Phyllis ;
Nickerson, Deborah A. ;
Eichler, Evan E. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (04) :550-551
[19]   Genotype, haplotype and copy-number variation in worldwide human populations [J].
Jakobsson, Mattias ;
Scholz, Sonja W. ;
Scheet, Paul ;
Gibbs, J. Raphael ;
VanLiere, Jenna M. ;
Fung, Hon-Chung ;
Szpiech, Zachary A. ;
Degnan, James H. ;
Wang, Kai ;
Guerreiro, Rita ;
Bras, Jose M. ;
Schymick, Jennifer C. ;
Hernandez, Dena G. ;
Traynor, Bryan J. ;
Simon-Sanchez, Javier ;
Matarin, Mar ;
Britton, Angela ;
van de Leemput, Joyce ;
Rafferty, Ian ;
Bucan, Maja ;
Cann, Howard M. ;
Hardy, John A. ;
Rosenberg, Noah A. ;
Singleton, Andrew B. .
NATURE, 2008, 451 (7181) :998-1003
[20]  
JULI A, 2008, ARTHRITIS RHEUMATISM, V588, P2275