A statistical method to detect chromosomal regions with DNA copy number alterations using SNP-array-based CGH data

被引:19
作者
Lai, YL
Zhao, HY
机构
[1] George Washington Univ, Dept Stat, Washington, DC 20052 USA
[2] Yale Univ, Sch Med, Dept Epidemiol & Publ Hlth, New Haven, CT USA
[3] Yale Univ, Sch Med, Dept Genet, New Haven, CT 06510 USA
关键词
comparative genomic hybridization (CGH); single nucleotide polymorphism (SNP) array; statistical method;
D O I
10.1016/j.compbiolchem.2004.12.004
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single nucleotide polymorphism (SNP) arrays were used to detect chromosomal regions with DNA copy number alterations. Current statistical methods for microarray-based comparative genomic hybridization (array-CGH) analysis generally assume certain relationships among adjacent markers on the same chromosome, and these assumptions may be questionable. For an SNP-array-based CGH study, multiple normal reference SNP arrays were collected. In order to utilize these normal reference SNP arrays, we derived an empirical distribution of signal ratios for each SNP marker. With an assumed threshold value for the overall error rate control and the defined signal ratio ranges for chromosomal amplification and deletion, we proposed a procedure to identify chromosomal alteration regions based on several bootstrapped one-sample t-tests and the false discovery rate control. When we have multiple arrays for different individuals with the same disease, our method can also be used to detect SNP markers for chromosomal alteration regions that are common among these individuals. We applied our method to a published SNP array data set for breast carcinoma cell lines. For an individual with breast cancer, numerous chromosomal alteration regions were identified. Compared to results of previous studies, our method identified more chromosomal alteration regions, with some being implicated in the literature to harbor genes associated with breast cancer. For multiple cancer arrays, our results suggested the existence of common chromosomal alteration regions. However, a high proportion of false positives also indicated that genetic variations among different individuals with breast cancer can be present. (c) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:47 / 54
页数:8
相关论文
共 25 条
[1]  
An HX, 1999, CANCER RES, V59, P3941
[2]   CGH-Plotter: MATLAB toolbox for CGH-data analysis [J].
Autio, R ;
Hautaniemi, S ;
Kauraniemi, P ;
Yli-Harja, O ;
Astola, J ;
Wolf, M ;
Kallioniemi, A .
BIOINFORMATICS, 2003, 19 (13) :1714-1715
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   High-resolution analysis of DNA copy number using oligonucleotide microarrays [J].
Bignell, GR ;
Huang, J ;
Greshock, J ;
Watt, S ;
Butler, A ;
West, S ;
Grigorova, M ;
Jones, KW ;
Wei, W ;
Stratton, MR ;
Futreal, PA ;
Weber, B ;
Shapero, MH ;
Wooster, R .
GENOME RESEARCH, 2004, 14 (02) :287-295
[5]  
CHEN LC, 1994, CANCER RES, V54, P3021
[6]   Hidden Markov models approach to the analysis of array CGH data [J].
Fridlyand, J ;
Snijders, AM ;
Pinkel, D ;
Albertson, DG ;
Jain, AN .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) :132-153
[7]   Modelling the molecular circuitry of cancer [J].
Hahn, WC ;
Weinberg, RA .
NATURE REVIEWS CANCER, 2002, 2 (05) :331-341
[8]   2 GUIDELINES FOR BOOTSTRAP HYPOTHESIS-TESTING [J].
HALL, P ;
WILSON, SR .
BIOMETRICS, 1991, 47 (02) :757-762
[9]   Analysis of array CGH data:: from signal ratio to gain and loss of DNA regions [J].
Hupé, P ;
Stransky, N ;
Thiery, JP ;
Radvanyi, F ;
Barillot, E .
BIOINFORMATICS, 2004, 20 (18) :3413-3422
[10]   A tiling resolution DNA microarray with complete coverage of the human genome [J].
Ishkanian, AS ;
Malloff, CA ;
Watson, SK ;
deLeeuw, RJ ;
Chi, B ;
Coe, BP ;
Snijders, A ;
Albertson, DG ;
Pinkel, D ;
Marra, MA ;
Ling, V ;
MacAulay, C ;
Lam, WL .
NATURE GENETICS, 2004, 36 (03) :299-303