A comparison study: applying segmentation to array CGH data for downstream analyses

被引:229
作者
Willenbrock, H
Fridlyand, J
机构
[1] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94143 USA
[2] Tech Univ Denmark, Dept Biotechnol, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
关键词
D O I
10.1093/bioinformatics/bti677
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Array comparative genomic hybridization (CGH) allows detection and mapping of copy number of DNA segments. A challenge is to make inferences about the copy number structure of the genome. Several statistical methods have been proposed to determine genomic segments with different copy number levels. However, to date, no comprehensive comparison of various characteristics of these methods exists. Moreover, the segmentation results have not been utilized in downstream analyses. Results: We describe a comparison of three popular and publicly available methods for the analysis of array CGH data and we demonstrate how segmentation results may be utilized in the downstream analyses such as testing and classification, yielding higher power and prediction accuracy. Since the methods operate on individual chromosomes, we also propose a novel procedure for merging segments across the genome, which results in an interpretable set of copy number levels, and thus facilitate identification of copy number alterations in each genome.
引用
收藏
页码:4084 / 4091
页数:8
相关论文
共 18 条
[1]  
[Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
[2]   CONSTRUCTING CONFIDENCE SETS USING RANK STATISTICS [J].
BAUER, DF .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1972, 67 (339) :687-690
[3]   A versatile statistical analysis algorithm to detect genome copy number variation [J].
Daruwala, RS ;
Rudra, A ;
Ostrer, H ;
Lucito, R ;
Wigler, M ;
Mishra, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (46) :16292-16297
[4]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[5]   Hidden Markov models approach to the analysis of array CGH data [J].
Fridlyand, J ;
Snijders, AM ;
Pinkel, D ;
Albertson, DG ;
Jain, AN .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) :132-153
[6]   Genome scanning with array CGH delineates regional alterations in mouse islet carcinomas [J].
Hodgson G. ;
Hager J.H. ;
Volik S. ;
Hariono S. ;
Wernick M. ;
Moore D. ;
Albertson D.G. ;
Pinkel D. ;
Collins C. ;
Hanahan D. ;
Gray J.W. .
Nature Genetics, 2001, 29 (4) :459-464
[7]   Denoising array-based comparative genomic hybridization data using wavelets [J].
Hsu, L ;
Self, SG ;
Grove, D ;
Randolph, T ;
Wang, K ;
Delrow, JJ ;
Loo, L ;
Porter, P .
BIOSTATISTICS, 2005, 6 (02) :211-226
[8]   Analysis of array CGH data:: from signal ratio to gain and loss of DNA regions [J].
Hupé, P ;
Stransky, N ;
Thiery, JP ;
Radvanyi, F ;
Barillot, E .
BIOINFORMATICS, 2004, 20 (18) :3413-3422
[9]   Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data [J].
Lai, WR ;
Johnson, MD ;
Kucherlapati, R ;
Park, PJ .
BIOINFORMATICS, 2005, 21 (19) :3763-3770
[10]   Accurate detection of aneuploidies in array CGH and gene expression microarray data [J].
Myers, CL ;
Dunham, MJ ;
Kung, SY ;
Troyanskaya, OG .
BIOINFORMATICS, 2004, 20 (18) :3533-3543