Accurate detection of aneuploidies in array CGH and gene expression microarray data

被引:75
作者
Myers, CL
Dunham, MJ
Kung, SY
Troyanskaya, OG [1 ]
机构
[1] Princeton Univ, Lewis Sigler Inst Integrat Gen, Carl Icahn Lab, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/bth440
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Chromosomal copy number changes (aneuploidies) are common in cell populations that undergo multiple cell divisions including yeast strains, cell lines and tumor cells. Identification of aneuploidies is critical in evolutionary studies, where changes in copy number serve an adaptive purpose, as well as in cancer studies, where amplifications and deletions of chromosomal regions have been identified as a major pathogenetic mechanism. Aneuploidies can be studied on whole-genome level using array CGH (a microarray-based method that measures the DNA content), but their presence also affects gene expression. In gene expression microarray analysis, identification of copy number changes is especially important in preventing aberrant biological conclusions based on spurious gene expression correlation or masked phenotypes that arise due to aneuploidies. Previously suggested approaches for aneuploidy detection from microarray data mostly focus on array CGH, address only whole-chromosome or whole-arm copy number changes, and rely on thresholds or other heuristics, making them unsuitable for fully automated general application to gene expression datasets. There is a need for a general and robust method for identification of aneuploidies of any size from both array CGH and gene expression microarray data. Results: We present ChARM (Chromosomal Aberration Region Miner), a robust and accurate expectation-maximization based method for identification of segmental aneuploidies (partial chromosome changes) from gene expression and array CGH microarray data. Systematic evaluation of the algorithm on synthetic and biological data shows that the method is robust to noise, aneuploidal segment size and P-value cutoff. Using our approach, we identify known chromosomal changes and predict novel potential segmental aneuploidies in commonly used yeast deletion strains and in breast cancer. ChARM can be routinely used to identify aneuploidies in array CGH datasets and to screen gene expression data for aneuploidies or array biases. Our methodology is sensitive enough to detect statistically significant and biologically relevant aneuploidies even when expression or DNA content changes are subtle as in mixed populations of cells.
引用
收藏
页码:3533 / 3543
页数:11
相关论文
共 43 条
[1]   CGH-Plotter: MATLAB toolbox for CGH-data analysis [J].
Autio, R ;
Hautaniemi, S ;
Kauraniemi, P ;
Yli-Harja, O ;
Astola, J ;
Wolf, M ;
Kallioniemi, A .
BIOINFORMATICS, 2003, 19 (13) :1714-1715
[2]  
Baskar S, 1996, J IMMUNOL, V156, P3821
[3]   PHENOTYPIC DYNAMICS OF TUMOR PROGRESSION IN HUMAN-MALIGNANT MELANOMA [J].
BROCKER, EB ;
SUTER, L ;
BRUGGEN, J ;
RUITER, DJ ;
MACHER, E ;
SORG, C .
INTERNATIONAL JOURNAL OF CANCER, 1985, 36 (01) :29-35
[4]   High levels of Fas ligand and MHC class II in the absence of CD80 or CD86 expression and a decreased CD4+ T cell Infiltration, enables murine skin tumours to progress [J].
Byrne, SN ;
Halliday, GM .
CANCER IMMUNOLOGY IMMUNOTHERAPY, 2003, 52 (06) :396-402
[5]   Genetic instability and darwinian selection in tumours (Reprinted from Trends in Biochemical Science, vol 12, Dec., 1999) [J].
Cahill, DP ;
Kinzler, KW ;
Vogelstein, B ;
Lengauer, C .
TRENDS IN CELL BIOLOGY, 1999, 9 (12) :M57-M60
[6]   ORGANIZATION OF DNA-SEQUENCES AND REPLICATION ORIGINS AT YEAST TELOMERES [J].
CHAN, CSM ;
TYE, BK .
CELL, 1983, 33 (02) :563-573
[7]   Got1p and Sft2p: membrane proteins involved in traffic to the Golgi complex [J].
Conchon, S ;
Cao, XC ;
Barlow, C ;
Pelham, HRB .
EMBO JOURNAL, 1999, 18 (14) :3934-3946
[8]  
Crawley JJ, 2002, GENOME BIOL, V3
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]  
Dolinski K., 2004, SACCHAROMYCES GENOME