Assessing the significance of conserved genomic aberrations using high resolution genomic microarrays

Cited by: 34
Authors
Guttman, Mitchell [1]
Mies, Carolyn
Dudycz-Sulicz, Katarzyna
Diskin, Sharon J.
Baldwin, Don A.
Stoeckert, Christian J., Jr.
Grant, Gregory R.
Affiliations
[1] Univ Penn, Penn Ctr Bioinformat, Philadelphia, PA 19104 USA
[2] Univ Penn, Sch Med, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
[3] Childrens Hosp Philadelphia, Div Oncol, Philadelphia, PA 19104 USA
[4] Univ Penn, Penn Microarray Facil, Philadelphia, PA 19104 USA
[5] Univ Penn, Sch Med, Dept Genet, Philadelphia, PA 19104 USA
Source
PLOS GENETICS | 2007 / Vol. 3 / Issue 08
DOI
10.1371/journal.pgen.0030143
Chinese Library Classification (CLC) number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Genomic aberrations recurrent in a particular cancer type can be important prognostic markers for tumor progression. Typically in early tumorigenesis, cells incur a breakdown of the DNA replication machinery that results in an accumulation of genomic aberrations in the form of duplications, deletions, translocations, and other genomic alterations. Microarray methods allow for finer mapping of these aberrations than has previously been possible; however, data processing and analysis methods have not taken full advantage of this higher resolution. Attention has primarily been given to analysis on the single sample level, where multiple adjacent probes are necessarily used as replicates for the local region containing their target sequences. However, regions of concordant aberration can be short enough to be detected by only one, or very few, array elements. We describe a method called Multiple Sample Analysis for assessing the significance of concordant genomic aberrations across multiple experiments that does not require a priori definition of aberration calls for each sample. If there are multiple samples, representing a class, then by exploiting the replication across samples our method can detect concordant aberrations at much higher resolution than can be derived from current single sample approaches. Additionally, this method provides a meaningful approach to addressing population-based questions such as determining important regions for a cancer subtype of interest or determining regions of copy number variation in a population. Multiple Sample Analysis also provides single sample aberration calls in the locations of significant concordance, producing high resolution calls per sample in concordant regions.
The approach is demonstrated on a dataset representing a challenging but important resource: breast tumors that have been formalin-fixed, paraffin-embedded, archived, and subsequently UV-laser capture microdissected and hybridized to two-channel BAC arrays using an amplification protocol. We demonstrate the accurate detection on simulated data, and on real datasets involving known regions of aberration within subtypes of breast cancer at a resolution consistent with that of the array. Similarly, we apply our method to previously published datasets, including a 250K SNP array, and verify known results as well as detect novel regions of concordant aberration. The algorithm has been fully implemented and tested and is freely available as a Java application at http://www.cbil.upenn.edu/MSA.
Pages: 1464 - 1486
Page count: 23
Related papers
37 total
  • [1] Genetic relation of lobular carcinoma in situ, ductal carcinoma in situ, and associated invasive carcinoma of the breast
    Buerger, H
    Simon, R
    Schäfer, KL
    Diallo, R
    Littmann, R
    Poremba, C
    van Diest, PJ
    Dockhorn-Dworniczak, B
    Böcker, W
    [J]. JOURNAL OF CLINICAL PATHOLOGY-MOLECULAR PATHOLOGY, 2000, 53 (03) : 118 - 121
  • [2] Buerger H, 1999, J PATHOL, V187, P396, DOI 10.1002/(SICI)1096-9896(199903)187:4<396::AID-PATH286>3.0.CO;2-L
  • [4] STAC: A method for testing the significance of DNA copy number aberrations across multiple array-CGH experiments
    Diskin, Sharon J.
    Eck, Thomas
    Greshock, Joel
    Mosse, Yael P.
    Naylor, Tara
    Stoeckert, Christian J., Jr.
    Weber, Barbara L.
    Maris, John M.
    Grant, Gregory R.
    [J]. GENOME RESEARCH, 2006, 16 (09) : 1149 - 1158
  • [5] Ewens W, 2005, STAT BIOL HEALTH, P1, DOI 10.1007/b137845
  • [6] Hidden Markov models approach to the analysis of array CGH data
    Fridlyand, J
    Snijders, AM
    Pinkel, D
    Albertson, DG
    Jain, AN
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) : 132 - 153
  • [7] Significance testing for direct identity-by-descent mapping
    Grant, GR
    Manduchi, E
    Cheung, VG
    Ewens, WJ
    [J]. ANNALS OF HUMAN GENETICS, 1999, 63 : 441 - 454
  • [8] Greshock J, 2004, GENOME RES, V14, P179
  • [9] The hallmarks of cancer
    Hanahan, D
    Weinberg, RA
    [J]. CELL, 2000, 100 (01) : 57 - 70
  • [10] Denoising array-based comparative genomic hybridization data using wavelets
    Hsu, L
    Self, SG
    Grove, D
    Randolph, T
    Wang, K
    Delrow, JJ
    Loo, L
    Porter, P
    [J]. BIOSTATISTICS, 2005, 6 (02) : 211 - 226