Denoising array-based comparative genomic hybridization data using wavelets

被引:109
作者
Hsu, L
Self, SG
Grove, D
Randolph, T
Wang, K
Delrow, JJ
Loo, L
Porter, P
机构
[1] Fred Hutchinson Canc Res Ctr, Biostat Program, Seattle, WA 98109 USA
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[3] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
[4] Fred Hutchinson Canc Res Ctr, Human Biol Div, Seattle, WA 98109 USA
关键词
array-CGH; denoising; hidden Markov models; nonparametric modeling; threshold; wavelet analysis;
D O I
10.1093/biostatistics/kxi004
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Array-based comparative genomic hybridization (array-CGH) provides a high-throughput, high-resolution method to measure relative changes in DNA copy number simultaneously at thousands of genomic loci. Typically, these measurements are reported and displayed linearly on chromosome maps, and gains and losses are detected as deviations from normal diploid cells. We propose that one may consider denoising the data to uncover the true copy number changes before drawing inferences on the patterns of aberrations in the samples. Nonparametric techniques are particularly suitable for data denoising as they do not impose a parametric model in finding structures in the data. In this paper, we employ wavelets to denoise the data as wavelets have sound theoretical properties and a fast computational algorithm, and are particularly well suited for handling the abrupt changes seen in array-CGH data. A simulation study shows that denoising data prior to testing can achieve greater power in detecting the aberrant spot than using the raw data without denoising. Finally, we illustrate the method on two array-CGH data sets.
引用
收藏
页码:211 / 226
页数:16
相关论文
共 31 条
  • [1] Abramovich F., 1995, LECT NOTES STAT, V103, P5
  • [2] Antoniadis A., 2001, J STAT SOFTW, V6, P1, DOI [10.18637/jss.v006.i06, DOI 10.18637/JSS.V006.I06]
  • [3] IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE
    DONOHO, DL
    JOHNSTONE, IM
    [J]. BIOMETRIKA, 1994, 81 (03) : 425 - 455
  • [4] DONOHO DL, 1995, J ROY STAT SOC B MET, V57, P301
  • [5] Adapting to unknown smoothness via wavelet shrinkage
    Donoho, DL
    Johnstone, IM
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) : 1200 - 1224
  • [6] Dudoit S, 2002, STAT SINICA, V12, P111
  • [7] Fan J., 1996, LOCAL POLYNOMIAL MOD
  • [8] Hidden Markov models approach to the analysis of array CGH data
    Fridlyand, J
    Snijders, AM
    Pinkel, D
    Albertson, DG
    Jain, AN
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) : 132 - 153
  • [9] Friedman J., 2001, ELEMENTS STAT LEARNI, V1
  • [10] Green P. J., 1994, NONPARAMETRIC REGRES