Robust detection of periodic time series measured from biological systems -: art. no. 117

被引:101
作者
Ahdesmäki, M
Lähdesmäki, H
Pearson, R
Huttunen, H
Yli-Harja, O
机构
[1] Tampere Univ Technol, Inst Signal Proc, FIN-33101 Tampere, Finland
[2] ProSanos Corp, Harrisburg, PA 17101 USA
关键词
D O I
10.1186/1471-2105-6-117
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Periodic phenomena are widespread in biology. The problem of finding periodicity in biological time series can be viewed as a multiple hypothesis testing of the spectral content of a given time series. The exact noise characteristics are unknown in many bioinformatics applications. Furthermore, the observed time series can exhibit other non-idealities, such as outliers, short length and distortion from the original wave form. Hence, the computational methods should preferably be robust against such anomalies in the data. Results: We propose a general-purpose robust testing procedure for finding periodic sequences in multiple time series data. The proposed method is based on a robust spectral estimator which is incorporated into the hypothesis testing framework using a so-called g-statistic together with correction for multiple testing. This results in a robust testing procedure which is insensitive to heavy contamination of outliers, missing- values, short time series, nonlinear distortions, and is completely insensitive to any monotone nonlinear distortions. The performance of the methods is evaluated by performing extensive simulations. In addition, we compare the proposed method with another recent statistical signal detection estimator that uses Fisher's test, based on the Gaussian noise assumption. The results demonstrate that the proposed robust method provides remarkably better robustness properties. Moreover, the performance of the proposed method is preferable also in the standard Gaussian case. We validate the performance of the proposed method on real data on which the method performs very favorably. Conclusion: As the time series measured from biological systems are usually short and prone to contain different kinds of non-idealities, we are very optimistic about the multitude of possible applications for our proposed robust statistical periodicity detection method. Availability: The presented methods have been implemented in Matlab and in R. Codes are available on request. Supplementary material is available at: http://www.cs.tut.fi/sgn/ csb/ robustperiodic/.
引用
收藏
页数:18
相关论文
共 27 条
  • [1] ARTIS M, 200410 ECO EUR U I
  • [2] Deconvolving cell cycle expression data with complementary information
    Bar-Joseph, Ziv
    Farkash, Shlomit
    Gifford, David K.
    Simon, Itamar
    Rosenfeld, Roni
    [J]. BIOINFORMATICS, 2004, 20 : 23 - 30
  • [3] Periodic transcription: A cycle within a cycle
    Breeden, LL
    [J]. CURRENT BIOLOGY, 2003, 13 (01) : R31 - R38
  • [4] Brockwell P. J., 1991, TIME SERIES THEORY M
  • [5] CHIU ST, 1989, J ROY STAT SOC B MET, V51, P249
  • [6] Multiple oscillators regulate circadian gene expression in Neurospora
    Correa, A
    Lewis, AZ
    Greene, AV
    March, IJ
    Gomer, RH
    Bell-Pedersen, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (23) : 13597 - 13602
  • [7] Comparison of computational methods for the identification of cell cycle-regulated genes
    de Lichtenberg, U
    Jensen, LJ
    Fausboll, A
    Jensen, TS
    Bork, P
    Brunak, S
    [J]. BIOINFORMATICS, 2005, 21 (07) : 1164 - 1171
  • [8] Multiple hypothesis testing in microarray experiments
    Dudoit, S
    Shaffer, JP
    Boldrick, JC
    [J]. STATISTICAL SCIENCE, 2003, 18 (01) : 71 - 103
  • [9] GOOD P, 2003, PERMUTATION TESTS PR
  • [10] A multivariate approach applied to microarray data for identification of genes with cell cycle-coupled transcription
    Johansson, D
    Lindgren, P
    Berglund, A
    [J]. BIOINFORMATICS, 2003, 19 (04) : 467 - 473