Extended local similarity analysis (eLSA) of microbial community and other time series data with replicates

被引:177
作者
Xia, Li C. [1 ]
Steele, Joshua A. [2 ]
Cram, Jacob A.
Cardon, Zoe G. [3 ]
Simmons, Sheri L. [4 ]
Vallino, Joseph J. [3 ]
Fuhrman, Jed A.
Sun, Fengzhu [1 ]
机构
[1] Univ So Calif, Dept Biol Sci, Mol & Computat Biol Program, Los Angeles, CA 90089 USA
[2] CALTECH, Div Geol & Planetary Sci, Pasadena, CA 91125 USA
[3] Marine Biol Lab, Ctr Ecosyst, Woods Hole, MA 02543 USA
[4] Marine Biol Lab, Bay Paul Ctr, Woods Hole, MA 02543 USA
基金
美国国家科学基金会;
关键词
GENE-EXPRESSION; BACTERIAL; DYNAMICS; NETWORKS; LAKE; COVARIANCE; DIVERSITY; PATTERNS;
D O I
10.1186/1752-0509-5-S2-S15
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The increasing availability of time series microbial community data from metagenomics and other molecular biological studies has enabled the analysis of large-scale microbial co-occurrence and association networks. Among the many analytical techniques available, the Local Similarity Analysis (LSA) method is unique in that it captures local and potentially time-delayed co-occurrence and association patterns in time series data that cannot otherwise be identified by ordinary correlation analysis. However LSA, as originally developed, does not consider time series data with replicates, which hinders the full exploitation of available information. With replicates, it is possible to understand the variability of local similarity (LS) score and to obtain its confidence interval. Results: We extended our LSA technique to time series data with replicates and termed it extended LSA, or eLSA. Simulations showed the capability of eLSA to capture subinterval and time-delayed associations. We implemented the eLSA technique into an easy-to-use analytic software package. The software pipeline integrates data normalization, statistical correlation calculation, statistical significance evaluation, and association network construction steps. We applied the eLSA technique to microbial community and gene expression datasets, where unique time-dependent associations were identified. Conclusions: The extended LSA analysis technique was demonstrated to reveal statistically significant local and potentially time-delayed association patterns in replicated time series data beyond that of ordinary correlation analysis. These statistically significant associations can provide insights to the real dynamics of biological systems. The newly designed eLSA software efficiently streamlines the analysis and is freely available from the eLSA homepage, which can be accessed at http://meta.usc.edu/softs/lsa.
引用
收藏
页数:12
相关论文
共 32 条
[1]  
[Anonymous], 1998, INTRO BOOTSTRAP
[2]   Clustering of gene expression data using a local shape-based similarity measure [J].
Balasubramaniyan, R ;
Hüllermeier, E ;
Weskamp, N ;
Kämper, J .
BIOINFORMATICS, 2005, 21 (07) :1069-1077
[3]   Analyzing time series gene expression data [J].
Bar-Joseph, Z .
BIOINFORMATICS, 2004, 20 (16) :2493-2503
[4]   A global network of coexisting microbes from environmental and whole-genome sequence data [J].
Chaffron, Samuel ;
Rehrauer, Hubert ;
Pernthaler, Jakob ;
von Mering, Christian .
GENOME RESEARCH, 2010, 20 (07) :947-959
[5]   Integration of biological networks and gene expression data using Cytoscape [J].
Cline, Melissa S. ;
Smoot, Michael ;
Cerami, Ethan ;
Kuchinsky, Allan ;
Landys, Nerius ;
Workman, Chris ;
Christmas, Rowan ;
Avila-Campilo, Iliana ;
Creech, Michael ;
Gross, Benjamin ;
Hanspers, Kristina ;
Isserlin, Ruth ;
Kelley, Ryan ;
Killcoyne, Sarah ;
Lotia, Samad ;
Maere, Steven ;
Morris, John ;
Ono, Keiichiro ;
Pavlovic, Vuk ;
Pico, Alexander R. ;
Vailaya, Aditya ;
Wang, Peng-Liang ;
Adler, Annette ;
Conklin, Bruce R. ;
Hood, Leroy ;
Kuiper, Martin ;
Sander, Chris ;
Schmulevich, Ilya ;
Schwikowski, Benno ;
Warner, Guy J. ;
Ideker, Trey ;
Bader, Gary D. .
NATURE PROTOCOLS, 2007, 2 (10) :2366-2382
[6]   Seasonal analysis of protistan community structure and diversity at the USC Microbial Observatory (San Pedro Channel, North Pacific Ocean) [J].
Countway, Peter D. ;
Vigil, Patrick D. ;
Schnetzer, Astrid ;
Moorthi, Stefanie D. ;
Caron, David A. .
LIMNOLOGY AND OCEANOGRAPHY, 2010, 55 (06) :2381-2396
[7]  
Fisher MM, 1999, APPL ENVIRON MICROB, V65, P4630
[8]   Microbial community structure and its functional implications [J].
Fuhrman, Jed A. .
NATURE, 2009, 459 (7244) :193-199
[9]   The molecular mechanics of eukaryotic translation [J].
Kapp, LD ;
Lorsch, JR .
ANNUAL REVIEW OF BIOCHEMISTRY, 2004, 73 :657-704
[10]   Importance of replication in microarray gene expression studies: Statistical methods and evidence from repetitive cDNA hybridizations [J].
Lee, MLT ;
Kuo, FC ;
Whitmore, GA ;
Sklar, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :9834-9839