Identification of functional modules using network topology and high-throughput data

Cited by: 209
Authors
Ulitsky, Igor [1]
Shamir, Ron [1]
Affiliations
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
DOI
10.1186/1752-0509-1-8
Chinese Library Classification number
Q [Biological sciences];
Discipline classification code
07; 0710; 09
Abstract
Background: With the advent of systems biology, biological knowledge is often represented today by networks. These include regulatory and metabolic networks, protein-protein interaction networks, and many others. At the same time, high-throughput genomics and proteomics techniques generate very large data sets, which require sophisticated computational analysis. Usually, separate analysis methodologies are applied to each of the two data types. An integrated investigation of network and high-throughput information can improve the quality of the analysis by accounting simultaneously for topological network properties and intrinsic features of the high-throughput data.
Results: We describe a novel algorithmic framework for this challenge. We first transform the high-throughput data into similarity values (e.g., by computing pairwise similarity of gene expression patterns from microarray data). Then, given a network of genes or proteins and similarity values between some of them, we seek connected sub-networks (or modules) that manifest high similarity. We develop algorithms for this problem and evaluate their performance on the osmotic shock response network in S. cerevisiae and on the human cell cycle network. We demonstrate that focused, biologically meaningful and relevant functional modules are obtained. In comparison with extant algorithms, our approach has higher sensitivity and higher specificity.
Conclusion: We have demonstrated that our method can accurately identify functional modules. Hence, it promises to be highly useful in the analysis of high-throughput data.
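The two steps named in the Results (turning expression data into pairwise similarity values, then seeking connected sub-networks with high similarity) can be illustrated with a minimal sketch. This is not the authors' algorithm: the greedy growth rule, the use of Pearson correlation as the similarity, and the names `pearson`, `grow_module`, `network`, and `profiles` are all assumptions made for illustration, and the toy data is invented.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def grow_module(seed, network, profiles):
    """Greedily grow a connected module around `seed` (illustrative only).

    network:  dict mapping each gene to the set of its network neighbours.
    profiles: dict mapping genes to expression profiles; `seed` must be
              present, other genes may be missing and are then skipped.
    A neighbour of the current module is added only while doing so
    increases the module's summed pairwise similarity.
    """
    module = {seed}
    while True:
        # Genes adjacent to the module: adding one keeps it connected.
        frontier = set().union(*(network[g] for g in module)) - module
        best_gene, best_gain = None, 0.0
        for cand in sorted(frontier):
            if cand not in profiles:
                continue
            gain = sum(pearson(profiles[cand], profiles[m]) for m in module)
            if gain > best_gain:
                best_gene, best_gain = cand, gain
        if best_gene is None:
            return module
        module.add(best_gene)


# Toy path network a - b - c - d with invented expression profiles.
network = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}
profiles = {
    "a": [1, 2, 3, 4],
    "b": [2, 4, 6, 8.5],
    "c": [4, 3, 2, 1],
    "d": [1, 2, 3, 5],
}
module = grow_module("a", network, profiles)
```

In this toy case gene `d` correlates well with `a` and `b` but is reachable only through the anti-correlated gene `c`, so the greedy rule stops at `{a, b}`. That is the kind of interplay between topology and similarity the abstract refers to, although the published method may well handle such cases differently.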
Pages: 17