ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context

被引:1842
作者
Margolin, AA
Nemenman, I
Basso, K
Wiggins, C
Stolovitzky, G
Dalla Favera, R
Califano, A [1 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
[2] Columbia Univ, Joint Ctr Syst Biol, New York, NY 10032 USA
[3] Columbia Univ, Inst Canc Genet, New York, NY 10032 USA
[4] Columbia Univ, Dept Appl Phys & Appl Math, New York, NY 10032 USA
[5] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
Mutual Information; Joint Probability Distribution; Pairwise Interaction; Network Reconstruction; Relevance Network;
D O I
10.1186/1471-2105-7-S1-S7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Elucidating gene regulatory networks is crucial for understanding normal cell physiology and complex pathologic phenotypes. Existing computational methods for the genome-wide "reverse engineering" of such networks have been successful only for lower eukaryotes with simple genomes. Here we present ARACNE, a novel algorithm, using microarray expression profiles, specifically designed to scale up to the complexity of regulatory networks in mammalian cells, yet general enough to address a wider range of network deconvolution problems. This method uses an information theoretic approach to eliminate the majority of indirect interactions inferred by co-expression methods. Results: We prove that ARACNE reconstructs the network exactly (asymptotically) if the effect of loops in the network topology is negligible, and we show that the algorithm works well in practice, even in the presence of numerous loops and complex topologies. We assess ARACNE's ability to reconstruct transcriptional regulatory networks using both a realistic synthetic dataset and a microarray dataset from human B cells. On synthetic datasets ARACNE achieves very low error rates and outperforms established methods, such as Relevance Networks and Bayesian Networks. Application to the deconvolution of genetic networks in human B cells demonstrates ARACNE's ability to infer validated transcriptional targets of the cMYC proto-oncogene. We also study the effects of misestimation of mutual information on network reconstruction, and show that algorithms based on mutual information ranking are more resilient to estimation errors. Conclusion: ARACNE shows promise in identifying direct transcriptional interactions in mammalian cellular networks, a problem that has challenged existing reverse engineering algorithms. This approach should enhance our ability to use microarray data to elucidate functional mechanisms that underlie cellular processes and to identify molecular targets of pharmacological compounds in mammalian cellular networks.
引用
收藏
页数:15
相关论文
共 38 条
[21]  
Joe H., 1997, MULTIVARIATE MODELS
[22]   A THEORY OF COOPERATIVE PHENOMENA [J].
KIKUCHI, R .
PHYSICAL REVIEW, 1951, 81 (06) :988-1003
[23]   Gene expression profiling of B cell chronic lymphocytic leukemia reveals a homogeneous phenotype related to memory B cells [J].
Klein, U ;
Tu, YH ;
Stolovitzky, GA ;
Mattioli, M ;
Cattoretti, G ;
Husson, H ;
Freedman, A ;
Inghirami, G ;
Cro, L ;
Baldini, L ;
Neri, AN ;
Califano, A ;
Dalla-Favera, R .
JOURNAL OF EXPERIMENTAL MEDICINE, 2001, 194 (11) :1625-1638
[24]  
Ma S-K, 1985, STAT MECH
[25]   Artificial gene networks for objective comparison of analysis algorithms [J].
Mendes, Pedro ;
Sha, Wei ;
Ye, Keying .
BIOINFORMATICS, 2003, 19 :II122-II129
[26]   The Bethe lattice spin glass revisited [J].
Mézard, M ;
Parisi, G .
EUROPEAN PHYSICAL JOURNAL B, 2001, 20 (02) :217-233
[27]   The structure and function of complex networks [J].
Newman, MEJ .
SIAM REVIEW, 2003, 45 (02) :167-256
[28]  
Opper M., 2001, ADV MEAN FIELD METHO
[29]  
Pearl J., 1988, PROBABILISTIC REASON
[30]   Entropy and information in neural spike trains [J].
Strong, SP ;
Koberle, R ;
van Steveninck, RRD ;
Bialek, W .
PHYSICAL REVIEW LETTERS, 1998, 80 (01) :197-200