A generalized framework for Network Component Analysis

被引:34
作者
Boscolo, R
Sabatti, C
Liao, JC
Roychowdhury, VP
机构
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Human Genet & Stat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Chem Engn, Los Angeles, CA 90095 USA
基金
美国国家科学基金会;
关键词
system identification; biology and genetics; network models; data analysis;
D O I
10.1109/TCBB.2005.47
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The authors recently introduced a framework, named Network Component Analysis (NCA), for the reconstruction of the dynamics of transcriptional regulators' activities from gene expression assays. The original formulation had certain shortcomings that limited NCA'S application to a wide class of network dynamics reconstruction problems, either because of limitations in the sample size or because of the stringent requirements imposed by the set of identifiability conditions. In addition, the performance characteristics of the method for various levels of data noise or in the presence of model inaccuracies were never investigated. In this article, the following aspects of NCA have been addressed, resulting in a set of extensions to the original framework: 1) The sufficient conditions on the a priori connectivity information (required for successful reconstructions via NCA) are made less stringent, allowing easier verification of whether a network topology is identifiable, as well as extending the class of identifiable systems. Such a result is accomplished by introducing a set of identifiability requirements that can be directly tested on the regulatory architecture, rather than on specific instances of the system matrix. 2) The two-stage least square iterative procedure used in NCA is proven to identify stationary points of the likelihood function, under Gaussian noise assumption, thus reinforcing the statistical foundations of the method. 3) A framework for the simultaneous reconstruction of multiple regulatory subnetworks is introduced, thus overcoming one of the critical limitations of the original formulation of the decomposition, for example, occurring for poorly sampled data (typical of microarray experiments). A set of monte carlo simulations we conducted with synthetic data suggests that the approach is indeed capable of accurately reconstructing regulatory signals when these are the input of large-scale networks that satisfy the suggested identifiability criteria, even under fairly noisy conditions. The sensitivity of the reconstructed signals to inaccuracies in the hypothesized network topology is also investigated. We demonstrate the feasibility of our approach for the simultaneous reconstruction of multiple regulatory subnetworks from the same data set with a successful application of the technique to gene expression measurements of the bacterium Escherichia coli.
引用
收藏
页码:289 / 301
页数:13
相关论文
共 20 条
  • [11] Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis
    Kao, KC
    Yang, YL
    Boscolo, R
    Sabatti, C
    Roychowdhury, V
    Liao, JC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (02) : 641 - 646
  • [12] EcoCyc:: a comprehensive database resource for Escherichia coli
    Keseler, IM
    Collado-Vides, J
    Gama-Castro, S
    Ingraham, J
    Paley, S
    Paulsen, IT
    Peralta-Gill, M
    Karp, PD
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D334 - D337
  • [13] Transcriptional regulatory networks in Saccharomyces cerevisiae
    Lee, TI
    Rinaldi, NJ
    Robert, F
    Odom, DT
    Bar-Joseph, Z
    Gerber, GK
    Hannett, NM
    Harbison, CT
    Thompson, CM
    Simon, I
    Zeitlinger, J
    Jennings, EG
    Murray, HL
    Gordon, DB
    Ren, B
    Wyrick, JJ
    Tagne, JB
    Volkert, TL
    Fraenkel, E
    Gifford, DK
    Young, RA
    [J]. SCIENCE, 2002, 298 (5594) : 799 - 804
  • [14] Network component analysis: Reconstruction of regulatory signals in biological systems
    Liao, JC
    Boscolo, R
    Yang, YL
    Tran, LM
    Sabatti, C
    Roychowdhury, VP
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (26) : 15522 - 15527
  • [15] TRANSFAC®:: transcriptional regulation, from patterns to profiles
    Matys, V
    Fricke, E
    Geffers, R
    Gössling, E
    Haubrock, M
    Hehl, R
    Hornischer, K
    Karas, D
    Kel, AE
    Kel-Margoulis, OV
    Kloos, DU
    Land, S
    Lewicki-Potapov, B
    Michael, H
    Münch, R
    Reuter, I
    Rotert, S
    Saxel, H
    Scheer, M
    Thiele, S
    Wingender, E
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 374 - 378
  • [16] THE E-COLI-FIS PROMOTER IS SUBJECT TO STRINGENT CONTROL AND AUTOREGULATION
    NINNEMANN, O
    KOCH, C
    KAHMANN, R
    [J]. EMBO JOURNAL, 1992, 11 (03) : 1075 - 1083
  • [17] Roberts S., 2001, Independent component analysis: principles and practice
  • [18] RegulonDB (version 4.0):: transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12
    Salgado, H
    Gama-Castro, S
    Martínez-Antonio, A
    Díaz-Peredo, E
    Sánchez-Solano, F
    Peralta-Gil, M
    Garcia-Alonso, D
    Jiménez-Jacinto, V
    Santos-Zavaleta, A
    Bonavides-Martínez, C
    Collado-Vides, J
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D303 - D306
  • [19] Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data
    Segal, E
    Shapira, M
    Regev, A
    Pe'er, D
    Botstein, D
    Koller, D
    Friedman, N
    [J]. NATURE GENETICS, 2003, 34 (02) : 166 - 176
  • [20] Systematic determination of genetic network architecture
    Tavazoie, S
    Hughes, JD
    Campbell, MJ
    Cho, RJ
    Church, GM
    [J]. NATURE GENETICS, 1999, 22 (03) : 281 - 285