Similarities and differences in genome-wide expression data of six organisms

被引:267
作者
Bergmann, S
Ihmels, J
Barkai, N [1 ]
机构
[1] Weizmann Inst Sci, Dept Mol Genet, Complex Syst, IL-76100 Rehovot, Israel
[2] Weizmann Inst Sci, Dept Phys, Complex Syst, IL-76100 Rehovot, Israel
关键词
D O I
10.1371/journal.pbio.0020009
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Comparing genomic properties of different organisms is of fundamental importance in the study of biological and evolutionary principles. Although differences among organisms are often attributed to differential gene expression, genome-wide comparative analysis thus far has been based primarily on genomic sequence information. We present a comparative study of large datasets of expression profiles from six evolutionarily distant organisms: S. cerevisiae, C. elegans, E. coli, A. thaliana, D. melanogaster, and H. sapiens. We use genomic sequence information to connect these data and compare global and modular properties of the transcription programs. Linking genes whose expression profiles are similar, we find that for all organisms the connectivity distribution follows a power-law, highly connected genes tend to be essential and conserved, and the expression program is highly modular. We reveal the modular structure by decomposing each set of expression data into coexpressed modules. Functionally related sets of genes are frequently coexpressed in multiple organisms. Yet their relative importance to the transcription program and their regulatory relationships vary among organisms. Our results demonstrate the potential of combining sequence and expression data for improving functional gene annotation and expanding our understanding of how gene expression and diversity evolved.
引用
收藏
页码:85 / 93
页数:9
相关论文
共 39 条
  • [1] Statistical mechanics of complex networks
    Albert, R
    Barabási, AL
    [J]. REVIEWS OF MODERN PHYSICS, 2002, 74 (01) : 47 - 97
  • [2] Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms
    Alter, O
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (06) : 3351 - 3356
  • [3] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [4] Gene expression during the life cycle of Drosophila melanogaster
    Arbeitman, MN
    Furlong, EEM
    Imam, F
    Johnson, E
    Null, BH
    Baker, BS
    Krasnow, MA
    Scott, MP
    Davis, RW
    White, KP
    [J]. SCIENCE, 2002, 297 (5590) : 2270 - 2275
  • [5] Emergence of scaling in random networks
    Barabási, AL
    Albert, R
    [J]. SCIENCE, 1999, 286 (5439) : 509 - 512
  • [6] Iterative signature algorithm for the analysis of large-scale gene expression data
    Bergmann, S
    Ihmels, J
    Barkai, N
    [J]. PHYSICAL REVIEW E, 2003, 67 (03): : 18
  • [7] Highly optimized tolerance: Robustness and design in complex systems
    Carlson, JM
    Doyle, J
    [J]. PHYSICAL REVIEW LETTERS, 2000, 84 (11) : 2529 - 2532
  • [8] Endless forms: the evolution of gene regulation and morphological diversity
    Carroll, SB
    [J]. CELL, 2000, 101 (06) : 577 - 580
  • [9] Remodeling of yeast genome expression in response to environmental changes
    Causton, HC
    Ren, B
    Koh, SS
    Harbison, CT
    Kanin, E
    Jennings, EG
    Lee, TI
    True, HL
    Lander, ES
    Young, RA
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2001, 12 (02) : 323 - 337
  • [10] Comparison of the complete protein sets of worm and yeast: Orthology and divergence
    Chervitz, SA
    Aravind, L
    Sherlock, G
    Ball, CA
    Koonin, EV
    Dwight, SS
    Harris, MA
    Dolinski, K
    Mohr, S
    Smith, T
    Weng, S
    Cherry, JM
    Botstein, D
    [J]. SCIENCE, 1998, 282 (5396) : 2022 - 2028