Scaling laws in the functional content of genomes

Cited by: 217
Author
van Nimwegen, E [1]
Affiliation
[1] Rockefeller Univ, Ctr Studies Phys & Biol, New York, NY 12001 USA
DOI
10.1016/S0168-9525(03)00203-8
Chinese Library Classification (CLC) number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
With the number of sequenced genomes now totaling more than 100, and the availability of rough functional annotations for a substantial proportion of their genes, it has become possible to study the statistics of gene content across genomes. In this article I show that, for many high-level functional categories, the number of genes in each category scales as a power-law of the total number of genes in the genome. The occurrence of such scaling laws can be explained using a simple theoretical model, and this model suggests that the exponents of the observed scaling laws correspond to universal constants of the evolutionary process. I discuss some consequences of these scaling laws for our understanding of organism design.
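The scaling relation described in the abstract can be written as n_c = beta * N^alpha, where N is the total number of genes in a genome and n_c is the number of genes in one functional category. Taking logarithms gives a straight line, log n_c = log beta + alpha * log N, so the exponent alpha can be estimated as a slope. The sketch below illustrates that idea only: the function name `fit_power_law`, the genome sizes, and the category counts are all made up for illustration, and ordinary least squares is an assumed fitting method, not necessarily the procedure used in the article.

```python
import math


def fit_power_law(total_genes, category_genes):
    """Least-squares fit of log(n_c) = log(beta) + alpha * log(N).

    Returns (alpha, beta). Hypothetical helper, not taken from the article.
    """
    xs = [math.log(n) for n in total_genes]
    ys = [math.log(c) for c in category_genes]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Slope of the regression line in log-log space is the exponent.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    alpha = sxy / sxx
    # Intercept in log space gives the prefactor after exponentiation.
    beta = math.exp(mean_y - alpha * mean_x)
    return alpha, beta


# Synthetic genome sizes (total gene counts); NOT data from the article.
genome_sizes = [500, 1000, 2000, 4000, 8000]
# Hypothetical category whose gene count grows quadratically with genome size.
category_counts = [1e-5 * n ** 2 for n in genome_sizes]

alpha, beta = fit_power_law(genome_sizes, category_counts)
print(f"exponent ~ {alpha:.2f}, prefactor ~ {beta:.2e}")
```

With real annotation counts the points would scatter around the line, and a fit of this kind would need to account for that noise and for categories with zero genes, which a plain logarithm cannot handle.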
Pages: 479-484
Page count: 6