Did evolution leap to create the protein universe?

被引:38
作者
Rost, B
机构
[1] Columbia Univ, CUBIC, Dept Biochem & Mol Biophys, New York, NY 10032 USA
[2] Columbia Univ, Ctr Computat Biol & Bioinformat C2B2, New York, NY 10032 USA
关键词
D O I
10.1016/S0959-440X(02)00337-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The genomes of over 60 organisms from all three kingdoms of life are now entirely sequenced. In many respects, the inventory of proteins used in different kingdoms appears surprisingly similar. However, eukaryotes differ from other kingdoms in that they use many long proteins, and have more proteins with coiled-coil helices and with regions abundant in regular secondary structure. Particular structural domains are used in many pathways. Nevertheless, one domain tends to occur only once in one particular pathway. Many proteins do not have close homologues in different species (orphans) and there could even be folds that are specific to one species. This view implies that protein fold space is discrete. An alternative model suggests that structure space is continuous and that modern proteins evolved by aggregating fragments of ancient proteins. Either way, after having harvested proteomes by applying standard tools, the challenge now seems to be to develop better methods for comparative proteomics.
引用
收藏
页码:409 / 416
页数:8
相关论文
共 95 条
[21]   GeneRAGE: a robust algorithm for sequence clustering and domain detection [J].
Enright, AJ ;
Ouzounis, CA .
BIOINFORMATICS, 2000, 16 (05) :451-457
[22]   EVA:: continuous automatic evaluation of protein structure prediction servers [J].
Eyrich, VA ;
Martí-Renom, MA ;
Przybylski, D ;
Madhusudhan, MS ;
Fiser, A ;
Pazos, F ;
Valencia, A ;
Sali, A ;
Rost, B .
BIOINFORMATICS, 2001, 17 (12) :1242-1243
[23]   Genomic-scale comparison of sequence- and structure-based methods of function prediction: Does structure provide additional insight? [J].
Fetrow, JS ;
Siew, N ;
Di Gennaro, JA ;
Martinez-Yamout, M ;
Dyson, HJ ;
Skolnick, J .
PROTEIN SCIENCE, 2001, 10 (05) :1005-1014
[24]   Predicting structures for genome proteins [J].
Fischer, D ;
Eisenberg, D .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1999, 9 (02) :208-211
[25]   Whole-genome analysis: annotations and updates [J].
Gaasterland, T ;
Oprea, M .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2001, 11 (03) :377-381
[26]   A structural census of the current population of protein sequences [J].
Gerstein, M ;
Levitt, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (22) :11911-11916
[27]   Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure [J].
Gough, J ;
Karplus, K ;
Hughey, R ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (04) :903-919
[28]   SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments [J].
Gough, J ;
Chothia, C .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :268-272
[29]   From complete genomes to measures of substitution rate variability within and between proteins [J].
Grishin, NV ;
Wolf, YI ;
Koonin, EV .
GENOME RESEARCH, 2000, 10 (07) :991-1000
[30]   Digging for dead genes:: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome [J].
Harrison, PM ;
Echols, N ;
Gerstein, MB .
NUCLEIC ACIDS RESEARCH, 2001, 29 (03) :818-830