Protein families and their evolution - A structural perspective

被引:217
作者
Orengo, CA
Thornton, JM
机构
[1] UCL, Dept Biochem & Mol Biol, London WC1E 6BT, England
[2] European Bioinformat Inst, Cambridge CB10 1SD, England
关键词
protein classifications; comparative genomics; bioinformatics;
D O I
10.1146/annurev.biochem.74.082803.133029
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We can now assign about two thirds of the sequences from completed genomes to as few as 1400 domain families for which structures are known and thus more ancient evolutionary relationships established. About 200 of these domain families are common to all kingdoms of life and account for nearly 50% of domain structure annotations in the genomes. Some of these domain families have been very extensively duplicated within a genome and combined with different domain partners giving rise to different multidomain proteins. The ways in which these domain combinations evolve tend to be specific to the organism so that less than 15% of the protein families found within a genome appear to be common to all kingdoms of life. Recent analyses of completed genomes, exploiting the structural data, have revealed the extent to which duplication of these domains and modifications of their functions can expand the functional repertoire of the organism, contributing to increasing complexity.
引用
收藏
页码:867 / 900
页数:34
相关论文
共 100 条
[1]  
ABELN S, 2004, I STRUCT MOL BIOL JU
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Evolution of enzymes in metabolism: A network perspective [J].
Alves, R ;
Chaleil, RAG ;
Sternberg, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 320 (04) :751-770
[4]   SCOP database in 2004: refinements integrate structure and sequence family data [J].
Andreeva, A ;
Howorth, D ;
Brenner, SE ;
Hubbard, TJP ;
Chothia, C ;
Murzin, AG .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D226-D229
[5]  
[Anonymous], 1965, BIOGRAPHICAL SKETCH
[6]  
[Anonymous], P 5 INT C MOL STRUCT
[7]   Domain combinations in archaeal, eubacterial and eukaryotic proteomes [J].
Apic, G ;
Gough, J ;
Teichmann, SA .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) :311-325
[8]   PRINTS and its automatic supplement, prePRINTS [J].
Attwood, TK ;
Bradley, P ;
Flower, DR ;
Gaulton, A ;
Maudling, N ;
Mitchell, AL ;
Moulton, G ;
Nordle, A ;
Paine, K ;
Taylor, P ;
Uddin, A ;
Zygouri, C .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :400-402
[9]   Structure and evolution of transcriptional regulatory networks [J].
Babu, MM ;
Luscombe, NM ;
Aravind, L ;
Gerstein, M ;
Teichmann, SA .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2004, 14 (03) :283-291
[10]   The geometry of domain combination in proteins [J].
Bashton, M ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 315 (04) :927-939