Nature of the protein universe

被引:225
作者
Levitt, Michael [1 ]
机构
[1] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA
基金
美国国家卫生研究院;
关键词
domain architecture; protein sequence; protein structure; structural genomics; STRUCTURE PREDICTION; ALPHA-LACTALBUMIN; FAMILIES; SEQUENCES; DOMAINS; BIOLOGY; TASSER; PFAM;
D O I
10.1073/pnas.0905029106
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The protein universe is the set of all proteins of all organisms. Here, all currently known sequences are analyzed in terms of families that have single-domain or multidomain architectures and whether they have a known three-dimensional structure. Growth of new single-domain families is very slow: Almost all growth comes from new multidomain architectures that are combinations of domains characterized by approximate to 15,000 sequence profiles. Single-domain families are mostly shared by the major groups of organisms, whereas multidomain architectures are specific and account for species diversity. There are known structures for a quarter of the single-domain families, and > 70% of all sequences can be partially modeled thanks to their membership in these families.
引用
收藏
页码:11079 / 11084
页数:6
相关论文
共 39 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   The generation of new protein functions by the combination of domains [J].
Bashton, Matthew ;
Chothia, Cyrus .
STRUCTURE, 2007, 15 (01) :85-99
[4]   Automated server predictions in CASP7 [J].
Battey, James N. D. ;
Kopp, Jurgen ;
Bordoli, Lorenza ;
Read, Randy J. ;
Clarke, Neil D. ;
Schwede, Torsten .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :68-82
[5]   Domain rearrangements in protein evolution [J].
Björklund, ÅK ;
Ekman, D ;
Light, S ;
Frey-Skött, J ;
Elofsson, A .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 353 (04) :911-923
[6]   A POSSIBLE 3-DIMENSIONAL STRUCTURE OF BOVINE ALPHA-LACTALBUMIN BASED ON THAT OF HENS EGG-WHITE LYSOZYME [J].
BROWNE, WJ ;
NORTH, ACT ;
PHILLIPS, DC .
JOURNAL OF MOLECULAR BIOLOGY, 1969, 42 (01) :65-&
[7]   The impact of structural genomics: Expectations and outcomes [J].
Chandonia, JM ;
Brenner, SE .
SCIENCE, 2006, 311 (5759) :347-351
[8]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[9]   Structure prediction for CABP7 targets using extensive all-atom refinement with Rosetta@home [J].
Das, Rhiju ;
Bin Qian ;
Raman, Srivatsan ;
Vernon, Robert ;
Thompson, James ;
Bradley, Philip ;
Khare, Sagar ;
Tyka, Michael D. ;
Bhat, Divya ;
Chivian, Dylan ;
Kim, David E. ;
Sheffler, William H. ;
Malmstrom, Lars ;
Wollacott, Andrew M. ;
Wang, Chu ;
Andre, Ingemar ;
Baker, David .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :118-128
[10]   Hidden Markov models [J].
Eddy, SR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :361-365