Connecting the protein structure universe by using sparse recurring fragments

被引:55
作者
Friedberg, I [1 ]
Godzik, A [1 ]
机构
[1] Burnham Inst, Program Bioinformat & Syst Biol, La Jolla, CA 92037 USA
关键词
D O I
10.1016/j.str.2005.05.009
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The quest to order and classify protein structures has lead to various classification schemes, focusing mostly on hierarchical relationships between structural domains. At the coarsest classification level, such schemes typically identify hundreds of types of fundamental units called folds. As a result, we picture protein structure space as a collection of isolated fold islands. It is obvious, however, that many protein folds share structural and functional commonalities. Locating those commonalities is important for our understanding of protein structure, function, and evolution. Here, we present an alternative view of the protein fold space, based on an interfold similarity measure that is related to the frequency of fragments shared between folds. In this view, protein structures form a complicated, crossconnected network with very interesting topology. We show that interfold similarity based on sequence/structure fragments correlates well with similarities of functions between protein populations in different folds.
引用
收藏
页码:1213 / 1224
页数:12
相关论文
共 33 条
[1]   The ASTRAL compendium for protein structure and sequence analysis [J].
Brenner, SE ;
Koehl, P ;
Levitt, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :254-256
[2]  
Camon Evelyn, 2004, In Silico Biology, V4, P5
[3]   Local feature frequency profile: A method to measure structural similarity in proteins [J].
Choi, IG ;
Kwon, J ;
Kim, SH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (11) :3797-3802
[4]   The structure of Desulfovibrio vulgaris rubrerythrin reveals a unique combination of rubredoxin-like FeS4 and ferritin-like diiron domains [J].
deMare, F ;
Kurtz, DM ;
Nordlund, P .
NATURE STRUCTURAL BIOLOGY, 1996, 3 (06) :539-546
[5]   Global properties of the mapping between local amino acid sequence and local structure in proteins [J].
Han, KF ;
Baker, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (12) :5814-5818
[6]   The Gene Ontology (GO) database and informatics resource [J].
Harris, MA ;
Clark, J ;
Ireland, A ;
Lomax, J ;
Ashburner, M ;
Foulger, R ;
Eilbeck, K ;
Lewis, S ;
Marshall, B ;
Mungall, C ;
Richter, J ;
Rubin, GM ;
Blake, JA ;
Bult, C ;
Dolan, M ;
Drabkin, H ;
Eppig, JT ;
Hill, DP ;
Ni, L ;
Ringwald, M ;
Balakrishnan, R ;
Cherry, JM ;
Christie, KR ;
Costanzo, MC ;
Dwight, SS ;
Engel, S ;
Fisk, DG ;
Hirschman, JE ;
Hong, EL ;
Nash, RS ;
Sethuraman, A ;
Theesfeld, CL ;
Botstein, D ;
Dolinski, K ;
Feierbach, B ;
Berardini, T ;
Mundodi, S ;
Rhee, SY ;
Apweiler, R ;
Barrell, D ;
Camon, E ;
Dimmer, E ;
Lee, V ;
Chisholm, R ;
Gaudet, P ;
Kibbe, W ;
Kishore, R ;
Schwarz, EM ;
Sternberg, P ;
Gwinn, M .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D258-D261
[7]   Quantifying the similarities within fold space [J].
Harrison, A ;
Pearl, F ;
Mott, R ;
Thornton, J ;
Orengo, C .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 323 (05) :909-926
[8]   Reducing the computational complexity of protein folding via fragment folding and assembly [J].
Haspel, N ;
Tsai, CJ ;
Wolfson, H ;
Nussinov, R .
PROTEIN SCIENCE, 2003, 12 (06) :1177-1187
[9]   A global representation of the protein fold space [J].
Hou, JT ;
Sims, GE ;
Zhang, C ;
Kim, SH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (05) :2386-2390
[10]  
Hubbard TJP, 1999, PROTEINS, P15