Predicting ligand-binding function in families of bacterial receptors

被引:22
作者
Johnson, JM
Church, GM
机构
[1] Harvard Univ, Sch Med, Grad Program Biophys, Boston, MA 02115 USA
[2] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
关键词
D O I
10.1073/pnas.050580897
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The three-dimensional fold of a new protein sequence can often be inferred directly from sequence homology to a protein of known structure. The function of a new protein sequence is more difficult to predict, however, since homologues can have different molecular and cellular functions, To develop and automate computational methods for determining molecular function, we have analyzed ligand-binding specificity in two related families of binding proteins. One of these families includes Escherichia coli lactose repressor and ribose-binding protein, and the other includes E, coli sulfate- and phosphate-binding proteins. These proteins have similar folds but varying specificity, binding many different small molecules, including mono- and disaccharides, purines, oxyanions, ferric iron, and polyamines. Starting from template structural alignments, alignments of over 90 sequences per family were generated by iterative database searches with hidden Markov models. Phylogenetic trees were made of full-length sequences and of subsets of residues lining the binding cleft, to determine whether subbranches of the trees correlate with ligand-binding preference. Automated analyses of residues in the binding pocket were also used to predict ligand-binding function for many uncharacterized database sequences and to identify specific side chain-ligand contacts in proteins without solved structures. Our results demonstrate the utility of anchoring functional annotation within a protein family context.
引用
收藏
页码:3965 / 3970
页数:6
相关论文
共 45 条
[31]  
LIVINGSTONE CD, 1993, COMPUT APPL BIOSCI, V9, P745
[32]   A combined algorithm for genome-wide prediction of protein function [J].
Marcotte, EM ;
Pellegrini, M ;
Thompson, MJ ;
Yeates, TO ;
Eisenberg, D .
NATURE, 1999, 402 (6757) :83-86
[33]   THE 2-A RESOLUTION STRUCTURE OF THE SULFATE-BINDING PROTEIN INVOLVED IN ACTIVE-TRANSPORT IN SALMONELLA-TYPHIMURIUM [J].
PFLUGRATH, JW ;
QUIOCHO, FA .
JOURNAL OF MOLECULAR BIOLOGY, 1988, 200 (01) :163-180
[34]   NOVEL STEREOSPECIFICITY OF THE L-ARABINOSE-BINDING PROTEIN [J].
QUIOCHO, FA ;
VYAS, NK .
NATURE, 1984, 310 (5976) :381-386
[35]   Supersites within superfolds. Binding site similarity in the absence of homology [J].
Russell, RB ;
Sasieni, PD ;
Sternberg, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 282 (04) :903-918
[36]   The X-ray structure of the PurR-guanine-purF operator complex reveals the contributions of complementary electrostatic surfaces and a water-mediated hydrogen bond to corepressor specificity and binding affinity [J].
Schumacher, MA ;
Glasfeld, A ;
Zalkin, H ;
Brennan, RG .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1997, 272 (36) :22648-22653
[37]   The challenges of genome sequence annotation or "the devil is in the details" [J].
Smith, TF ;
Zhang, XL .
NATURE BIOTECHNOLOGY, 1997, 15 (12) :1222-1223
[38]   Crystal structure of PotD, the primary receptor of the polyamine transport system in Escherichia coli [J].
Sugiyama, S ;
Vassylyev, DG ;
Matsushima, M ;
Kashiwagi, K ;
Igarashi, K ;
Morikawa, K .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1996, 271 (16) :9519-9525
[39]   Class-directed structure determination: Foundation for a protein structure initiative [J].
Terwilliger, TC ;
Waldo, G ;
Peat, TS ;
Newman, JM ;
Chu, K ;
Berendzen, J .
PROTEIN SCIENCE, 1998, 7 (09) :1851-1856
[40]   Protein-ligand interaction:: Grafting of the uridine-specific determinants from the CytR regulator of Salmonella typhimurium to Escherichia coli CytR [J].
Thomsen, LE ;
Pedersen, M ;
Norregaard-Madsen, M ;
Valentin-Hansen, P ;
Kallipolitis, BH .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 288 (01) :165-175