Evaluation of local structure alphabets based on residue burial

被引:55
作者
Karchin, R
Cline, M
Karplus, K
机构
[1] Univ Calif San Francisco, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[2] Affymetrix Inc, Emeryville, CA USA
[3] Univ Calif Santa Cruz, Baskin Sch Engn, Ctr Biomol Sci & Engn, Santa Cruz, CA 95064 USA
关键词
protein structure prediction; fold recognition; local structure alphabet; solvent accessibility; neighborhood counts; residue burial; hidden Markov model; multitrack HMM;
D O I
10.1002/prot.20008
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Residue burial, which describes a protein residue's exposure to solvent and neighboring atoms, is key to protein structure prediction, modeling, and analysis. We assessed 21 alphabets representing residue burial, according to their predictability from amino acid sequence, conservation in structural alignments, and utility in one fold-recognition scenario. This follows upon our previous work in assessing nine representations of backbone geometry.(1) The alphabet found to be most effective overall has seven states and is based on a count of C-beta atoms within a 14 Angstrom-radius sphere centered at the C-beta of a residue of interest. When incorporated into a hidden Markov model (HMM), this alphabet gave us a 38% performance boost in fold recognition and 23% in alignment quality. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:508 / 518
页数:11
相关论文
共 51 条
  • [1] Protein structure prediction and structural genomics
    Baker, D
    Sali, A
    [J]. SCIENCE, 2001, 294 (5540) : 93 - 96
  • [2] BARRETT C, 2001, THESIS U CALIFORNIA
  • [3] Amino acid empirical contact energy definitions for fold recognition in the space of contact maps
    Berrera, M
    Molinari, H
    Fogolari, F
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [4] Betancourt MR, 1999, PROTEIN SCI, V8, P361
  • [5] HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins
    Bystroff, C
    Thorsson, V
    Baker, D
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 301 (01) : 173 - 190
  • [6] CHOTHIA C, 1984, ANNU REV BIOCHEM, V53, P537
  • [7] Predicting reliable regions in protein sequence alignments
    Cline, M
    Hughey, R
    Karplus, K
    [J]. BIOINFORMATICS, 2002, 18 (02) : 306 - 314
  • [8] Information-theoretic dissection of pairwise contact potentials
    Cline, MS
    Karplus, K
    Lathrop, RH
    Smith, TF
    Rogers, RG
    Haussler, D
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 49 (01) : 7 - 14
  • [9] Dean A M, 2000, Pac Symp Biocomput, P6
  • [10] DUNBRACK R, 2001, CULLING PDB RESOLUTI