Exploiting heterogeneous sequence properties improves prediction of protein disorder

Cited by: 433
Authors
Obradovic, Z
Peng, K
Vucetic, S
Radivojac, P
Dunker, AK
Affiliations
[1] Temple Univ, Ctr Informat Sci & Technol, Philadelphia, PA 19122 USA
[2] Indiana Univ, Sch Informat, Bloomington, IN USA
[3] Indiana Univ, Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46204 USA
Keywords
disorder prediction; intrinsically disordered; length-dependent predictors
DOI
10.1002/prot.20735
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
During the past few years we have investigated methods to improve predictors of intrinsically disordered regions longer than 30 consecutive residues. Experimental evidence, however, showed that these predictors were less successful on short disordered regions, as observed two years ago during the fifth Critical Assessment of Techniques for Protein Structure Prediction (CASP5). To address this shortcoming, we developed a two-level model called VSL1 (CASP6 id: 193-1). At the first level, VSL1 consists of two specialized predictors, one of which was optimized for long disordered regions (> 30 residues) and the other for short disordered regions (≤ 30 residues). At the second level, a meta-predictor was built to assign weights for combining the two first-level predictors. As the results of the CASP6 experiment showed, this new predictor has achieved the highest accuracy yet and significantly improved performance on short disordered regions, while maintaining high performance on long disordered regions.
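The two-level combination described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `combine_predictions`, the per-residue convex combination, and the example scores are all assumptions made for illustration. The abstract states only that a meta-predictor assigns weights for combining the long-region and short-region predictors.

```python
from typing import List, Sequence


def combine_predictions(
    long_scores: Sequence[float],
    short_scores: Sequence[float],
    long_weights: Sequence[float],
) -> List[float]:
    """Blend two per-residue disorder scores using per-residue weights.

    long_scores:  output of a (hypothetical) long-region predictor
    short_scores: output of a (hypothetical) short-region predictor
    long_weights: weight in [0, 1] given to the long-region predictor,
                  as a meta-predictor might supply (assumed form)
    """
    if not (len(long_scores) == len(short_scores) == len(long_weights)):
        raise ValueError("all inputs must have one value per residue")

    combined = []
    for p_long, p_short, w in zip(long_scores, short_scores, long_weights):
        if not 0.0 <= w <= 1.0:
            raise ValueError("weights must lie in [0, 1]")
        # Assumed convex combination: w favours the long-region predictor,
        # (1 - w) favours the short-region predictor.
        combined.append(w * p_long + (1.0 - w) * p_short)
    return combined


if __name__ == "__main__":
    # Made-up scores for a three-residue toy sequence.
    scores = combine_predictions([0.8, 0.2, 0.5], [0.4, 0.6, 0.5], [1.0, 0.0, 0.5])
    print(scores)
```

With a weight of 1 the result equals the long-region score, with a weight of 0 it equals the short-region score, and intermediate weights interpolate between the two.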
Pages: 176-182
Number of pages: 7