Prediction of disordered regions in proteins from position specific score matrices

被引:156
作者
Jones, DT [1 ]
Ward, JJ [1 ]
机构
[1] UCL, Dept Comp Sci, Bioinformat Unit, London WC1E 6BT, England
关键词
protein structure prediction; folding; disorder; neural networks; sequence analysis;
D O I
10.1002/prot.10528
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe here the results of using a neural network based method (DISOPRED) for predicting disordered regions in 55 proteins in the 5(th) CASP experiment. A set of 715 highly resolved proteins with regions of disorder was used to train the network. The inputs to the network were derived from sequence profiles generated by PSI-BLAST. A post-filter was applied to the output of the network to prevent regions being predicted as disordered in regions of confidently predicted alpha helix or beta sheet structure. The overall two-state prediction accuracy for the method is very high (90%) but this is highly skewed by the fact that most residues are observed to be ordered. The overall Matthews' correlation coefficient for the submitted predictions is 0.34, which gives a more realistic impression of the overall accuracy of the method, though still indicates significant predictive power. (C) 2003 Wiley-Liss, Inc.
引用
收藏
页码:573 / 578
页数:6
相关论文
共 10 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   Evolutionary rate heterogeneity in proteins with long disordered regions [J].
Brown, CJ ;
Takayama, S ;
Campen, AM ;
Vise, P ;
Marshall, TW ;
Oldfield, CJ ;
Williams, CJ ;
Dunker, AK .
JOURNAL OF MOLECULAR EVOLUTION, 2002, 55 (01) :104-110
[4]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202
[5]  
Li, 1999, Genome Inform Ser Workshop Genome Inform, V10, P30
[6]   COMPARISON OF PREDICTED AND OBSERVED SECONDARY STRUCTURE OF T4 PHAGE LYSOZYME [J].
MATTHEWS, BW .
BIOCHIMICA ET BIOPHYSICA ACTA, 1975, 405 (02) :442-451
[7]  
Romero P, 1997, 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, P90, DOI 10.1109/ICNN.1997.611643
[8]   PREDICTION OF PROTEIN SECONDARY STRUCTURE AT BETTER THAN 70-PERCENT ACCURACY [J].
ROST, B ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 232 (02) :584-599
[9]   FOLDING TRANSITION IN THE DNA-BINDING DOMAIN OF GCN4 ON SPECIFIC BINDING TO DNA [J].
WEISS, MA ;
ELLENBERGER, T ;
WOBBE, CR ;
LEE, JP ;
HARRISON, SC ;
STRUHL, K .
NATURE, 1990, 347 (6293) :575-578
[10]   Intrinsically unstructured proteins: Re-assessing the protein structure-function paradigm [J].
Wright, PE ;
Dyson, HJ .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 293 (02) :321-331