Predicting intrinsic disorder from amino acid sequence

Cited by: 342
Authors
Obradovic, Z
Peng, K
Vucetic, S
Radivojac, P
Brown, CJ
Dunker, AK
Affiliations
[1] Temple Univ, Ctr Informat Sci & Technol, Philadelphia, PA 19122 USA
[2] Mol Kinet, Pullman, WA USA
[3] Washington State Univ, Sch Mol Biosci, Pullman, WA 99164 USA
Source
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 2003 / Vol. 53 / Issue 6
Keywords
natively unfolded; intrinsically disordered; neural networks; ordinary least squares regression; machine learning;
DOI
10.1002/prot.10532
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Blind predictions of intrinsic order and disorder were made on 42 proteins subsequently revealed to contain 9,044 ordered residues, 284 disordered residues in 26 segments of length 30 residues or less, and 281 disordered residues in 2 disordered segments of length greater than 30 residues. The accuracies of the six predictors used in this experiment ranged from 77% to 91% for the ordered regions and from 56% to 78% for the disordered segments. The average of the order and disorder predictions ranged from 73% to 77%. The prediction of disorder in the shorter segments was poor, from 25% to 66% correct, while the prediction of disorder in the longer segments was better, from 75% to 95% correct. Four of the predictors were composed of ensembles of neural networks. This enabled them to deal more efficiently with the large asymmetry in the training data through diversified sampling from the significantly larger ordered set and achieve better accuracy on ordered and long disordered regions. The exclusive use of long disordered regions for predictor training likely contributed to the disparity of the predictions on long versus short disordered regions, while averaging the output values over 61-residue windows to eliminate short predictions of order or disorder probably contributed to the even greater disparity for three of the predictors. This experiment supports the predictability of intrinsic disorder from amino acid sequence. (C) 2003 Wiley-Liss, Inc.
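The effect of window averaging on short versus long disordered segments, and the per-class accuracies quoted in the abstract, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the 0.5 decision threshold, the truncation of windows at sequence ends, and the toy score profiles are all assumptions made for illustration. Only the 61-residue window length and the idea of averaging the order and disorder accuracies come from the abstract.

```python
def smooth_scores(scores, window=61):
    """Centered moving average of per-residue disorder scores.

    Assumption: windows are truncated at the sequence ends rather than padded.
    """
    half = window // 2
    n = len(scores)
    smoothed = []
    for i in range(n):
        lo = max(0, i - half)
        hi = min(n, i + half + 1)
        smoothed.append(sum(scores[lo:hi]) / (hi - lo))
    return smoothed


def per_class_accuracy(labels, predictions):
    """Return (order accuracy, disorder accuracy, their mean).

    Labels and predictions use 1 for disordered and 0 for ordered residues.
    """
    n_order = sum(1 for y in labels if y == 0)
    n_disorder = sum(1 for y in labels if y == 1)
    if n_order == 0 or n_disorder == 0:
        raise ValueError("both classes must be present")
    hit_order = sum(1 for y, p in zip(labels, predictions) if y == 0 and p == 0)
    hit_disorder = sum(1 for y, p in zip(labels, predictions) if y == 1 and p == 1)
    order_acc = hit_order / n_order
    disorder_acc = hit_disorder / n_disorder
    return order_acc, disorder_acc, (order_acc + disorder_acc) / 2


def evaluate(labels, raw_scores, threshold=0.5):
    """Smooth the raw scores, threshold them, and score the result."""
    smoothed = smooth_scores(raw_scores)
    predictions = [1 if s >= threshold else 0 for s in smoothed]
    return per_class_accuracy(labels, predictions)


# Toy profiles (hypothetical): an idealized raw predictor that scores
# every disordered residue 1.0 and every ordered residue 0.0.
short_labels = [0] * 95 + [1] * 10 + [0] * 95    # 10-residue disordered segment
long_labels = [0] * 50 + [1] * 100 + [0] * 50    # 100-residue disordered segment

short_result = evaluate(short_labels, [float(y) for y in short_labels])
long_result = evaluate(long_labels, [float(y) for y in long_labels])
```

In this toy setting the 10-residue segment never lifts the 61-residue average above the threshold, so it is missed entirely, while the 100-residue segment survives smoothing. That is consistent with the abstract's suggestion that window averaging probably widened the gap between short and long disordered regions, although real predictor outputs are far noisier than these idealized scores.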
Pages: 566-572
Number of pages: 7