DNdisorder: predicting protein disorder using boosting and deep networks

Cited by: 62
Authors
Eickholt, Jesse [1]
Cheng, Jianlin [1, 2, 3]
Affiliations
[1] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
[2] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
[3] Univ Missouri, C Bond Life Sci Ctr, Columbia, MO 65211 USA
Source
BMC Bioinformatics | 2013 / Vol. 14
Keywords
Protein disorder prediction; Disordered regions; Deep networks; Deep learning; NATIVELY UNFOLDED PROTEINS; INTRINSIC DISORDER; UNSTRUCTURED REGIONS; ACCURATE PREDICTION; WEB SERVER; DEFINITION;
DOI
10.1186/1471-2105-14-88
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Background: A number of proteins contain regions that do not adopt a stable tertiary structure in their native state. Such regions, known as disordered regions, have been shown to participate in many vital cell functions and are increasingly being examined as drug targets. Results: This work presents a new sequence-based approach for the prediction of protein disorder. The method uses boosted ensembles of deep networks to make predictions and participated in the CASP10 experiment. In a 10-fold cross-validation procedure on a dataset of 723 proteins, the method achieved an average balanced accuracy of 0.82 and an area under the ROC curve of 0.90. These results are achieved in part by a boosting procedure that steadily increases balanced accuracy and the area under the ROC curve over several rounds. The method also compared competitively when evaluated against a number of state-of-the-art disorder predictors on the CASP9 and CASP10 benchmark datasets. Conclusions: DNdisorder is available as a web service at http://iris.rnet.missouri.edu/dndisorder/.
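The two evaluation metrics named in the abstract, balanced accuracy and area under the ROC curve, can be illustrated with a minimal Python sketch. This is not the authors' evaluation code: the function names are hypothetical, and the per-residue labels, scores, and the 0.5 decision threshold are made-up values chosen only to show the arithmetic (1 = disordered, 0 = ordered).

```python
from typing import Sequence


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of sensitivity and specificity for binary labels (1 = disordered)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)  # fraction of disordered residues recovered
    specificity = tn / (tn + fp)  # fraction of ordered residues recovered
    return 0.5 * (sensitivity + specificity)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Illustrative (made-up) per-residue labels and predicted disorder scores.
labels = [1, 1, 0, 0, 0, 1, 0, 0]
scores = [0.9, 0.4, 0.3, 0.1, 0.6, 0.8, 0.2, 0.5]
# Assumed decision threshold of 0.5 to turn scores into class predictions.
preds = [1 if s >= 0.5 else 0 for s in scores]

print(f"balanced accuracy: {balanced_accuracy(labels, preds):.3f}")
print(f"ROC AUC:           {roc_auc(labels, scores):.3f}")
```

Balanced accuracy is a natural choice here because disordered residues are typically a minority class, so plain accuracy would reward a predictor that labels everything as ordered. The AUC is threshold-independent, which is why the two numbers are usually reported together.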
Pages: 10