Natively unstructured regions in proteins identified from contact predictions

被引:94
作者
Schlessinger, Avner
Punta, Marco
Rost, Burkhard
机构
[1] Columbia Univ, Dept Biochem & Mol Biophys, New York, NY 10027 USA
[2] Columbia Univ, Ctr Computat Biol & Bioinformat, NE Struct Genom Consortium, New York, NY USA
关键词
D O I
10.1093/bioinformatics/btm349
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Natively unstructured (also dubbed intrinsically disordered) regions in proteins lack a defined 3D structure under physiological conditions and often adopt regular structures under particular conditions. Proteins with such regions are overly abundant in eukaryotes, they may increase functional complexity of organisms and they usually evade structure determination in the unbound form. Low propensity for the formation of internal residue contacts has been previously used to predict natively unstructured regions. Results: We combined PROFcon predictions for protein-specific contacts with a generic pairwise potential to predict unstructured regions. This novel method, Ucon, outperformed the best available methods in predicting proteins with long unstructured regions. Furthermore, Ucon correctly identified cases missed by other methods. By computing the difference between predictions based on specific contacts (approach introduced here) and those based on generic potentials (realized in other methods), we might identify unstructured regions that are involved in protein-protein binding. We discussed one example to illustrate this ambitious aim. Overall, Ucon added quality and an orthogonal aspect that may help in the experimental study of unstructured regions in network hubs.
引用
收藏
页码:2376 / 2384
页数:9
相关论文
共 70 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
BORDOLI L, 2006, ASSESSMENT DISORDER
[3]   Accurate prediction of protein disordered regions by mining protein structure data [J].
Cheng, JL ;
Sweredoski, MJ ;
Baldi, P .
DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (03) :213-222
[4]  
Demchenko AP, 2001, J MOL RECOGNIT, V14, P42, DOI 10.1002/1099-1352(200101/02)14:1<42::AID-JMR518>3.0.CO
[5]  
2-8
[6]   Simple fold composition and modular architecture of the nuclear pore complex [J].
Devos, D ;
Dokudovskaya, S ;
Williams, R ;
Alber, F ;
Eswar, N ;
Chait, BT ;
Rout, MP ;
Sali, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (07) :2172-2177
[7]   Stabilization centers in proteins: Identification, characterization and predictions [J].
Dosztanyi, Z ;
Fiser, A ;
Simon, I .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 272 (04) :597-612
[8]   IUPred:: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content [J].
Dosztányi, Z ;
Csizmok, V ;
Tompa, P ;
Simon, I .
BIOINFORMATICS, 2005, 21 (16) :3433-3434
[9]   The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins [J].
Dosztányi, Z ;
Csizmók, V ;
Tompa, P ;
Simon, I .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 347 (04) :827-839
[10]  
Dunker A.K., 1998, PACIFIC S BIOCOMPUTI, P473