Structural bioinformatics prediction of membrane-binding proteins

被引:44
作者
Bhardwaj, Nitin
Stahelin, Robert V.
Langlois, Robert E.
Cho, Wonhwa [1 ]
Lu, Hui
机构
[1] Univ Illinois, Dept Bioengn, Chicago, IL 60607 USA
[2] Univ Illinois, Dept Chem, Chicago, IL 60607 USA
基金
美国国家卫生研究院;
关键词
protein-membrane interactions; function annotation; support vector machines; peripheral proteins; protein function prediction;
D O I
10.1016/j.jmb.2006.03.039
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Membrane-binding peripheral proteins play important roles in many biological processes, including cell signaling and membrane trafficking. Unlike integral membrane proteins, these proteins bind the membrane mostly in a reversible manner. Since peripheral proteins do not have canonical transmembrane segments, it is difficult to identify them from their amino acid sequences. As a first step toward genome-scale identification of membrane-binding peripheral proteins, we built a kernel-based machine learning protocol. Key features of known membrane-binding proteins, including electrostatic properties and amino acid composition, were calculated from their amino acid sequences and tertiary structures, which were then incorporated into the support vector machine to perform the classification. A data set of 40 membrane-binding proteins and 230 non-membrane-binding proteins was used to construct and validate the protocol. Cross-validation and holdout evaluation of the protocol showed that the accuracy of the prediction reached up to 93.7% and 91.6%, respectively. The protocol was applied to the prediction of membrane-binding properties of four C2 domains from novel protein kinases C. Although these C2 domains have 50% sequence identity only one of them was predicted to bind the membrane, which was verified experimentally with surface plasmon resonance analysis. These results suggest that our protocol can be used for predicting membrane-binding properties of a wide variety of modular domains and may be further extended to genome-scale identification of membrane-binding peripheral proteins. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:486 / 495
页数:10
相关论文
共 50 条
[1]   Kernel-based machine learning protocol for predicting DNA-binding proteins [J].
Bhardwaj, N ;
Langlois, RE ;
Zhao, GJ ;
Lu, H .
NUCLEIC ACIDS RESEARCH, 2005, 33 (20) :6486-6493
[2]  
BHARDWAJ N, 2005, P 27 ANN INT C ENG M
[3]   The molecular basis of the differential subcellular localization of FYVE domains [J].
Blatner, NR ;
Stahelin, RV ;
Diraviyam, K ;
Hawkins, PT ;
Hong, WJ ;
Murray, D ;
Cho, WH .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (51) :53818-53827
[4]   Statistical analysis and prediction of protein-protein interfaces [J].
Bordner, AJ ;
Abagyan, R .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 60 (03) :353-366
[5]   ERM proteins and merlin: Integrators at the cell cortex [J].
Bretscher, A ;
Edwards, K ;
Fehon, RG .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2002, 3 (08) :586-599
[6]   CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].
BROOKS, BR ;
BRUCCOLERI, RE ;
OLAFSON, BD ;
STATES, DJ ;
SWAMINATHAN, S ;
KARPLUS, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) :187-217
[7]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[8]   Tubby proteins: The plot thickens [J].
Carroll, K ;
Gomez, C ;
Shapiro, L .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2004, 5 (01) :55-63
[9]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[10]   Membrane targeting by C1 and C2 domains [J].
Cho, W .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (35) :32407-32410