Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity

被引:48
作者
Diao, Y. [1 ,2 ]
Ma, D. [1 ]
Wen, Z. [1 ,2 ]
Yin, J. [1 ]
Xiang, J. [1 ]
Li, M. [1 ,2 ]
机构
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
[2] Sichuan Univ, State Key Lab Biotherapy, Chengdu 610064, Sichuan, Peoples R China
关键词
cellular automata; pseudo amino acid composition; Lempel-Ziv complexity; augmented covariant-discriminant algorithm; Chou's invariance theorem; transmembrane regions;
D O I
10.1007/s00726-007-0550-z
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Transmembrane (TM) proteins represent about 20-30% of the protein sequences in higher eukaryotes, playing important roles across a range of cellular functions. Moreover, knowledge about topology of these proteins often provides crucial hints toward their function. Due to the difficulties in experimental structure determinations of TM protein, theoretical prediction methods are highly preferred in identifying the topology of newly found ones according to their primary sequences, useful in both basic research and drug discovery. In this paper, based on the concept of pseudo amino acid composition (PseAA) that can incorporate sequence-order information of a protein sequence so as to remarkably enhance the power of discrete models (Chou, K. C., Proteins: Structure, Function, and Genetics, 2001, 43: 246-255), cellular automata and Lempel-Ziv complexity are introduced to predict the TM regions of integral membrane proteins including both alpha-helical and beta-barrel membrane proteins, validated by jackknife test. The result thus obtained is quite promising, which indicates that the current approach might be a quite potential high throughput tool in the post-genomic era. The source code and dataset are available for academic users at liml@scu.edu.cn.
引用
收藏
页码:111 / 117
页数:7
相关论文
共 70 条
[1]  
[Anonymous], 1993, Biol. Chem. Hoppe Seyler, DOI DOI 10.1515/BCHM3.1993.374.1-6.143
[2]   PRED-TMBB:: a web server for predicting the topology of β-barrel outer membrane proteins [J].
Bagos, PG ;
Liakopoulos, TD ;
Spyropoulos, IC ;
Hamodrakas, SJ .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W400-W404
[3]   Predicting enzyme family classes by hybridizing gene product composition and pseudo-amino acid composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2005, 234 (01) :145-149
[4]   Predicting 22 protein localizations in budding yeast [J].
Cai, YD ;
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 323 (02) :425-428
[5]   Predicting subcellular localization of proteins in a hybridization space [J].
Cai, YD ;
Chou, KC .
BIOINFORMATICS, 2004, 20 (07) :1151-1156
[6]   Enhanced recognition of protein transmembrane domains with prediction-based structural profiles [J].
Cao, BQ ;
Porollo, A ;
Adamczak, R ;
Jarrell, M ;
Meller, J .
BIOINFORMATICS, 2006, 22 (03) :303-309
[7]   Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[8]   Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network [J].
Chen, Chao ;
Zhou, Xibin ;
Tian, Yuanxin ;
Zou, Xiaoyong ;
Cai, Peixiang .
ANALYTICAL BIOCHEMISTRY, 2006, 357 (01) :116-121
[9]  
Chen Chien Peter, 2002, Appl Bioinformatics, V1, P21
[10]   Prediction of protein subcellular locations by incorporating quasi-sequence-order effect [J].
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2000, 278 (02) :477-483