共 73 条
Prediction of mitochondrial proteins based on genetic algorithm - partial least squares and support vector machine
被引:41
作者:
Tan, F.
[1
]
Feng, X.
[1
]
Fang, Z.
[1
]
Li, M.
[1
]
Guo, Y.
[1
]
Jiang, L.
[1
]
机构:
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
来源:
关键词:
mitochondrial proteins;
dipeptide composition;
genetic algorithm-partial least square;
support vector machine;
D O I:
10.1007/s00726-006-0465-0
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
Mitochondria are essential cell organelles of eukaryotes. Hence, it is vitally important to develop an automated and reliable method for timely identification of novel mitochondrial proteins. In this study, mitochondrial proteins were encoded by dipeptide composition technology; then, the genetic algorithm-partial least square (GA-PLS) method was used to evaluate the dipeptide composition elements which are more important in recognizing mitochondrial proteins; further, these selected dipeptide composition elements were applied to support vector machine (SVM)-based classifiers to predict the mitochondrial proteins. All the models were trained and validated by the jackknife cross-validation test. The prediction accuracy is 85%, suggesting that it performs reasonably well in predicting the mitochondrial proteins. Our results strongly imply that not all the dipeptide compositions are informative and indispensable for predicting proteins. The source code of MATLAB and the dataset are available on request under liml@scu.edu.cn.
引用
收藏
页码:669 / 675
页数:7
相关论文