Function prediction for DNA-/RNA-binding proteins, GPCRs, and drug ADME-associated proteins by SVM

被引:3
作者
Cai, Congzhong [1 ]
Xiao, Hanguang [1 ,2 ]
Yuan, Qianfei [1 ]
Liu, Xinghua [1 ]
Wen, Yufeng [1 ]
机构
[1] Chongqing Univ, Dept Appl Phys, Chongqing 400044, Peoples R China
[2] Chongqing Inst Technol, Dept Appl Phys, Chongqing 400054, Peoples R China
关键词
protein function prediction; DNA-binding proteins; RNA-binding proteins; G-protein coupled receptors (GPCRs); drug absorption proteins; drug metabolizing enzymes; drug distribution and excretion proteins; support vector machine (SVM);
D O I
10.2174/092986608784567528
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This paper explores the use of support vector machine (SVM) for protein function prediction. Studies are conducted on several groups of proteins with different functions including DNA-binding proteins, RNA-binding proteins, G-protein coupled receptors, drug absorption proteins, drug metabolizing enzymes, drug distribution and excretion proteins. The computed accuracy for the prediction of these proteins is found to be in the range of 82.32% to 99.7%, which illustrates the potential of SVM in facilitating protein function prediction.
引用
收藏
页码:463 / 468
页数:6
相关论文
共 38 条
[1]  
[Anonymous], 1999, The Nature Statist. Learn. Theory
[2]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[3]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[4]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[5]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[6]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[7]   Enzyme family classification by support vector machines [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, YZ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (01) :66-76
[8]   Support vector machine classification of physical and biological datasets [J].
Cai, CZ ;
Wang, WL ;
Chen, YZ .
INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2003, 14 (05) :575-585
[9]   SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, X ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3692-3697
[10]   Prediction of protein structural classes by support vector machines [J].
Cai, YD ;
Liu, XJ ;
Xu, XB ;
Chou, KC .
COMPUTERS & CHEMISTRY, 2002, 26 (03) :293-296