Efficiency analysis of KNN and minimum distance-based classifiers in enzyme family prediction

被引:36
作者
Nasibov, Efendi [1 ]
Kandemir-Cavas, Cagin [1 ]
机构
[1] Dokuz Eylul Univ, Fac Arts & Sci, Dept Stat, TR-35160 Izmir, Turkey
关键词
Amino acid composition; Enzyme class; K-nearest neighbor; Minimum-distance classifier; AMINO-ACID-COMPOSITION; SUBCELLULAR LOCATION PREDICTION; PROTEIN-STRUCTURE; LOCALIZATION;
D O I
10.1016/j.compbiolchem.2009.09.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nearly all enzymes are proteins. They are the biological catalysts that accelerate the function of cellular reactions. Because of different characteristics of reaction tasks, they split into six classes: oxidoreductases (EC-1), transferases (EC-2), hydrolases (EC-3), lyases (EC-4), isomerases (EC-5), ligases (EC-6). Prediction of enzyme classes is of great importance in identifying which enzyme class is a member of a protein. Since the enzyme sequences increase day by day, contrary to experimental analysis in prediction of enzyme classes for a newly found enzyme sequence, providing from data mining techniques becomes very useful and time-saving. In this paper, two kinds of simple minimum distance-based classifier methods have been proposed. These methods and known K-nearest neighbor (KNN) classification algorithm have been performed in order to classify enzymes according to their amino acid composition. Performance measurements and elapsed time to execute algorithms have been compared. In addition, equality of two proposed approaches under special condition has been proved in order to be a guide for researchers. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:461 / 464
页数:4
相关论文
共 21 条
[1]  
Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.0.CO
[2]  
2-F
[3]   The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[4]  
Borro LC, 2006, GENET MOL RES, V5, P193
[5]   Predicting enzyme family classes by hybridizing gene product composition and pseudo-amino acid composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2005, 234 (01) :145-149
[6]   Relation between amino acid composition and cellular location of proteins [J].
Cedano, J ;
Aloy, P ;
PerezPons, JA ;
Querol, E .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) :594-600
[7]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[8]   Prediction of enzyme family classes [J].
Chou, KC ;
Elrod, DW .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :183-190
[9]   Protein subcellular location prediction [J].
Chou, KC ;
Elrod, DW .
PROTEIN ENGINEERING, 1999, 12 (02) :107-118
[10]   Predicting enzyme class from protein structure without alignments [J].
Dobson, PD ;
Doig, AJ .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 345 (01) :187-199