On the hierarchical classification of G protein-coupled receptors

被引:66
作者
Davies, Matthew N. [1 ]
Secker, Andrew
Freitas, Alex A.
Mendao, Miguel
Timmis, Jon
Flower, Darren R.
机构
[1] Edward Jenner Inst, Newbury RG20 7NN, Berks, England
[2] Univ Kent, Ctr Biomed Informat, Dept Comp, Canterbury CT2 7NF, Kent, England
[3] Univ York, Dept Comp Sci & Elect, York YO10 5DD, N Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1093/bioinformatics/btm506
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: G protein-coupled receptors (GPCRs) play an important role in many physiological systems by transducing an extracellular signal into an intracellular response. Over 50 of all marketed drugs are targeted towards a GPCR. There is considerable interest in developing an algorithm that could effectively predict the function of a GPCR from its primary sequence. Such an algorithm is useful not only in identifying novel GPCR sequences but in characterizing the interrelationships between known GPCRs. Results: An alignment-free approach to GPCR classification has been developed using techniques drawn from data mining and proteochemometrics. A dataset of over 8000 sequences was constructed to train the algorithm. This represents one of the largest GPCR datasets currently available. A predictive algorithm was developed based upon the simplest reasonable numerical representation of the proteins physicochemical properties. A selective top-down approach was developed, which used a hierarchical classifier to assign sequences to subdivisions within the GPCR hierarchy. The predictive performance of the algorithm was assessed against several standard data mining classifiers and further validated against Support Vector Machine-based GPCR prediction servers. The selective top-down approach achieves significantly higher accuracy than standard data mining methods in almost all cases.
引用
收藏
页码:3113 / 3118
页数:6
相关论文
共 42 条
[1]  
[Anonymous], 2005, Data Mining Pratical Machine Learning Tools and Techniques
[2]   PRINTS and PRINTS-S shed light on protein ancestry [J].
Attwood, TK ;
Blythe, MJ ;
Flower, DR ;
Gaulton, A ;
Mabey, JE ;
Maudling, N ;
McGregor, L ;
Mitchell, AL ;
Moulton, G ;
Paine, K ;
Scordis, P .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :239-241
[3]   A compendium of specific motifs for diagnosing GPCR subtypes [J].
Attwood, TK .
TRENDS IN PHARMACOLOGICAL SCIENCES, 2001, 22 (04) :162-165
[4]   GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W143-W147
[5]   GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W383-W389
[6]   Conformational changes of G protein-coupled receptors during their activation by agonist binding [J].
Bissantz, C .
JOURNAL OF RECEPTORS AND SIGNAL TRANSDUCTION, 2003, 23 (2-3) :123-153
[7]  
Brownlee J, 2007, WEKA CLASSIFICATION
[8]   Evolution of secretin family GPCR members in the metazoa [J].
Cardoso, Joao C. R. ;
Pinto, Vanda C. ;
Vieira, Florbela A. ;
Clark, Melody S. ;
Power, Deborah M. .
BMC EVOLUTIONARY BIOLOGY, 2006, 6 (1)
[9]   G protein-coupled receptor allosterism and complexing [J].
Christopoulos, A ;
Kenakin, T .
PHARMACOLOGICAL REVIEWS, 2002, 54 (02) :323-374
[10]   Predicting gene function in Saccharomyces cerevisiae [J].
Clare, A. ;
King, R. D. .
BIOINFORMATICS, 2003, 19 :II42-II49