Proteome-wide classification and identification of mammalian-type GPCRs by binary topology pattern

被引:29
作者
Inoue, Y
Ikeda, M
Shimizu, J [1 ]
机构
[1] Hirosaki Univ, Fac Sci & Technol, Dept Elect & Informat Syst Engn, Hirosaki, Aomori 0368561, Japan
[2] Iwate Univ, United Grad Sch Agr Sci, Sci Bioresources Program, Morioka, Iwate 0208550, Japan
关键词
G protein-coupled receptor; binary topology pattern; length of loop domain; transmembrane topology; functional identification;
D O I
10.1016/j.compbiolchem.2003.11.003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
G protein-coupled receptors (GPCRs), a large eukaryotic protein family, have proved difficult to comprehensively detect and functionally identify by homology searches and domain detection, because they are highly divergent and their sequences share strikingly little similarity. Transmembrane (TM) topology pattern analysis has been used to classify TM proteins, and such patterns are conserved within GPCRs of similar function. Here, we developed a stepwise binary topology pattern (BTP) method for GPCR classification and identification and used it to identify and classify mammalian-type GPCRs in the genomes of 10 different eukaryotic species. A binary topology pattern was obtained for each functional class or group by assigning binary loop threshold lengths of '0' (short loop) or '1' (long loop). The GPCR-classification ability of the BTP method had quite high accuracies for classifying GPCR functions at the class level (Classes A, B, C, Frizzled/Smoothened, Non-GPCR, based on the GPCRDB classification scheme), with many classes being classified with 100% accuracy. Sufficiently high accuracies were also maintained at the functional group level, 0.945 over 15 functional groups. Proteome-wide mammalian-type GPCR searches in 10 eukaryotic genomes (H. sapiens, M. musculus, F rubripes, C. intestinalis, A. thaliana, D. melanogaster, A. gambiae, C. elegans, P falciparum, S. cerevisiae) using the BTP method showed much higher classification/identification in non-mammalian genomes than typical BLAST searches, in which a higher number of sequences were classified as Non-GPCR. This stepwise BTP method should prove useful for the identification and functional classification of GPCRs from the genomes of a wide range of species. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:39 / 49
页数:11
相关论文
共 32 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
[Anonymous], 1998, SCIENCE, V282, P2012
[4]   Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes [J].
Aparicio, S ;
Chapman, J ;
Stupka, E ;
Putnam, N ;
Chia, J ;
Dehal, P ;
Christoffels, A ;
Rash, S ;
Hoon, S ;
Smit, A ;
Gelpke, MDS ;
Roach, J ;
Oh, T ;
Ho, IY ;
Wong, M ;
Detter, C ;
Verhoef, F ;
Predki, P ;
Tay, A ;
Lucas, S ;
Richardson, P ;
Smith, SF ;
Clark, MS ;
Edwards, YJK ;
Doggett, N ;
Zharkikh, A ;
Tavtigian, SV ;
Pruss, D ;
Barnstead, M ;
Evans, C ;
Baden, H ;
Powell, J ;
Glusman, G ;
Rowen, L ;
Hood, L ;
Tan, YH ;
Elgar, G ;
Hawkins, T ;
Venkatesh, B ;
Rokhsar, D ;
Brenner, S .
SCIENCE, 2002, 297 (5585) :1301-1310
[5]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[6]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[7]   Identification of novel membrane proteins by searching for patterns in hydropathy profiles [J].
Clements, JD ;
Martin, RE .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (08) :2101-2107
[8]   A novel family of divergent seven-transmembrane proteins:: Candidate odorant receptors in Drosophila [J].
Clyne, PJ ;
Warr, CG ;
Freeman, MR ;
Lessing, D ;
Kim, JH ;
Carlson, JR .
NEURON, 1999, 22 (02) :327-338
[9]   The draft genome of Ciona intestinalis:: Insights into chordate and vertebrate origins [J].
Dehal, P ;
Satou, Y ;
Campbell, RK ;
Chapman, J ;
Degnan, B ;
De Tomaso, A ;
Davidson, B ;
Di Gregorio, A ;
Gelpke, M ;
Goodstein, DM ;
Harafuji, N ;
Hastings, KEM ;
Ho, I ;
Hotta, K ;
Huang, W ;
Kawashima, T ;
Lemaire, P ;
Martinez, D ;
Meinertzhagen, IA ;
Necula, S ;
Nonaka, M ;
Putnam, N ;
Rash, S ;
Saiga, H ;
Satake, M ;
Terry, A ;
Yamada, L ;
Wang, HG ;
Awazu, S ;
Azumi, K ;
Boore, J ;
Branno, M ;
Chin-bow, S ;
DeSantis, R ;
Doyle, S ;
Francino, P ;
Keys, DN ;
Haga, S ;
Hayashi, H ;
Hino, K ;
Imai, KS ;
Inaba, K ;
Kano, S ;
Kobayashi, K ;
Kobayashi, M ;
Lee, BI ;
Makabe, KW ;
Manohar, C ;
Matassi, G ;
Medina, M .
SCIENCE, 2002, 298 (5601) :2157-2167
[10]   Genome sequence of the human malaria parasite Plasmodium falciparum [J].
Gardner, MJ ;
Hall, N ;
Fung, E ;
White, O ;
Berriman, M ;
Hyman, RW ;
Carlton, JM ;
Pain, A ;
Nelson, KE ;
Bowman, S ;
Paulsen, IT ;
James, K ;
Eisen, JA ;
Rutherford, K ;
Salzberg, SL ;
Craig, A ;
Kyes, S ;
Chan, MS ;
Nene, V ;
Shallom, SJ ;
Suh, B ;
Peterson, J ;
Angiuoli, S ;
Pertea, M ;
Allen, J ;
Selengut, J ;
Haft, D ;
Mather, MW ;
Vaidya, AB ;
Martin, DMA ;
Fairlamb, AH ;
Fraunholz, MJ ;
Roos, DS ;
Ralph, SA ;
McFadden, GI ;
Cummings, LM ;
Subramanian, GM ;
Mungall, C ;
Venter, JC ;
Carucci, DJ ;
Hoffman, SL ;
Newbold, C ;
Davis, RW ;
Fraser, CM ;
Barrell, B .
NATURE, 2002, 419 (6906) :498-511