Conserved spatially interacting motifs of protein superfamilies: Application to fold recognition and function annotation of genome data

被引:17
作者
Bhaduri, A
Ravishankar, R
Sowdhamini, R
机构
[1] Univ Agr Sci Bangalore, Natl Ctr Biol Sci, Tata Inst Fundamental Res, Bangalore 560065, Karnataka, India
[2] Anna Univ, Ctr Biotechnol, Madras 600025, Tamil Nadu, India
关键词
distance relationship; BLAST; structure prediction; function prediction; sequence searches; genome analysis;
D O I
10.1002/prot.10638
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Limitations in techniques for the elucidation of protein function have led to an increasing gap between the annotated proteins and those encoded in a genome. The functional selection and three-dimensional structural constraints of proteins in nature often relate to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. We identify spatially interacting conserved regions, or motifs, within protein superfamilies that are critical for structure and/or function. A search in sequence databases using these descriptors as additional constraints is an approach to identifying putative additional members of superfamilies. Such constrained searches have been tested against proteins of known structure to demonstrate high percentage specificity (93) with a low error rate of 0.0004. This approach has been compared with other sensitive sequence search methods (e.g., PSI-BLAST, HMMsearch, and IMPALA). It has been extended to analyze the distribution of 11 superfamilies in 93 genomes, including the human genome. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:657 / 670
页数:14
相关论文
共 48 条
[1]   REFINEMENT OF RECOMBINANT ONCOMODULIN AT 1.30-ANGSTROM RESOLUTION [J].
AHMED, FR ;
ROSE, DR ;
EVANS, SV ;
PIPPY, ME ;
TO, R .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 230 (04) :1216-1224
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Structural and evolutionary relationships among protein tyrosine phosphatase domains [J].
Andersen, JN ;
Mortensen, OH ;
Peters, GH ;
Drake, PG ;
Iversen, LF ;
Olsen, OH ;
Jansen, PG ;
Andersen, HS ;
Tonks, NK ;
Moller, NPH .
MOLECULAR AND CELLULAR BIOLOGY, 2001, 21 (21) :7117-7136
[4]  
[Anonymous], ISMB
[5]   PRINTS - A PROTEIN MOTIF FINGERPRINT DATABASE [J].
ATTWOOD, TK ;
BECK, ME .
PROTEIN ENGINEERING, 1994, 7 (07) :841-848
[6]   PROSITE - A DICTIONARY OF SITES AND PATTERNS IN PROTEINS [J].
BAIROCH, A .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2241-2245
[7]   ION-PAIRS IN PROTEINS [J].
BARLOW, DJ ;
THORNTON, JM .
JOURNAL OF MOLECULAR BIOLOGY, 1983, 168 (04) :867-885
[8]   The Protein Data Bank [J].
Berman, HM ;
Battistuz, T ;
Bhat, TN ;
Bluhm, WF ;
Bourne, PE ;
Burkhardt, K ;
Iype, L ;
Jain, S ;
Fagan, P ;
Marvin, J ;
Padilla, D ;
Ravichandran, V ;
Schneider, B ;
Thanki, N ;
Weissig, H ;
Westbrook, JD ;
Zardecki, C .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2002, 58 :899-907
[9]   SPLASH: structural pattern localization analysis by sequential histograms [J].
Califano, A .
BIOINFORMATICS, 2000, 16 (04) :341-357
[10]   IDENTIFICATION OF IMPORTANT FUNCTIONAL ENVIRONS IN PROTEIN TERTIARY STRUCTURES FROM THE ANALYSIS OF RESIDUE VARIATION IN 3-D - APPLICATION TO CYTOCHROMES-C AND CARBOXYPEPTIDASE-A AND CARBOXYPEPTIDASE-B [J].
CARDLE, L ;
DUFTON, MJ .
PROTEIN ENGINEERING, 1994, 7 (12) :1423-1431