Beyond complete genomes: from sequence to structure and function

被引:119
作者
Koonin, EV [1 ]
Tatusov, RL [1 ]
Galperin, MY [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
关键词
D O I
10.1016/S0959-440X(98)80070-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Computer analysis of complete prokaryotic genomes shows that microbial proteins are in general highly conserved - similar to 70% of them contain ancient conserved regions. This allows us to delineate families of orthologs across a wide phylogenetic range and, in many cases, predict protein functions with considerable precision. Sequence database searches using newly developed, sensitive algorithms result in the unification of such orthologous families into larger superfamilies sharing common sequence motifs. For many of these superfamilies, prediction of the structural fold and specific amino acid residues involved in enzymatic catalysis is possible. Taken together, sequence and structure comparisons provide a powerful methodology that can successfully complement traditional experimental approaches.
引用
收藏
页码:355 / 363
页数:9
相关论文
共 60 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The catalytic domain of the P-type ATPase has the haloacid dehalogenase fold [J].
Aravind, L ;
Galperin, MY ;
Koonin, EV .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (04) :127-129
[3]   A novel family of predicted phosphoesterases includes Drosophila prune protein and bacterial RecJ exonuclease [J].
Aravind, L ;
Koonin, EV .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (01) :17-19
[4]   The PRINTS protein fingerprint database in its fifth year [J].
Attwood, TK ;
Beck, ME ;
Flower, DR ;
Scordis, P ;
Selley, JN .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :304-308
[5]   The PROSITE database, its status in 1997 [J].
Bairoch, A ;
Bucher, P ;
Hofmann, K .
NUCLEIC ACIDS RESEARCH, 1997, 25 (01) :217-221
[6]   The PIR-international Protein Sequence Database [J].
Barker, WC ;
Garavelli, JS ;
Haft, DH ;
Hunt, LT ;
Marzec, CR ;
Orcutt, BC ;
Srinivasarao, GY ;
Yeh, LSL ;
Ledley, RS ;
Mewes, HW ;
Pfeiffer, F ;
Tsugita, A .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :27-32
[7]   GenBank [J].
Benson, DA ;
Boguski, MS ;
Lipman, DJ ;
Ostell, J ;
Ouellette, BFF .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :1-7
[8]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[9]   Structure of a human lysosomal sulfatase [J].
Bond, CS ;
Clements, PR ;
Ashby, SJ ;
Collyer, CA ;
Harrop, SJ ;
Hopwood, JJ ;
Guss, JM .
STRUCTURE, 1997, 5 (02) :277-289
[10]   Protein sequence motifs [J].
Bork, P ;
Koonin, EV .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :366-376