Beyond complete genomes: from sequence to structure and function

被引:119
作者
Koonin, EV [1 ]
Tatusov, RL [1 ]
Galperin, MY [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
关键词
D O I
10.1016/S0959-440X(98)80070-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Computer analysis of complete prokaryotic genomes shows that microbial proteins are in general highly conserved - similar to 70% of them contain ancient conserved regions. This allows us to delineate families of orthologs across a wide phylogenetic range and, in many cases, predict protein functions with considerable precision. Sequence database searches using newly developed, sensitive algorithms result in the unification of such orthologous families into larger superfamilies sharing common sequence motifs. For many of these superfamilies, prediction of the structural fold and specific amino acid residues involved in enzymatic catalysis is possible. Taken together, sequence and structure comparisons provide a powerful methodology that can successfully complement traditional experimental approaches.
引用
收藏
页码:355 / 363
页数:9
相关论文
共 60 条
[21]   Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium [J].
Fischer, D ;
Eisenberg, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (22) :11929-11934
[22]   USES FOR EVOLUTIONARY TREES [J].
FITCH, WM .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, 1995, 349 (1327) :93-102
[23]   DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS [J].
FITCH, WM .
SYSTEMATIC ZOOLOGY, 1970, 19 (02) :99-&
[24]   WHOLE-GENOME RANDOM SEQUENCING AND ASSEMBLY OF HAEMOPHILUS-INFLUENZAE RD [J].
FLEISCHMANN, RD ;
ADAMS, MD ;
WHITE, O ;
CLAYTON, RA ;
KIRKNESS, EF ;
KERLAVAGE, AR ;
BULT, CJ ;
TOMB, JF ;
DOUGHERTY, BA ;
MERRICK, JM ;
MCKENNEY, K ;
SUTTON, G ;
FITZHUGH, W ;
FIELDS, C ;
GOCAYNE, JD ;
SCOTT, J ;
SHIRLEY, R ;
LIU, LI ;
GLODEK, A ;
KELLEY, JM ;
WEIDMAN, JF ;
PHILLIPS, CA ;
SPRIGGS, T ;
HEDBLOM, E ;
COTTON, MD ;
UTTERBACK, TR ;
HANNA, MC ;
NGUYEN, DT ;
SAUDEK, DM ;
BRANDON, RC ;
FINE, LD ;
FRITCHMAN, JL ;
FUHRMANN, JL ;
GEOGHAGEN, NSM ;
GNEHM, CL ;
MCDONALD, LA ;
SMALL, KV ;
FRASER, CM ;
SMITH, HO ;
VENTER, JC .
SCIENCE, 1995, 269 (5223) :496-512
[25]   THE MINIMAL GENE COMPLEMENT OF MYCOPLASMA-GENITALIUM [J].
FRASER, CM ;
GOCAYNE, JD ;
WHITE, O ;
ADAMS, MD ;
CLAYTON, RA ;
FLEISCHMANN, RD ;
BULT, CJ ;
KERLAVAGE, AR ;
SUTTON, G ;
KELLEY, JM ;
FRITCHMAN, JL ;
WEIDMAN, JF ;
SMALL, KV ;
SANDUSKY, M ;
FUHRMANN, J ;
NGUYEN, D ;
UTTERBACK, TR ;
SAUDEK, DM ;
PHILLIPS, CA ;
MERRICK, JM ;
TOMB, JF ;
DOUGHERTY, BA ;
BOTT, KF ;
HU, PC ;
LUCIER, TS ;
PETERSON, SN ;
SMITH, HO ;
HUTCHISON, CA ;
VENTER, JC .
SCIENCE, 1995, 270 (5235) :397-403
[26]   Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi [J].
Fraser, CM ;
Casjens, S ;
Huang, WM ;
Sutton, GG ;
Clayton, R ;
Lathigra, R ;
White, O ;
Ketchum, KA ;
Dodson, R ;
Hickey, EK ;
Gwinn, M ;
Dougherty, B ;
Tomb, JF ;
Fleischmann, RD ;
Richardson, D ;
Peterson, J ;
Kerlavage, AR ;
Quackenbush, J ;
Salzberg, S ;
Hanson, M ;
vanVugt, R ;
Palmer, N ;
Adams, MD ;
Gocayne, J ;
Weidman, J ;
Utterback, T ;
Watthey, L ;
McDonald, L ;
Artiach, P ;
Bowman, C ;
Garland, S ;
Fujii, C ;
Cotton, MD ;
Horst, K ;
Roberts, K ;
Hatch, B ;
Smith, HO ;
Venter, JC .
NATURE, 1997, 390 (6660) :580-586
[27]   PEDANTic genome analysis [J].
Frishman, D ;
Mewes, HW .
TRENDS IN GENETICS, 1997, 13 (10) :415-416
[28]  
Galperin MY, 1997, PROTEIN SCI, V6, P2639
[29]  
GALPERIN MY, 1998, IN PRESS PROTEIN SCI, V7
[30]   Life with 6000 genes [J].
Goffeau, A ;
Barrell, BG ;
Bussey, H ;
Davis, RW ;
Dujon, B ;
Feldmann, H ;
Galibert, F ;
Hoheisel, JD ;
Jacq, C ;
Johnston, M ;
Louis, EJ ;
Mewes, HW ;
Murakami, Y ;
Philippsen, P ;
Tettelin, H ;
Oliver, SG .
SCIENCE, 1996, 274 (5287) :546-&