DbW: automatic update of a functional family-specific multiple alignment

Cited by: 4
Authors
Prigent, V [1]
Thierry, JC [1]
Poch, O [1]
Plewniak, F [1]
Affiliations
[1] ULP, INSERM, CNRS,Inst Genet & Biol Mol & Cellulaire, Lab Biol & Genom Struct, F-67404 Illkirch Graffenstaden, France
DOI
10.1093/bioinformatics/bti218
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Recent advances in gene sequencing have provided complete sequence information for a number of genomes and as a result the amount of data in the sequence databases is growing at an exponential rate. We introduce here a new program, DbW, to automate the update of a functional family-specific multiple alignment that tries to include relevant sequences. The program is based on the use of different sources of information: sequences and annotations in databases.
Results: The advantages of DbW are demonstrated using the 20 families of aminoacyl-tRNA synthetases, where DbW detects a maximum of homologous sequences in the Swiss-Prot and SPTREMBL databases. The global specificity of DbW in this test is 98.4% (1.6% of the sequences included in the alignment did not belong to the family according to their function), and the global sensitivity of DbW is estimated to be 95.2%. Thus, DbW provides a reliable basis for the many applications that rely on accurate multiple alignments, e.g. functional residue identification, 2D/3D structure prediction or homology modeling.
Pages: 1437-1442
Number of pages: 6