Extracting phylogenetic information from whole-genome sequencing projects: the lactic acid bacteria as a test case

被引:55
作者
Coenye, T [1 ]
Vandamme, P [1 ]
机构
[1] State Univ Ghent, Microbiol Lab, B-9000 Ghent, Belgium
来源
MICROBIOLOGY-SGM | 2003年 / 149卷
关键词
D O I
10.1099/mic.0.26515-0
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
The availability of an ever increasing number of complete genome sequences of diverse prokaryotic taxa has led to the introduction of novel approaches to infer phylogenetic relationships among bacteria. In the present study the sequences of the 16S rRNA gene and nine housekeeping genes were compared with the fraction of shared putative orthologous protein-encoding genes, conservation of gene order, dinucleotide relative abundance and codon usage among 11 genomes of species belonging to the lactic acid bacteria. In general there is a good correlation between the results obtained with various approaches, although it is clear that there is a stronger phylogenetic signal in some datasets than in others, and that different parameters have different taxonomic resolutions. It appears that trees based on different kinds of information derived from whole-genome sequencing projects do not provide much additional information about the phylogenetic relationships among bacterial taxa compared to more traditional alignment-based methods. Nevertheless, it is expected that the study of these novel forms of information will have its value in taxonomy, to determine which genes are shared, when genes or sets of genes were lost in evolutionary history, to detect the presence of horizontally transferred genes and/or confirm or enhance the phylogenetic signal derived from traditional methods. Although these conclusions are based on a relatively small dataset, they are largely in agreement with other studies and it is anticipated that similar trends will be observed when comparing other genomes.
引用
收藏
页码:3507 / 3517
页数:11
相关论文
共 65 条
[1]   Phylogenetic analysis of Gram-positive bacteria based on grpE, encoded by the dnaK operon [J].
Ahmad, S ;
Selvapandiyan, A ;
Bhatnagar, RK .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2000, 50 :1761-1766
[2]   Genome sequence of Streptococcus mutans UA159, a cariogenic dental pathogen [J].
Ajdic, D ;
McShan, WM ;
McLaughlin, RE ;
Savic, G ;
Chang, J ;
Carson, MB ;
Primeaux, C ;
Tian, RY ;
Kenton, S ;
Jia, HG ;
Lin, SP ;
Qian, YD ;
Li, SL ;
Zhu, H ;
Najar, F ;
Lai, HS ;
White, J ;
Roe, BA ;
Ferretti, JJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (22) :14434-14439
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   Codon usage in the Mycobacterium tuberculosis complex [J].
Andersson, SGE ;
Sharp, PM .
MICROBIOLOGY-UK, 1996, 142 :915-925
[5]   Evolutionary analysis by whole-genome comparisons [J].
Bansal, AK ;
Meyer, TE .
JOURNAL OF BACTERIOLOGY, 2002, 184 (08) :2260-2272
[6]   INTRAGENERIC STRUCTURE OF STREPTOCOCCUS BASED ON COMPARATIVE-ANALYSIS OF SMALL-SUBUNIT RIBOSOMAL-RNA SEQUENCES [J].
BENTLEY, RW ;
LEIGH, JA ;
COLLINS, MD .
INTERNATIONAL JOURNAL OF SYSTEMATIC BACTERIOLOGY, 1991, 41 (04) :487-494
[7]   Genome sequence of a serotype M3 strain of group A Streptococcus:: Phage-encoded toxins, the high-virulence phenotype, and clone emergence [J].
Beres, SB ;
Sylva, GL ;
Barbian, KD ;
Lei, BF ;
Hoff, JS ;
Mammarella, ND ;
Liu, MY ;
Smoot, JC ;
Porcella, SF ;
Parkins, LD ;
Campbell, DS ;
Smith, TM ;
McCormick, JK ;
Leung, DYM ;
Schlievert, PM ;
Musser, JM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (15) :10078-10083
[8]   The complete genome sequence of the lactic acid bacterium Lactococcus lactis ssp lactis IL1403 [J].
Bolotin, A ;
Wincker, P ;
Mauger, S ;
Jaillon, O ;
Malarme, K ;
Weissenbach, J ;
Ehrlich, SD ;
Sorokin, A .
GENOME RESEARCH, 2001, 11 (05) :731-753
[9]   Phylogenetic inferences from molecular sequences: Review and critique [J].
Brocchieri, L .
THEORETICAL POPULATION BIOLOGY, 2001, 59 (01) :27-40
[10]   Universal trees based on large combined protein sequence data sets [J].
Brown, JR ;
Douady, CJ ;
Italia, MJ ;
Marshall, WE ;
Stanhope, MJ .
NATURE GENETICS, 2001, 28 (03) :281-285