Low-redundancy sequencing of the entire Lactococcus lactis IL1403 genome

被引:124
作者
Bolotin, A [1 ]
Mauger, S [1 ]
Malarme, K [1 ]
Ehrlich, SD [1 ]
Sorokin, A [1 ]
机构
[1] INRA, F-78352 Jouy En Josas, France
来源
ANTONIE VAN LEEUWENHOEK INTERNATIONAL JOURNAL OF GENERAL AND MOLECULAR MICROBIOLOGY | 1999年 / 76卷 / 1-4期
关键词
D O I
10.1023/A:1002048720611
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Lactococcus lactis is an AT-rich gram positive bacterium phylogenetically close to the genus Streptococcus. Various strains of L. lactis are used in dairy industry as starters for cheese making. L. lactis is also one of the well characterized laboratory microorganisms, widely used for studies on physiology of lactic acid bacteria. We describe here a low redundancy sequence of the genome of the strain L. lactis IL1403. The strategy which we followed to determine the sequence consists of two main steps. First, a limited number of plasmids and lambda-phages that carry random segments of the genome were sequenced. Second, sequences of the inserts were used for production of novel sequencing templates by applying Multiplex Long Accurate PCR protocols. Using of these PCR products allowed to determine the sequence of the entire 2.35 Mb genome with a very low redundancy, close to 2. The error rate of the sequence is estimated to be below 1%. The correctness of the sequence assembly was confirmed by PCR amplification of the entire L. lactis IL1403 genome, using a set of 266 oligonucleotides. Anotation of the sequence was undertaken by using automatic gene prediction computer tools. This allowed to identify 1495 protein-encoding genes, to locate them on the genome map and to classify their functions on the basis of homology to known proteins. The function of about 700 genes expected to encode proteins that lack homologs in data bases cannot be reliably predicted in this way. The approach which we used eliminates high redundancy sequencing and mapping efforts, needed to obtain detailed and comprehensive genetic and physical maps of a bacterium. Availability of detailed genetic and physical maps of the L. lactis IL1403 genome provides many entries to study metabolism and physiology of bacteria from this group. The presence of 42 copies of five different IS elements in the IL1403 genome confirms the importance of these elements for genetic exchange in Lactococci. These include two previously unknown elements, present at seven and fifteen copies and designated IS1077 and IS983, respectively. Five potential or rudimentary prophages were identified in the genome by detecting clusters of phage-related genes. The metabolic and regulatory potential of L. lactis was evaluated by inspecting gene sets classified into different functional categories. L. lactis has the genetic potential to synthesise 20 standard amino acids, purine and pyrimidine nucleotides and at least four cofactors. Some of these metabolites, which are usually present in chemically defined media, can probably be omitted. About twenty compounds can be used by L. lactis as a sole carbon source. Some 83 regulators were revealed, indicating a regulatory potential close to that of Haemophilus influenzae, a bacterium with a similar genome size. Unexpectedly, L. lactis has a complete set of late competence genes, which may have concerted transcriptional regulation and unleadered polycistronic mRNAs. These findings open new possibilities for developing genetic tools, useful for studies of gene regulation in AT-rich gram positive bacteria and for engineering of new strains for the diary industry.
引用
收藏
页码:27 / 76
页数:50
相关论文
共 94 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] Sequence analysis and identification of the pyrKDbF operon from Lactococcus lactis including a novel gene, pyrK, involved in pyrimidine biosynthesis
    Andersen, PS
    Martinussen, J
    Hammer, K
    [J]. JOURNAL OF BACTERIOLOGY, 1996, 178 (16) : 5005 - 5012
  • [3] THE NUTRITIONAL REQUIREMENTS OF LACTIC STREPTOCOCCI ISOLATED FROM STARTER CULTURES .1. GROWTH IN A SYNTHETIC MEDIUM
    ANDERSON, AW
    ELLIKER, PR
    [J]. JOURNAL OF DAIRY SCIENCE, 1953, 36 (02) : 161 - 167
  • [4] The genome sequence of Rickettsia prowazekii and the origin of mitochondria
    Andersson, SGE
    Zomorodipour, A
    Andersson, JO
    Sicheritz-Pontén, T
    Alsmark, UCM
    Podowski, RM
    Näslund, AK
    Eriksson, AS
    Winkler, HH
    Kurland, CG
    [J]. NATURE, 1998, 396 (6707) : 133 - 140
  • [5] BGLR PROTEIN, WHICH BELONGS TO THE BGLG FAMILY OF TRANSCRIPTIONAL ANTITERMINATORS, IS INVOLVED IN BETA-GLUCOSIDE UTILIZATION IN LACTOCOCCUS-LACTIS
    BARDOWSKI, J
    EHRLICH, SD
    CHOPIN, A
    [J]. JOURNAL OF BACTERIOLOGY, 1994, 176 (18) : 5681 - 5685
  • [6] TRYPTOPHAN BIOSYNTHESIS GENES IN LACTOCOCCUS-LACTIS SUBSP LACTIS
    BARDOWSKI, J
    EHRLICH, SD
    CHOPIN, A
    [J]. JOURNAL OF BACTERIOLOGY, 1992, 174 (20) : 6563 - 6570
  • [7] BERESFORD T, 1991, FEMS MICROBIOL LETT, V78, P319, DOI [10.1016/S0378-1097(99)00377-8, 10.1111/j.1574-6968.1991.tb04463.x]
  • [8] Codon usage can explain CT-rich islands surrounding Chi sites on the Escherichia coli genome
    Biaudet, V
    El Karoui, M
    Gruss, A
    [J]. MOLECULAR MICROBIOLOGY, 1998, 29 (02) : 666 - 669
  • [9] A 7-BASE-PAIR SEQUENCE PROTECTS DNA FROM EXONUCLEOLYTIC DEGRADATION IN LACTOCOCCUS-LACTIS
    BISWAS, I
    MAGUIN, E
    EHRLICH, SD
    GRUSS, A
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (06) : 2244 - 2248
  • [10] SecDF of Bacillus subtilis, a molecular Siamese twin required for the efficient secretion of proteins
    Bolhuis, A
    Broekhuizen, CP
    Sorokin, A
    van Roosmalen, ML
    Venema, G
    Bron, S
    Quax, WJ
    van Dijl, JM
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (33) : 21217 - 21224