Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv

被引:435
作者
Camus, JC
Pryor, MJ
Médigue, C
Cole, ST
机构
[1] Inst Pasteur, Unite Genet Mol Bacterienne, F-75724 Paris, France
[2] Inst Pasteur, Paris, France
[3] Genoscope, UMR 8030, F-91006 Evry, France
来源
MICROBIOLOGY-SGM | 2002年 / 148卷
关键词
mycobacteria; tuberculosis; genomics;
D O I
10.1099/00221287-148-10-2967
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Original genome annotations need to be regularly updated if the information they contain is to remain accurate and relevant. Here the complete reannotation of the genome sequence of Mycobacterium tuberculosis strain H37Rv is presented almost 4 years after the first submission. Eighty-two new protein-coding sequences (COS) have been included and 22 of these have a predicted function. The majority were identified by manual or automated reanalysis of the genome and most of them were shorter than the 100 codon cutoff used in the initial genome analysis. The functional classification of 643 CDS has been changed based principally on recent sequence comparisons and new experimental data from the literature. More than 300 gene names and over 1000 targeted citations have been added and the lengths of 60 genes have been modified. Presently, it is possible to assign a function to 2058 proteins (52 % of the 3995 proteins predicted) and only 376 putative proteins share no homology with known proteins and thus could be unique to M. tuberculosis.
引用
收藏
页码:2967 / 2973
页数:7
相关论文
共 39 条
  • [1] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [2] Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2)
    Bentley, SD
    Chater, KF
    Cerdeño-Tárraga, AM
    Challis, GL
    Thomson, NR
    James, KD
    Harris, DE
    Quail, MA
    Kieser, H
    Harper, D
    Bateman, A
    Brown, S
    Chandra, G
    Chen, CW
    Collins, M
    Cronin, A
    Fraser, A
    Goble, A
    Hidalgo, J
    Hornsby, T
    Howarth, S
    Huang, CH
    Kieser, T
    Larke, L
    Murphy, L
    Oliver, K
    O'Neil, S
    Rabbinowitsch, E
    Rajandream, MA
    Rutherford, K
    Rutter, S
    Seeger, K
    Saunders, D
    Sharp, S
    Squares, R
    Squares, S
    Taylor, K
    Warren, T
    Wietzorrek, A
    Woodward, J
    Barrell, BG
    Parkhill, J
    Hopwood, DA
    [J]. NATURE, 2002, 417 (6885) : 141 - 147
  • [3] Comparison of the proteome of Mycobacterium tuberculosis strain H37Rv with clinical isolate CDC 1551
    Betts, JC
    Dodson, P
    Quan, S
    Lewis, AP
    Thomas, PJ
    Duncan, K
    McAdam, RA
    [J]. MICROBIOLOGY-SGM, 2000, 146 : 3205 - 3216
  • [4] Re-annotation of genome microbial CoDing-Sequences:: finding new genes and inaccurately annotated genes -: art. no. 5
    Bocs, S
    Danchin, A
    Médigue, C
    [J]. BMC BIOINFORMATICS, 2002, 3 (1)
  • [5] The ATP binding cassette (ABC) transport systems of Mycobacterium tuberculosis
    Braibant, M
    Gilot, P
    Content, J
    [J]. FEMS MICROBIOLOGY REVIEWS, 2000, 24 (04) : 449 - 467
  • [6] Massive gene decay in the leprosy bacillus
    Cole, ST
    Eiglmeier, K
    Parkhill, J
    James, KD
    Thomson, NR
    Wheeler, PR
    Honoré, N
    Garnier, T
    Churcher, C
    Harris, D
    Mungall, K
    Basham, D
    Brown, D
    Chillingworth, T
    Connor, R
    Davies, RM
    Devlin, K
    Duthoy, S
    Feltwell, T
    Fraser, A
    Hamlin, N
    Holroyd, S
    Hornsby, T
    Jagels, K
    Lacroix, C
    Maclean, J
    Moule, S
    Murphy, L
    Oliver, K
    Quail, MA
    Rajandream, MA
    Rutherford, KM
    Rutter, S
    Seeger, K
    Simon, S
    Simmonds, M
    Skelton, J
    Squares, R
    Squares, S
    Stevens, K
    Taylor, K
    Whitehead, S
    Woodward, JR
    Barrell, BG
    [J]. NATURE, 2001, 409 (6823) : 1007 - 1011
  • [7] Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
    Cole, ST
    Brosch, R
    Parkhill, J
    Garnier, T
    Churcher, C
    Harris, D
    Gordon, SV
    Eiglmeier, K
    Gas, S
    Barry, CE
    Tekaia, F
    Badcock, K
    Basham, D
    Brown, D
    Chillingworth, T
    Connor, R
    Davies, R
    Devlin, K
    Feltwell, T
    Gentles, S
    Hamlin, N
    Holroyd, S
    Hornby, T
    Jagels, K
    Krogh, A
    McLean, J
    Moule, S
    Murphy, L
    Oliver, K
    Osborne, J
    Quail, MA
    Rajandream, MA
    Rogers, J
    Rutter, S
    Seeger, K
    Skelton, J
    Squares, R
    Squares, S
    Sulston, JE
    Taylor, K
    Whitehead, S
    Barrell, BG
    [J]. NATURE, 1998, 393 (6685) : 537 - +
  • [8] Re-annotating the Mycoplasma pneumoniae genome sequence:: adding value, function and reading frames
    Dandekar, T
    Huynen, M
    Regula, JT
    Ueberle, B
    Zimmermann, CU
    Andrade, MA
    Doerks, T
    Sánchez-Pulido, L
    Snel, B
    Suyama, M
    Yuan, YP
    Herrmann, R
    Bork, P
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (17) : 3278 - 3288
  • [9] Eiglmeier K, 2001, LEPROSY REV, V72, P387
  • [10] The PROSITE database, its status in 2002
    Falquet, L
    Pagni, M
    Bucher, P
    Hulo, N
    Sigrist, CJA
    Hofmann, K
    Bairoch, A
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 235 - 238