Automatic detection of conserved gene clusters in multiple genomes by graph comparison and P-quasi grouping

被引:37
作者
Fujibuchi, W
Ogata, H
Matsuda, H
Kanehisa, M [1 ]
机构
[1] Kyoto Univ, Inst Chem Res, Kyoto 6110011, Japan
[2] Osaka Univ, Grad Sch Engn Sci, Osaka 5608531, Japan
关键词
D O I
10.1093/nar/28.20.4029
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We previously reported two graph algorithms for analysis of genomic information: a graph Comparison algorithm to detect locally similar regions called correlated clusters and an algorithm to find a graph feature called P-quasi complete linkage. Based on these algorithms we have developed an automatic procedure to detect conserved gene clusters and align orthologous gene orders in multiple genomes. In the first step, the graph comparison is applied to pairwise genome comparisons, where the genome is considered as a one-dimensionally connected graph with genes as its nodes, and correlated clusters of genes that share sequence similarities are identified. In the next step, the P-quasi complete linkage analysis is applied to grouping of related clusters and conserved gene clusters in multiple genomes are identified, In the last step, orthologous relations of genes are established among each conserved cluster. We analyzed 17 completely sequenced microbial genomes and obtained 2313 clusters when the completeness parameter P was 40%, About one quarter contained at least two genes that appeared in the metabolic and regulatory pathways in the KEGG database. This collection of conserved gene clusters is used to refine and augment ortholog group tables in KEGG and also to define ortholog identifiers as an extension of EC numbers.
引用
收藏
页码:4029 / 4036
页数:8
相关论文
共 37 条
  • [21] Lawrence JG, 1996, GENETICS, V143, P1843
  • [22] Detecting protein function and protein-protein interactions from genome sequences
    Marcotte, EM
    Pellegrini, M
    Ng, HL
    Rice, DW
    Yeates, TO
    Eisenberg, D
    [J]. SCIENCE, 1999, 285 (5428) : 751 - 753
  • [23] Classifying molecular sequences using a linkage graph with their pairwise similarities
    Matsuda, H
    Ishihara, T
    Hashimoto, A
    [J]. THEORETICAL COMPUTER SCIENCE, 1999, 210 (02) : 305 - 325
  • [24] A heuristic graph comparison algorithm and its application to detect functionally related enzyme clusters
    Ogata, H
    Fujibuchi, W
    Goto, S
    Kanehisa, M
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (20) : 4021 - 4028
  • [25] The use of gene clusters to infer functional coupling
    Overbeek, R
    Fonstein, M
    D'Souza, M
    Pusch, GD
    Maltsev, N
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (06) : 2896 - 2901
  • [26] Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA
    Siefert, JL
    Martin, KA
    Abdi, F
    Widger, WR
    Fox, GE
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1997, 45 (05) : 467 - 472
  • [27] Complete genome sequence of Methanobacterium thermoautotrophicum Delta H: Functional analysis and comparative genomics
    Smith, DR
    DoucetteStamm, LA
    Deloughery, C
    Lee, HM
    Dubois, J
    Aldredge, T
    Bashirzadeh, R
    Blakely, D
    Cook, R
    Gilbert, K
    Harrison, D
    Hoang, L
    Keagle, P
    Lumm, W
    Pothier, B
    Qiu, DY
    Spadafora, R
    Vicaire, R
    Wang, Y
    Wierzbowski, J
    Gibson, R
    Jiwani, N
    Caruso, A
    Bush, D
    Safer, H
    Patwell, D
    Prabhakar, S
    McDougall, S
    Shimer, G
    Goyal, A
    Pietrokovski, S
    Church, GM
    Daniels, CJ
    Mao, JI
    Rice, P
    Nolling, J
    Reeve, JN
    [J]. JOURNAL OF BACTERIOLOGY, 1997, 179 (22) : 7135 - 7155
  • [28] IDENTIFICATION OF COMMON MOLECULAR SUBSEQUENCES
    SMITH, TF
    WATERMAN, MS
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1981, 147 (01) : 195 - 197
  • [29] Genome sequence of an obligate intracellular pathogen of humans:: Chlamydia trachomatis
    Stephens, RS
    Kalman, S
    Lammel, C
    Fan, J
    Marathe, R
    Aravind, L
    Mitchell, W
    Olinger, L
    Tatusov, RL
    Zhao, QX
    Koonin, EV
    Davis, RW
    [J]. SCIENCE, 1998, 282 (5389) : 754 - 759
  • [30] Conserved clusters of functionally related genes in two bacterial genomes
    Tamames, J
    Casari, G
    Ouzounis, C
    Valencia, A
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1997, 44 (01) : 66 - 73