Detection of evolutionarily stable fragments of cellular pathways by hierarchical clustering of phyletic patterns

被引:47
作者
Glazko, GV
Mushegian, AR
机构
[1] Stowers Inst Med Res, Kansas City, MO 64110 USA
[2] Univ Kansas, Med Ctr, Dept Microbiol Mol Genet & Immunol, Kansas City, KS 66160 USA
来源
GENOME BIOLOGY | 2004年 / 5卷 / 05期
关键词
D O I
10.1186/gb-2004-5-5-r32
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Phyletic patterns denote the presence and absence of orthologous genes in completely sequenced genomes and are used to infer functional links between genes, on the assumption that genes involved in the same pathway or functional system are co-inherited by the same set of genomes. However, this basic premise has not been quantitatively tested, and the limits of applicability of the phyletic-pattern method remain unknown. Results: We characterized a hierarchy of 3,688 phyletic patterns encompassing more than 5,000 known protein-coding genes from 66 complete microbial genomes, using different distances, clustering algorithms, and measures of cluster quality. The most sensitive set of parameters recovered 223 clusters, each consisting of genes that belong to the same metabolic pathway or functional system. Fifty-six clusters included unexpected genes with plausible functional links to the rest of the cluster. Only a small percentage of known pathways and multiprotein complexes are co-inherited as one cluster; most are split into many clusters, indicating that gene loss and displacement has occurred in the evolution of most pathways. Conclusions: Phyletic patterns of functionally linked genes are perturbed by differential gains, losses and displacements of orthologous genes in different species, reflecting the high plasticity of microbial genomes. Groups of genes that are co-inherited can, however, be recovered by hierarchical clustering, and may represent elementary functional modules of cellular metabolism. The phyletic patterns approach alone can confidently predict the functional linkages for about 24% of the entire data set.
引用
收藏
页数:13
相关论文
共 49 条
[1]   Biosynthesis of vitamin B2 (riboflavin) [J].
Bacher, A ;
Eberhardt, S ;
Fischer, M ;
Kis, K ;
Richter, G .
ANNUAL REVIEW OF NUTRITION, 2000, 20 :153-167
[2]  
Clusters of Orthologous Groups of proteins (COGs), CLUSTERS ORTHOLOGOUS
[3]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[4]   Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages [J].
Date, SV ;
Marcotte, EM .
NATURE BIOTECHNOLOGY, 2003, 21 (09) :1055-1062
[5]   Comparisons and validation of statistical clustering techniques for microarray gene expression data [J].
Datta, S ;
Datta, S .
BIOINFORMATICS, 2003, 19 (04) :459-466
[6]   Diametrical clustering for identifying anti-correlated gene clusters [J].
Dhillon, IS ;
Marcotte, EM ;
Roshan, U .
BIOINFORMATICS, 2003, 19 (13) :1612-1619
[7]   Expanding protein universe and its origin from the biological Big Bang [J].
Dokholyan, NV ;
Shakhnovich, B ;
Shakhnovich, EI .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (22) :14132-14136
[8]   Biosynthesis of riboflavin: An unusual riboflavin synthase of Methanobacterium thermoautotrophicum [J].
Eberhardt, S ;
Korn, S ;
Lottspeich, F ;
Bacher, A .
JOURNAL OF BACTERIOLOGY, 1997, 179 (09) :2938-2943
[9]   Annotation of bacterial genomes using improved phylogenomic profiles [J].
Enault, F. ;
Suhre, K. ;
Abergel, C. ;
Poirot, O. ;
Claverie, J. -M. .
BIOINFORMATICS, 2003, 19 :i105-i107
[10]   Making ribosomes [J].
Fatica, A ;
Tollervey, D .
CURRENT OPINION IN CELL BIOLOGY, 2002, 14 (03) :313-318