Identifying clusters of functionally related genes in genomes

被引:103
作者
Yi, Gangman
Sze, Sing-Hoi
Thon, Michael R.
机构
[1] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77845 USA
[2] Texas A&M Univ, Dept Biochem & Biophys, College Stn, TX 77845 USA
[3] Texas A&M Univ, Dept Plant Pathol & Microbiol, College Stn, TX 77845 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btl673
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: An increasing body of literature shows that genomes of eukaryotes can contain clusters of functionally related genes. Most approaches to identify gene clusters utilize microarray data or metabolic pathway databases to find groups of genes on chromosomes that are linked by common attributes. A generalized method that can find gene clusters regardless of the mechanism of origin would provide researchers with an unbiased method for finding clusters and studying the evolutionary forces that give rise to them. Results: We present an algorithm to identify gene clusters in eukaryotic genomes that utilizes functional categories defined in graph-based vocabularies such as the Gene Ontology (GO). Clusters identified in this manner need only have a common function and are not constrained by gene expression or other properties. We tested the algorithm by analyzing genomes of a representative set of species. We identified species-specific variation in percentage of clustered genes as well as in properties of gene clusters including size distribution and functional annotation. These properties may be diagnostic of the evolutionary forces that lead to the formation of gene clusters.
引用
收藏
页码:1053 / 1060
页数:8
相关论文
共 29 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]  
Blumenthal T, 1998, BIOESSAYS, V20, P480, DOI 10.1002/(SICI)1521-1878(199806)20:6&lt
[3]  
480::AID-BIES6&gt
[4]  
3.0.CO
[5]  
2-Q
[6]  
COOPER TG, 1996, MYCOTA, V3, P139
[7]  
Cormen T. H., 2001, Introduction to Algorithms, V2nd
[8]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584
[9]  
Fisher R.A., 1930, The Genetical Theory of Natural Selection, DOI 10.5962/bhl.title.27468
[10]  
HERBERT NA, 1975, NATURE, V254, P26