A graph-theoretic modeling on GO space for biological interpretation of gene clusters

Cited by: 57
Authors
Lee, SG
Hur, JU
Kim, YS
Affiliations
[1] ISTECH Inc, Bioinformat Unit, Goyang 411380, Gyunggido, South Korea
[2] Yonsei Univ, Coll Med, Canc Metastasis Res Ctr, Seoul 120752, South Korea
DOI
10.1093/bioinformatics/btg420
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: DNA microarray technologies make it possible to quantify transcription genome-wide in parallel, offering an opportunity to study complex biological phenomena systematically. Clustering methods have been useful tools for uncovering meaningful patterns hidden in gene expression data. However, because these mathematical techniques rely entirely on numerical expression values, they do not by themselves provide biologically relevant information about the clustering results.
Results: We present a novel methodology for the biological interpretation of gene clusters. Our graph-theoretic algorithm extracts the common biological attributes of the genes within a cluster or group of interest using a modified structure of the Gene Ontology (GO) called the GO tree. After genes are annotated with GO terms, the hierarchical nature of the terms is used to find the representative biological meanings of the gene clusters. In addition, the biological significance of gene clusters can be assessed quantitatively by defining a distance function on the GO tree. Our approach complements statistical clustering techniques: biological ontology lets us view clustering problems from a different perspective. We applied the algorithm to a well-known data set and obtained the biological features of its gene clusters, together with a quantitative biological assessment of clustering quality based on GO Biological Process.
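The abstract does not give the construction of the GO tree or the exact distance function, so the following is only a minimal sketch of the general idea under stated assumptions: the ontology is reduced to a tree stored as a child-to-parent map, the "representative" term of a cluster is taken to be the deepest ancestor shared by all annotations, and the distance between two terms is the number of edges on the path joining them. The term names, the `PARENT` map, and all function names are hypothetical and are not taken from the paper.

```python
from itertools import combinations

# Toy tree as a child -> parent map. Term names are illustrative only,
# not real GO identifiers; the root has no entry.
PARENT = {
    "metabolism": "biological_process",
    "cell_cycle": "biological_process",
    "dna_metabolism": "metabolism",
    "protein_metabolism": "metabolism",
    "dna_replication": "dna_metabolism",
    "dna_repair": "dna_metabolism",
    "mitosis": "cell_cycle",
}


def path_to_root(term):
    """Return the list of terms from `term` up to the root, inclusive."""
    path = [term]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def common_ancestor(terms):
    """Deepest term lying on every path to the root (assumed 'representative')."""
    paths = [path_to_root(t) for t in terms]
    shared = set(paths[0]).intersection(*paths[1:])
    # Walking up from the first term, the first shared node is the deepest one.
    return next(t for t in paths[0] if t in shared)


def tree_distance(a, b):
    """Number of edges between two terms (an assumed distance, not the paper's)."""
    ancestor = common_ancestor([a, b])
    return path_to_root(a).index(ancestor) + path_to_root(b).index(ancestor)


def mean_pairwise_distance(terms):
    """Average tree distance over all term pairs; lower means a tighter cluster."""
    pairs = list(combinations(terms, 2))
    if not pairs:
        return 0.0
    return sum(tree_distance(a, b) for a, b in pairs) / len(pairs)


tight = ["dna_replication", "dna_repair"]
loose = ["dna_replication", "mitosis"]
print(common_ancestor(tight), mean_pairwise_distance(tight))
print(common_ancestor(loose), mean_pairwise_distance(loose))
```

In this toy setting a cluster whose annotations share a deep ancestor gets a specific representative term and a small mean distance, while a cluster that only meets at the root gets an uninformative representative and a larger distance. Real GO is a directed acyclic graph in which a term can have several parents, which is presumably why the paper works with a modified tree structure; this sketch does not attempt to reproduce that modification.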
Pages: 381-388
Number of pages: 8