Beyond synexpression relationships: Local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions

Cited by: 122
Authors
Qian, J [1]
Dolled-Filhart, M [1]
Lin, J [1]
Yu, HY [1]
Gerstein, M [1]
Affiliations
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
Keywords
gene expression; local clustering; time-shifted; inverted; bioinformatics
DOI
10.1006/jmbi.2000.5219
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The complexity of biological systems provides for a great diversity of relationships between genes. The current analysis of whole-genome expression data focuses on relationships based on global correlation over a whole time-course, identifying clusters of genes whose expression levels simultaneously rise and fall. There are, of course, other potential relationships between genes, which are missed by such global clustering. These include activation, where one expects a time-delay between related expression profiles, and inhibition, where one expects an inverted relationship. Here, we propose a new method, which we call local clustering, for identifying these time-delayed and inverted relationships. It is related to conventional gene-expression clustering in a fashion analogous to the way local sequence alignment (the Smith-Waterman algorithm) is derived from global alignment (Needleman-Wunsch). An integral part of our method is the use of random score distributions to assess the statistical significance of each cluster. We applied our method to the yeast cell-cycle expression dataset and were able to detect a considerable number of additional biological relationships between genes, beyond those resulting from conventional correlation. We related these new relationships between genes to their similarity in function (as determined from the MIPS scheme) or their having known protein-protein interactions (as determined from the large-scale two-hybrid experiment); we found that genes strongly related by local clustering were considerably more likely than random to have a known interaction or a similar cellular role. This suggests that local clustering may be useful in functional annotation of uncharacterized genes. We examined many of the new relationships in detail. Some of them were already well-documented examples of inhibition or activation, which provide corroboration for our results. For instance, we found an inverted expression profile relationship between genes YME1 and YNT20, where the latter has been experimentally documented as a bypass suppressor of the former. We also found new relationships involving uncharacterized yeast genes and were able to suggest functions for many of them. In particular, we found a time-delayed expression relationship between J0544 (which has not yet been functionally characterized) and four genes associated with the mitochondria. This suggests that J0544 may be involved in the control or activation of mitochondrial genes. We have also looked at other, less extensive datasets than the yeast cell-cycle and found further interesting relationships. Our clustering program and a detailed website of clustering results are available at http://www.bioinfo.mbb.yale.edu/expression/cluster (or http://www.genecensus.org/expression/cluster). (C) 2001 Academic Press.
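The Smith-Waterman analogy in the abstract can be illustrated with a small dynamic-programming sketch. This is not the authors' program; it is one plausible reading of the idea, and the function name `local_cluster_score`, the standardization step, and the tie-breaking rule are all assumptions made for illustration. Two profiles are standardized, the product of every pair of time points is used as a match score, and running sums along each diagonal are reset to zero whenever they go negative (the local-alignment step). A second matrix accumulates negated products to capture inverted relationships. The significance assessment against random score distributions mentioned in the abstract is not shown.

```python
import numpy as np


def local_cluster_score(x, y):
    """Illustrative local match score for two equal-length expression profiles.

    Returns (score, relation, shift). A positive shift means the matched
    segment of y occurs later than the matched segment of x.
    Assumes neither profile is constant (non-zero standard deviation).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Standardize each profile to zero mean and unit variance (assumption).
    x = (x - x.mean()) / x.std()
    y = (y - y.mean()) / y.std()
    n = len(x)

    # Pairwise products: large and positive when both genes deviate the same way.
    m = np.outer(x, y)

    # pos accumulates correlated segments, neg accumulates anti-correlated ones.
    pos = np.zeros((n + 1, n + 1))
    neg = np.zeros((n + 1, n + 1))
    for i in range(1, n + 1):
        for j in range(1, n + 1):
            # Reset to zero when the running sum drops below zero,
            # as in Smith-Waterman local alignment (no gaps here).
            pos[i, j] = max(pos[i - 1, j - 1] + m[i - 1, j - 1], 0.0)
            neg[i, j] = max(neg[i - 1, j - 1] - m[i - 1, j - 1], 0.0)

    # Ties are resolved in favor of the non-inverted match (assumption).
    inverted = neg.max() > pos.max()
    table = neg if inverted else pos
    i, j = np.unravel_index(table.argmax(), table.shape)
    shift = int(j - i)

    if inverted:
        relation = "inverted" if shift == 0 else "inverted time-shifted"
    else:
        relation = "simultaneous" if shift == 0 else "time-shifted"
    return float(table[i, j]), relation, shift


# Usage: a pulse in x that reappears two time points later in y.
x = [0, 0, 1, 3, 1, 0, 0, 0, 0, 0]
y = [0, 0, 0, 0, 1, 3, 1, 0, 0, 0]
print(local_cluster_score(x, y))
```

In this sketch a match on the main diagonal corresponds to conventional simultaneous correlation, while an off-diagonal match corresponds to a time-delayed relationship of the kind the abstract associates with activation.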
Pages: 1053-1066
Page count: 14