Large-scale prediction of Saccharomyces cerevisiae gene function using overlapping transcriptional clusters

被引:248
作者
Wu, LF
Hughes, TR
Davierwala, AP
Robinson, MD
Stoughton, R
Altschuler, SJ
机构
[1] Rosetta Inpharmat, Kirkland, WA USA
[2] Univ Toronto, Banting & Best Dept Med Res, Toronto, ON, Canada
关键词
D O I
10.1038/ng906
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome sequencing has led to the discovery of tens of thousands of potential new genes. Six years after the sequencing of the well-studied yeast Saccharomyces cerevisiae and the discovery that its genome encodes 6,000 predicted proteins, more than 2,000 have not yet been characterized experimentally, and determining their functions seems far from a trivial task. One crucial constraint is the generation of useful hypotheses about protein function. Using a new approach to interpret microarray data, we assign likely cellular functions with confidence values to these new yeast proteins. We perform extensive genome-wide validations of our predictions and offer visualization methods for exploration of the large numbers of functional predictions. We identify potential new members of many existing functional categories including 285 candidate proteins involved in transcription, processing and transport of non-coding RNA molecules. We present experimental validation confirming the involvement of several of these proteins in ribosomal RNA processing. Our methodology can be applied to a variety of genomics data types and organisms.
引用
收藏
页码:255 / 265
页数:11
相关论文
共 43 条
[11]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[12]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[13]  
Gari E, 1997, YEAST, V13, P837, DOI 10.1002/(SICI)1097-0061(199707)13:9<837::AID-YEA145>3.0.CO
[14]  
2-T
[15]   Bms1p, a novel GTP-binding protein, and the related Tsr1p are required for distinct steps of 40S ribosome biogenesis in yeast [J].
Gelperin, D ;
Horton, L ;
Beckman, J ;
Hensold, J ;
Lemmon, SK .
RNA, 2001, 7 (09) :1268-1283
[16]  
Goffeau A., 1996, SCIENCE, V274, p[546, 563]
[17]  
Goldstein DR, 2002, STAT SINICA, V12, P219
[18]  
Hartigan J. A., 1975, CLUSTERING ALGORITHM
[19]   Assessment of prediction accuracy of protein function from protein-protein interaction data [J].
Hishigaki, H ;
Nakai, K ;
Ono, T ;
Tanigami, A ;
Takagi, T .
YEAST, 2001, 18 (06) :523-531
[20]   The yeast proteome database (YPD): a model for the organization and presentation of genome-wide functional data [J].
Hodges, PE ;
McKee, AHZ ;
Davis, BP ;
Payne, WE ;
Garrels, JI .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :69-73