Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks

被引:303
作者
Wolfe, CJ
Kohane, IS
Butte, AJ
机构
[1] Childrens Hosp Informat Program, Boston, MA 02115 USA
[2] Harvard Mit Div Hlth Sci & Technol, Boston, MA 02115 USA
[3] Univ Hawaii Manoa, Hawaii Inst Geophys & Planetol, Honolulu, HI 96822 USA
[4] Stanford Univ, Sch Med, Dept Med & Pediat, Stanford, CA 94305 USA
关键词
D O I
10.1186/1471-2105-6-227
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional module, networks of coexpressed genes provide one framework for assigning gene function. This has informed the guilt-by-association (GBA) heuristic, widely invoked in functional genomics. Yet although the idea of GBA is accepted, the breadth of GBA applicability is uncertain. Results: We developed methods to systematically explore the breadth of GBA across a large and varied corpus of expression data to answer the following question: To what extent is the GBA heuristic broadly applicable to the transcriptome and conversely how broadly is GBA captured by a priori knowledge represented in the Gene Ontology ( GO)? Our study provides an investigation of the functional organization of five coexpression networks using data from three mammalian organisms. Our method calculates a probabilistic score between each gene and each Gene Ontology category that reflects coexpression enrichment of a GO module. For each GO category we use Receiver Operating Curves to assess whether these probabilistic scores reflect GBA. This methodology applied to five different coexpression networks demonstrates that the signature of guilt-by-association is ubiquitous and reproducible and that the GBA heuristic is broadly applicable across the population of nine hundred Gene Ontology categories. We also demonstrate the existence of highly reproducible patterns of coexpression between some pairs of GO categories. Conclusion: We conclude that GBA has universal value and that transcriptional control may be more modular than previously realized. Our analyses also suggest that methodologies combining coexpression measurements across multiple genes in a biologically-defined module can aid in characterizing gene function or in characterizing whether pairs of functions operate together.
引用
收藏
页数:10
相关论文
共 24 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Standardizing global gene expression analysis between laboratories and across platforms [J].
Bammler, T ;
Beyer, RP ;
Bhattacharya, S ;
Boorman, GA ;
Boyles, A ;
Bradford, BU ;
Bumgarner, RE ;
Bushel, PR ;
Chaturvedi, K ;
Choi, D ;
Cunningham, ML ;
Dengs, S ;
Dressman, HK ;
Fannin, RD ;
Farun, FM ;
Freedman, JH ;
Fry, RC ;
Harper, A ;
Humble, MC ;
Hurban, P ;
Kavanagh, TJ ;
Kaufmann, WK ;
Kerr, KF ;
Jing, L ;
Lapidus, JA ;
Lasarev, MR ;
Li, J ;
Li, YJ ;
Lobenhofer, EK ;
Lu, X ;
Malek, RL ;
Milton, S ;
Nagalla, SR ;
O'Malley, JP ;
Palmer, VS ;
Pattee, P ;
Paules, RS ;
Perou, CM ;
Phillips, K ;
Qin, LX ;
Qiu, Y ;
Quigley, SD ;
Rodland, M ;
Rusyn, I ;
Samson, LD ;
Schwartz, DA ;
Shi, Y ;
Shin, JL ;
Sieber, SO ;
Slifer, S .
NATURE METHODS, 2005, 2 (05) :351-356
[3]   Data analysis and integration: of steps and arrows [J].
Bittner, M ;
Meltzer, P ;
Trent, J .
NATURE GENETICS, 1999, 22 (03) :213-215
[4]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[5]  
Clare Amanda, 2002, In Silico Biology, V2, P511
[6]   Global functional profiling of gene expression [J].
Draghici, S ;
Khatri, P ;
Martins, RP ;
Ostermeier, GC ;
Krawetz, SA .
GENOMICS, 2003, 81 (02) :98-104
[7]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[8]   A probabilistic view of gene function [J].
Fraser, AG ;
Marcotte, EM .
NATURE GENETICS, 2004, 36 (06) :559-564
[9]   THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE [J].
HANLEY, JA ;
MCNEIL, BJ .
RADIOLOGY, 1982, 143 (01) :29-36
[10]   From molecular to modular cell biology [J].
Hartwell, LH ;
Hopfield, JJ ;
Leibler, S ;
Murray, AW .
NATURE, 1999, 402 (6761) :C47-C52