Network enrichment analysis: extension of gene-set enrichment analysis to gene networks

被引:96
作者
Alexeyenko, Andrey [2 ,3 ]
Lee, Woojoo [4 ]
Pernemalm, Maria [3 ]
Guegan, Justin [5 ]
Dessen, Philippe [5 ]
Lazar, Vladimir [5 ]
Lehtio, Janne [3 ]
Pawitan, Yudi [1 ]
机构
[1] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[2] Royal Inst Technol, Sch Biotechnol, Stockholm, Sweden
[3] Sci Life Lab, Stockholm, Sweden
[4] Inha Univ, Dept Stat, Inchon, South Korea
[5] Inst Gustave Roussy, Villejuif, France
关键词
SOMATIC MUTATIONS; GENOME; EXPRESSION; CANCER; PATHWAYS;
D O I
10.1186/1471-2105-13-226
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Background: Gene-set enrichment analyses (GEA or GSEA) are commonly used for biological characterization of an experimental gene-set. This is done by finding known functional categories, such as pathways or Gene Ontology terms, that are over-represented in the experimental set; the assessment is based on an overlap statistic. Rich biological information in terms of gene interaction network is now widely available, but this topological information is not used by GEA, so there is a need for methods that exploit this type of information in high-throughput data analysis. Results: We developed a method of network enrichment analysis (NEA) that extends the overlap statistic in GEA to network links between genes in the experimental set and those in the functional categories. For the crucial step in statistical inference, we developed a fast network randomization algorithm in order to obtain the distribution of any network statistic under the null hypothesis of no association between an experimental gene-set and a functional category. We illustrate the NEA method using gene and protein expression data from a lung cancer study. Conclusions: The results indicate that the NEA method is more powerful than the traditional GEA, primarily because the relationships between gene sets were more strongly captured by network connectivity rather than by simple overlaps.
引用
收藏
页数:11
相关论文
共 32 条
[21]
Network-based analysis of affected biological processes in type 2 diabetes models [J].
Liu, Manway ;
Liberzon, Arthur ;
Kong, Sek Won ;
Lai, Weil R. ;
Park, Peter J. ;
Kohane, Isaac S. ;
Kasif, Simon .
PLOS GENETICS, 2007, 3 (06) :958-972
[22]
Specificity and stability in topology of protein networks [J].
Maslov, S ;
Sneppen, K .
SCIENCE, 2002, 296 (5569) :910-913
[24]
Assortative mixing in networks [J].
Newman, MEJ .
PHYSICAL REVIEW LETTERS, 2002, 89 (20)
[25]
Network-based Identification of Novel Cancer Genes [J].
Ostlund, Gabriel ;
Lindskog, Mats ;
Sonnhammer, Erik L. L. .
MOLECULAR & CELLULAR PROTEOMICS, 2010, 9 (04) :648-655
[26]
Bias in the estimation of false discovery rate in microarray studies [J].
Pawitan, Y ;
Murthy, KRK ;
Michiels, S ;
Ploner, A .
BIOINFORMATICS, 2005, 21 (20) :3865-3872
[27]
Cytoscape: A software environment for integrated models of biomolecular interaction networks [J].
Shannon, P ;
Markiel, A ;
Ozier, O ;
Baliga, NS ;
Wang, JT ;
Ramage, D ;
Amin, N ;
Schwikowski, B ;
Ideker, T .
GENOME RESEARCH, 2003, 13 (11) :2498-2504
[28]
Network Enrichment Analysis in Complex Experiments [J].
Shojaie, Ali ;
Michailidis, George .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2010, 9 (01)
[29]
Smyth GK., 2004, Statistical Applications in Genetics and Molecular Biology, P3, DOI [10.2202/1544-6115.1027, DOI 10.2202/1544-6115.1027]
[30]
How and when should interactome-derived clusters be used to predict functional modules and protein function? [J].
Song, Jimin ;
Singh, Mona .
BIOINFORMATICS, 2009, 25 (23) :3143-3150