Network enrichment analysis: extension of gene-set enrichment analysis to gene networks

被引:96
作者
Alexeyenko, Andrey [2 ,3 ]
Lee, Woojoo [4 ]
Pernemalm, Maria [3 ]
Guegan, Justin [5 ]
Dessen, Philippe [5 ]
Lazar, Vladimir [5 ]
Lehtio, Janne [3 ]
Pawitan, Yudi [1 ]
机构
[1] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[2] Royal Inst Technol, Sch Biotechnol, Stockholm, Sweden
[3] Sci Life Lab, Stockholm, Sweden
[4] Inha Univ, Dept Stat, Inchon, South Korea
[5] Inst Gustave Roussy, Villejuif, France
关键词
SOMATIC MUTATIONS; GENOME; EXPRESSION; CANCER; PATHWAYS;
D O I
10.1186/1471-2105-13-226
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Background: Gene-set enrichment analyses (GEA or GSEA) are commonly used for biological characterization of an experimental gene-set. This is done by finding known functional categories, such as pathways or Gene Ontology terms, that are over-represented in the experimental set; the assessment is based on an overlap statistic. Rich biological information in terms of gene interaction network is now widely available, but this topological information is not used by GEA, so there is a need for methods that exploit this type of information in high-throughput data analysis. Results: We developed a method of network enrichment analysis (NEA) that extends the overlap statistic in GEA to network links between genes in the experimental set and those in the functional categories. For the crucial step in statistical inference, we developed a fast network randomization algorithm in order to obtain the distribution of any network statistic under the null hypothesis of no association between an experimental gene-set and a functional category. We illustrate the NEA method using gene and protein expression data from a lung cancer study. Conclusions: The results indicate that the NEA method is more powerful than the traditional GEA, primarily because the relationships between gene sets were more strongly captured by network connectivity rather than by simple overlaps.
引用
收藏
页数:11
相关论文
共 32 条
[1]
Comparative study of gene set enrichment methods [J].
Abatangelo, Luca ;
Maglietta, Rosalia ;
Distaso, Angela ;
D'Addabbo, Annarita ;
Creanza, Teresa Maria ;
Mukherjee, Sayan ;
Ancona, Nicola .
BMC BIOINFORMATICS, 2009, 10 :275
[2]
Improved scoring of functional groups from gene expression data by decorrelating GO graph structure [J].
Alexa, Adrian ;
Rahnenfuehrer, Joerg ;
Lengauer, Thomas .
BIOINFORMATICS, 2006, 22 (13) :1600-1607
[3]
Dynamic Zebrafish Interactome Reveals Transcriptional Mechanisms of Dioxin Toxicity [J].
Alexeyenko, Andrey ;
Wassenberg, Deena M. ;
Lobenhofer, Edward K. ;
Yen, Jerry ;
Linney, Elwood ;
Sonnhammer, Erik L. L. ;
Meyer, Joel N. .
PLOS ONE, 2010, 5 (05)
[4]
Global networks of functional coupling in eukaryotes from comprehensive data integration [J].
Alexeyenko, Andrey ;
Sonnhammer, Erik L. L. .
GENOME RESEARCH, 2009, 19 (06) :1107-1116
[5]
GOing Bayesian: model-based gene set analysis of genome-scale data [J].
Bauer, Sebastian ;
Gagneur, Julien ;
Robinson, Peter N. .
NUCLEIC ACIDS RESEARCH, 2010, 38 (11) :3523-3532
[6]
Oncogenic pathway signatures in human cancers as a guide to targeted therapies [J].
Bild, AH ;
Yao, G ;
Chang, JT ;
Wang, QL ;
Potti, A ;
Chasse, D ;
Joshi, MB ;
Harpole, D ;
Lancaster, JM ;
Berchuck, A ;
Olson, JA ;
Marks, JR ;
Dressman, HK ;
West, M ;
Nevins, JR .
NATURE, 2006, 439 (7074) :353-357
[7]
Automated Network Analysis Identifies Core Pathways in Glioblastoma [J].
Cerami, Ethan ;
Demir, Emek ;
Schultz, Nikolaus ;
Taylor, Barry S. ;
Sander, Chris .
PLOS ONE, 2010, 5 (02)
[8]
Sequential Monte Carlo methods for statistical analysis of tables [J].
Chen, YG ;
Diaconis, P ;
Holmes, SR ;
Liu, JS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) :109-120
[9]
Prediction of protein function using protein-protein interaction data [J].
Deng, MH ;
Zhang, K ;
Mehta, S ;
Chen, T ;
Sun, FZ .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) :947-960
[10]
Somatic mutations affect key pathways in lung adenocarcinoma [J].
Ding, Li ;
Getz, Gad ;
Wheeler, David A. ;
Mardis, Elaine R. ;
McLellan, Michael D. ;
Cibulskis, Kristian ;
Sougnez, Carrie ;
Greulich, Heidi ;
Muzny, Donna M. ;
Morgan, Margaret B. ;
Fulton, Lucinda ;
Fulton, Robert S. ;
Zhang, Qunyuan ;
Wendl, Michael C. ;
Lawrence, Michael S. ;
Larson, David E. ;
Chen, Ken ;
Dooling, David J. ;
Sabo, Aniko ;
Hawes, Alicia C. ;
Shen, Hua ;
Jhangiani, Shalini N. ;
Lewis, Lora R. ;
Hall, Otis ;
Zhu, Yiming ;
Mathew, Tittu ;
Ren, Yanru ;
Yao, Jiqiang ;
Scherer, Steven E. ;
Clerc, Kerstin ;
Metcalf, Ginger A. ;
Ng, Brian ;
Milosavljevic, Aleksandar ;
Gonzalez-Garay, Manuel L. ;
Osborne, John R. ;
Meyer, Rick ;
Shi, Xiaoqi ;
Tang, Yuzhu ;
Koboldt, Daniel C. ;
Lin, Ling ;
Abbott, Rachel ;
Miner, Tracie L. ;
Pohl, Craig ;
Fewell, Ginger ;
Haipek, Carrie ;
Schmidt, Heather ;
Dunford-Shore, Brian H. ;
Kraja, Aldi ;
Crosby, Seth D. ;
Sawyer, Christopher S. .
NATURE, 2008, 455 (7216) :1069-1075