The Molecular Signatures Database Hallmark Gene Set Collection

被引:8305
作者
Liberzon, Arthur [1 ]
Birger, Chet [1 ]
Thorvaldsdottir, Helga [1 ]
Ghandi, Mahmoud [1 ]
Mesirov, Jill P. [1 ,2 ,3 ]
Tamayo, Pablo [1 ,2 ,3 ]
机构
[1] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02142 USA
[2] Univ Calif San Diego, Dept Med, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Moores Canc Ctr, La Jolla, CA 92093 USA
关键词
gene expression; gene set enrichment analysis; gene sets;
D O I
10.1016/j.cels.2015.12.004
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark'' gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined'' gene set, derived from multiple "founder'' sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.
引用
收藏
页码:417 / 425
页数:9
相关论文
共 35 条
[1]
Targeting the TGFβ signalling pathway in disease [J].
Akhurst, Rosemary J. ;
Hata, Akiko .
NATURE REVIEWS DRUG DISCOVERY, 2012, 11 (10) :790-811
[2]
[Anonymous], 2010, I MATH STAT ONOGRAPH
[3]
[Anonymous], 1948, AM STAT
[4]
Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1 [J].
Barbie, David A. ;
Tamayo, Pablo ;
Boehm, Jesse S. ;
Kim, So Young ;
Moody, Susan E. ;
Dunn, Ian F. ;
Schinzel, Anna C. ;
Sandy, Peter ;
Meylan, Etienne ;
Scholl, Claudia ;
Froehling, Stefan ;
Chan, Edmond M. ;
Sos, Martin L. ;
Michel, Kathrin ;
Mermel, Craig ;
Silver, Serena J. ;
Weir, Barbara A. ;
Reiling, Jan H. ;
Sheng, Qing ;
Gupta, Piyush B. ;
Wadlow, Raymond C. ;
Le, Hanh ;
Hoersch, Sebastian ;
Wittner, Ben S. ;
Ramaswamy, Sridhar ;
Livingston, David M. ;
Sabatini, David M. ;
Meyerson, Matthew ;
Thomas, Roman K. ;
Lander, Eric S. ;
Mesirov, Jill P. ;
Root, David E. ;
Gilliland, D. Gary ;
Jacks, Tyler ;
Hahn, William C. .
NATURE, 2009, 462 (7269) :108-U122
[5]
The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity [J].
Barretina, Jordi ;
Caponigro, Giordano ;
Stransky, Nicolas ;
Venkatesan, Kavitha ;
Margolin, Adam A. ;
Kim, Sungjoon ;
Wilson, Christopher J. ;
Lehar, Joseph ;
Kryukov, Gregory V. ;
Sonkin, Dmitriy ;
Reddy, Anupama ;
Liu, Manway ;
Murray, Lauren ;
Berger, Michael F. ;
Monahan, John E. ;
Morais, Paula ;
Meltzer, Jodi ;
Korejwa, Adam ;
Jane-Valbuena, Judit ;
Mapa, Felipa A. ;
Thibault, Joseph ;
Bric-Furlong, Eva ;
Raman, Pichai ;
Shipway, Aaron ;
Engels, Ingo H. ;
Cheng, Jill ;
Yu, Guoying K. ;
Yu, Jianjun ;
Aspesi, Peter, Jr. ;
de Silva, Melanie ;
Jagtap, Kalpana ;
Jones, Michael D. ;
Wang, Li ;
Hatton, Charles ;
Palescandolo, Emanuele ;
Gupta, Supriya ;
Mahan, Scott ;
Sougnez, Carrie ;
Onofrio, Robert C. ;
Liefeld, Ted ;
MacConaill, Laura ;
Winckler, Wendy ;
Reich, Michael ;
Li, Nanxin ;
Mesirov, Jill P. ;
Gabriel, Stacey B. ;
Getz, Gad ;
Ardlie, Kristin ;
Chan, Vivien ;
Myer, Vic E. .
NATURE, 2012, 483 (7391) :603-607
[6]
NCBI GEO: mining tens of millions of expression profiles - database and tools update [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D760-D765
[7]
CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[8]
Metagenes and molecular pattern discovery using matrix factorization [J].
Brunet, JP ;
Tamayo, P ;
Golub, TR ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169
[9]
Integrative Genomic Analysis of Medulloblastoma Identifies a Molecular Subgroup That Drives Poor Clinical Outcome [J].
Cho, Yoon-Jae ;
Tsherniak, Aviad ;
Tamayo, Pablo ;
Santagata, Sandro ;
Ligon, Azra ;
Greulich, Heidi ;
Berhoukim, Rameen ;
Amani, Vladimir ;
Goumnerova, Liliana ;
Eberhart, Charles G. ;
Lau, Ching C. ;
Olson, James M. ;
Gilbertson, Richard J. ;
Gajjar, Amar ;
Delattre, Olivier ;
Kool, Marcel ;
Ligon, Keith ;
Meyerson, Matthew ;
Mesirov, Jill P. ;
Pomeroy, Scott L. .
JOURNAL OF CLINICAL ONCOLOGY, 2011, 29 (11) :1424-1430
[10]
The role of Stat5 transcription factors as tumor suppressors or oncogenes [J].
Ferbeyre, G. ;
Moriggl, R. .
BIOCHIMICA ET BIOPHYSICA ACTA-REVIEWS ON CANCER, 2011, 1815 (01) :104-114