Learning Dysregulated Pathways in Cancers from Differential Variability Analysis

被引:30
作者
Afsari, Bahman [1 ]
Geman, Donald [2 ]
Fertig, Elana [1 ]
机构
[1] Johns Hopkins Univ, Div Biostat & Bioinformat, Sidney Kimmel Comprehens Canc Ctr, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Dept Appl Math & Stat, Baltimore, MD USA
基金
美国国家科学基金会;
关键词
gene set analysis; gene expression; variability analysis; multivariate analysis;
D O I
10.4137/CIN.S14066
中图分类号
R73 [肿瘤学];
学科分类号
100214 [肿瘤学];
摘要
Analysis of gene sets can implicate activity in signaling pathways that is responsible for cancer initiation and progression, but is not discernible from the analysis of individual genes. Multiple methods and software packages have been developed to infer pathway activity from expression measurements for set of genes targeted by that pathway. Broadly, three major methodologies have been proposed: over-representation, enrichment, and differential variability. Both over-representation and enrichment analyses are effective techniques to infer differentially regulated pathways from gene sets with relatively consistent differentially expressed (DE) genes. Specifically, these algorithms aggregate statistics from each gene in the pathway. However, they overlook multivariate patterns related to gene interactions and variations in expression. Therefore, the analysis of differential variability of multigene expression patterns can be essential to pathway inference in cancers. The corresponding methodologies and software packages for such multivariate variability analysis of pathways are reviewed here. We also introduce a new, computationally efficient algorithm, expression variation analysis (EVA), which has been implemented along with a previously proposed algorithm, Differential Rank Conservation (DIRAC), in an open source R package, gene set regulation (GSReg). EVA inferred similar pathways as DIRAC at reduced computational costs. Moreover, EVA also inferred different dysregulated pathways than those identified by enrichment analysis.
引用
收藏
页码:61 / 67
页数:7
相关论文
共 51 条
[1]
Afsari B., 2013, THESIS
[2]
Afsari B, GENE SET BENCHMARK R
[3]
Afsari B, 2014, ANN APPL STAT
[4]
Afsari B, GSREG PACKAGE GENE S
[5]
FatiGO:: a web tool for finding significant associations of Gene Ontology terms with groups of genes [J].
Al-Shahrour, F ;
Díaz-Uriarte, R ;
Dopazo, J .
BIOINFORMATICS, 2004, 20 (04) :578-580
[6]
[Anonymous], 2019, TOPGO ENRICHMENT ANA
[7]
Significance analysis of functional categories in gene expression studies: a structured permutation approach [J].
Barry, WT ;
Nobel, AB ;
Wright, FA .
BIOINFORMATICS, 2005, 21 (09) :1943-1949
[8]
Pathway databases and tools for their exploitation: benefits, current limitations and challenges [J].
Bauer-Mehren, Anna ;
Furlong, Laura I. ;
Sanz, Ferran .
MOLECULAR SYSTEMS BIOLOGY, 2009, 5
[9]
GOstat: find statistically overrepresented Gene Ontologies within a group of genes [J].
Beissbarth, T ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (09) :1464-1465
[10]
CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300