Probabilistic analysis of gene expression measurements from heterogeneous tissues

被引:64
作者
Erkkila, Timo [1 ,2 ]
Lehmusvaara, Saara [3 ,4 ]
Ruusuvuori, Pekka [1 ,2 ]
Visakorpi, Tapio [3 ,4 ]
Shmulevich, Ilya [1 ,2 ]
Lahdesmaki, Harri [1 ,5 ]
机构
[1] Tampere Univ Technol, Dept Signal Proc, FIN-33101 Tampere, Finland
[2] Inst Syst Biol, Seattle, WA USA
[3] Univ Tampere, Inst Med Technol, FIN-33101 Tampere, Finland
[4] Tampere Univ Hosp, Tampere, Finland
[5] Aalto Univ, Dept Informat & Comp Sci, FIN-02150 Espoo, Finland
基金
芬兰科学院; 美国国家卫生研究院;
关键词
MICROARRAY DATA; RNA-SEQ; PATTERNS; MICRODISSECTION; NORMALIZATION; STRATEGY; MODEL;
D O I
10.1093/bioinformatics/btq406
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Tissue heterogeneity, arising from multiple cell types, is a major confounding factor in experiments that focus on studying cell types, e. g. their expression profiles, in isolation. Although sample heterogeneity can be addressed by manual microdissection, prior to conducting experiments, computational treatment on heterogeneous measurements have become a reliable alternative to perform this microdissection in silico. Favoring computation over manual purification has its advantages, such as time consumption, measuring responses of multiple cell types simultaneously, keeping samples intact of external perturbations and unaltered yield of molecular content. Results: We formalize a probabilistic model, DSection, and show with simulations as well as with real microarray data that DSection attains increased modeling accuracy in terms of (i) estimating cell-type proportions of heterogeneous tissue samples, (ii) estimating replication variance and (iii) identifying differential expression across cell types under various experimental conditions. As our reference we use the corresponding linear regression model, which mirrors the performance of the majority of current non-probabilistic modeling approaches.
引用
收藏
页码:2571 / 2577
页数:7
相关论文
共 29 条
[1]
Deconvolution of Blood Microarray Data Identifies Cellular Activation Patterns in Systemic Lupus Erythematosus [J].
Abbas, Alexander R. ;
Wolslegel, Kristen ;
Seshasayee, Dhaya ;
Modrusan, Zora ;
Clark, Hilary F. .
PLOS ONE, 2009, 4 (07)
[2]
An introduction to MCMC for machine learning [J].
Andrieu, C ;
de Freitas, N ;
Doucet, A ;
Jordan, MI .
MACHINE LEARNING, 2003, 50 (1-2) :5-43
[3]
[Anonymous], 2021, Bayesian data analysis
[4]
Markov chain Monte Carlo convergence diagnostics: A comparative review [J].
Cowles, MK ;
Carlin, BP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (434) :883-904
[5]
ARE A SET OF MICROARRAYS INDEPENDENT OF EACH OTHER? [J].
Efron, Bradley .
ANNALS OF APPLIED STATISTICS, 2009, 3 (03) :922-942
[6]
Laser capture microdissection [J].
EmmertBuck, MR ;
Bonner, RF ;
Smith, PD ;
Chuaqui, RF ;
Zhuang, ZP ;
Goldstein, SR ;
Weiss, RA ;
Liotta, LA .
SCIENCE, 1996, 274 (5289) :998-1001
[7]
Prior distributions for variance parameters in hierarchical models(Comment on an Article by Browne and Draper) [J].
Gelman, Andrew .
BAYESIAN ANALYSIS, 2006, 1 (03) :515-533
[8]
Electronically subtracting expression patterns from a mixed cell population [J].
Gosink, Mark M. ;
Petrie, Howard T. ;
Tsinoremas, Nicholas F. .
BIOINFORMATICS, 2007, 23 (24) :3328-3334
[9]
Reversible jump Markov chain Monte Carlo computation and Bayesian model determination [J].
Green, PJ .
BIOMETRIKA, 1995, 82 (04) :711-732
[10]
Methodology article -: Robust computational reconstitution -: a new method for the comparative analysis of gene expression in tissues and isolated cell fractions [J].
Hoffmann, Martin ;
Pohlers, Dirk ;
Koczan, Dirk ;
Thiesen, Hans-Jurgen ;
Wolfl, Stefan ;
Kinne, Raimund W. .
BMC BIOINFORMATICS, 2006, 7 (1)