DeMix: deconvolution for mixed cancer transcriptomes using raw measured data

被引:84
作者
Ahn, Jaeil [1 ,2 ]
Yuan, Ying [2 ]
Parmigiani, Giovanni [3 ]
Suraokar, Milind B. [4 ]
Diao, Lixia [2 ]
Wistuba, Ignacio I. [4 ]
Wang, Wenyi [1 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
[4] Univ Texas MD Anderson Canc Ctr, Dept Translat Mol Pathol Thorac Head & Neck Med O, Houston, TX 77030 USA
关键词
EXPRESSION DECONVOLUTION; GENE-EXPRESSION; PATTERNS; NORMALIZATION; EXPLORATION;
D O I
10.1093/bioinformatics/btt301
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Tissue samples of tumor cells mixed with stromal cells cause underdetection of gene expression signatures associated with cancer prognosis or response to treatment. In silico dissection of mixed cell samples is essential for analyzing expression data generated in cancer studies. Currently, a systematic approach is lacking to address three challenges in computational deconvolution: (i) violation of linear addition of expression levels from multiple tissues when logtransformed microarray data are used; (ii) estimation of both tumor proportion and tumor-specific expression, when neither is known a priori; and (iii) estimation of expression profiles for individual patients. Results: We have developed a statistical method for deconvolving mixed cancer transcriptomes, DeMix, which addresses the aforementioned issues in array-based expression data. We demonstrate the performance of our model in synthetic and real, publicly available, datasets. DeMix can be applied to ongoing biomarker-based clinical studies and to the vast expression datasets previously generated from mixed tumor and stromal cell samples.
引用
收藏
页码:1865 / 1871
页数:7
相关论文
共 22 条
[1]
Deconvolution of Blood Microarray Data Identifies Cellular Activation Patterns in Systemic Lupus Erythematosus [J].
Abbas, Alexander R. ;
Wolslegel, Kristen ;
Seshasayee, Dhaya ;
Modrusan, Zora ;
Clark, Hilary F. .
PLOS ONE, 2009, 4 (07)
[2]
[Anonymous], 2009, R LANG ENV STAT COMP
[3]
Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data [J].
Carvalho, Benilton ;
Bengtsson, Henrik ;
Speed, Terence P. ;
Irizarry, Rafael A. .
BIOSTATISTICS, 2007, 8 (02) :485-499
[4]
Statistical expression deconvolution from mixed tissue samples [J].
Clarke, Jennifer ;
Seo, Pearl ;
Clarke, Bertrand .
BIOINFORMATICS, 2010, 26 (08) :1043-1049
[6]
Laser capture microdissection [J].
EmmertBuck, MR ;
Bonner, RF ;
Smith, PD ;
Chuaqui, RF ;
Zhuang, ZP ;
Goldstein, SR ;
Weiss, RA ;
Liotta, LA .
SCIENCE, 1996, 274 (5289) :998-1001
[7]
Probabilistic analysis of gene expression measurements from heterogeneous tissues [J].
Erkkila, Timo ;
Lehmusvaara, Saara ;
Ruusuvuori, Pekka ;
Visakorpi, Tapio ;
Shmulevich, Ilya ;
Lahdesmaki, Harri .
BIOINFORMATICS, 2010, 26 (20) :2571-2577
[9]
Electronically subtracting expression patterns from a mixed cell population [J].
Gosink, Mark M. ;
Petrie, Howard T. ;
Tsinoremas, Nicholas F. .
BIOINFORMATICS, 2007, 23 (24) :3328-3334
[10]
Exploration, normalization, and summaries of high density oligonucleotide array probe level data [J].
Irizarry, RA ;
Hobbs, B ;
Collin, F ;
Beazer-Barclay, YD ;
Antonellis, KJ ;
Scherf, U ;
Speed, TP .
BIOSTATISTICS, 2003, 4 (02) :249-264