Deconfounding microarray analysis - Independent measurements of cell type proportions used in a regression model to resolve tissue heterogeneity bias

被引:20
作者
Jacobsen, M.
Repsilber, D.
Gutschmidt, A.
Neher, A.
Feldmann, K.
Mollenkopf, H. J.
Kaufmann, S. H. E.
Ziegler, A. [1 ]
机构
[1] Med Univ Lubeck, Inst Med Biometry & Stat, D-23538 Lubeck, Germany
[2] Max Planck Inst Infect Biol, Dept Immunol, Berlin, Germany
[3] Asklepios Ctr Resp Med & Thorac Surg, Gauting, Germany
[4] Max Planck Inst Infect Biol, Microarray Core Facil, Berlin, Germany
关键词
transcriptome; tissue heterogeneity; deconfounding;
D O I
10.1055/s-0038-1634118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives: Microarray analysis requires standardized specimens and evaluation procedures to achieve acceptable results. A major limitation of this method is caused by heterogeneity in the cellular composition of tissue specimens, which frequently confounds data analysis. We introduce a linear model to deconfound gene expression data from tissue heterogeneity for genes exclusively expressed by a single cell type. Methods: Gene expression data are deconfounded from tissue heterogeneity effects by analyzing them using an appropriate linear regression model. In our illustrating data set tissue heterogeneity is being measured using flow cytometry. Gene expression data ore determined in parallel by real time quantitative polymerase chain reaction (qPCR) and microarray analyses. Verification of deconfounding is enabled using protein quantification for the respective marker genes. Results. for our illustrating dataset, quantification of cell type proportions for peripheral blood mononuclear cells (PBMC) from tuberculosis patients and controls revealed differences in B cell and monocyte proportions between both study groups, and thus heterogeneity for the tissue under investigation. Gene expression analyses reflected these differences in celltype distribution. Filling an appropriate linear model allowed us to deconfound measured transcriptome levels from tissue heterogeneity effects. In the case of monocytes, additional differential expression on the single cell level could be proposed. Protein quantification verified these deconfounded results. Conclusions: Deconfounding of transcriptome analyses for cellular heterogeneity greatly improves interpretability, and hence the validity of transcriptome profiling results.
引用
收藏
页码:557 / 563
页数:7
相关论文
共 24 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Sources of variability and effect of experimental approach on expression profiling data interpretation
    Bakay, M
    Chen, YW
    Borup, R
    Zhao, P
    Nagaraju, K
    Hoffman, EP
    [J]. BMC BIOINFORMATICS, 2002, 3 (1)
  • [3] BECK JS, 1985, CLIN EXP IMMUNOL, V60, P49
  • [4] Interferon and granulopoiesis signatures in systemic lupus erythematosus blood
    Bennett, L
    Palucka, AK
    Arce, E
    Cantrell, V
    Borvak, J
    Banchereau, J
    Pascual, V
    [J]. JOURNAL OF EXPERIMENTAL MEDICINE, 2003, 197 (06) : 711 - 723
  • [5] Laser capture microdissection
    EmmertBuck, MR
    Bonner, RF
    Smith, PD
    Chuaqui, RF
    Zhuang, ZP
    Goldstein, SR
    Weiss, RA
    Liotta, LA
    [J]. SCIENCE, 1996, 274 (5289) : 998 - 1001
  • [6] Mixture models for assessing differential expression in complex tissues using microarray data
    Ghosh, D
    [J]. BIOINFORMATICS, 2004, 20 (11) : 1663 - 1669
  • [7] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [8] Guidelines - Expression profiling - best practices for data generation and interpretation in clinical trials
    Hoffman, EP
    Awad, T
    Palma, J
    Webster, T
    Hubbell, E
    Warrington, JA
    Spirais, A
    Wright, G
    Buckley, J
    Triche, T
    Davis, R
    Tibshirani, R
    Xiao, WH
    Jones, W
    Tompkins, R
    West, M
    [J]. NATURE REVIEWS GENETICS, 2004, 5 (03) : 229 - 237
  • [9] A point mutation in PTPRC is associated with the development of multiple sclerosis
    Jacobsen, M
    Schweer, D
    Ziegler, A
    Gaber, R
    Schock, S
    Schwinzer, R
    Wonigeit, K
    Lindert, RB
    Kantarci, O
    Schaefer-Klein, J
    Schipper, HI
    Oertel, WH
    Heidenreich, F
    Weinshenker, BG
    Sommer, N
    Hemmer, B
    [J]. NATURE GENETICS, 2000, 26 (04) : 495 - 499
  • [10] Gene expression profile analysis by DNA microarrays - Promise and pitfalls
    King, HC
    Sinha, AA
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 286 (18): : 2280 - 2288