A sequence-based approach to identify reference genes for gene expression analysis

被引:41
作者
Chari, Raj [1 ]
Lonergan, Kim M. [1 ]
Pikor, Larissa A. [1 ]
Coe, Bradley P. [1 ]
Zhu, Chang Qi [2 ]
Chan, Timothy H. W. [1 ,3 ]
MacAulay, Calum E. [1 ]
Tsao, Ming-Sound [2 ]
Lam, Stephen [1 ]
Ng, Raymond T. [1 ,3 ]
Lam, Wan L. [1 ]
机构
[1] British Columbia Canc Agcy, Dept Integrat Oncol, Res Ctr, Vancouver, BC V5Z 4E6, Canada
[2] Princess Margaret Hosp, Ontario Canc Inst, Toronto, ON M4X 1K9, Canada
[3] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada
基金
加拿大健康研究院;
关键词
MESSENGER-RNA LEVELS; CELL LUNG-CANCER; HOUSEKEEPING GENES; BETA-ACTIN; BRONCHIAL EPITHELIUM; REVERSE TRANSCRIPTION; MICROARRAY DATA; SERIAL ANALYSIS; RIBOSOMAL-RNA; NORMALIZATION;
D O I
10.1186/1755-8794-3-32
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Background: An important consideration when analyzing both microarray and quantitative PCR expression data is the selection of appropriate genes as endogenous controls or reference genes. This step is especially critical when identifying genes differentially expressed between datasets. Moreover, reference genes suitable in one context (e.g. lung cancer) may not be suitable in another (e.g. breast cancer). Currently, the main approach to identify reference genes involves the mining of expression microarray data for highly expressed and relatively constant transcripts across a sample set. A caveat here is the requirement for transcript normalization prior to analysis, and measurements obtained are relative, not absolute. Alternatively, as sequencing-based technologies provide digital quantitative output, absolute quantification ensues, and reference gene identification becomes more accurate. Methods: Serial analysis of gene expression (SAGE) profiles of non-malignant and malignant lung samples were compared using a permutation test to identify the most stably expressed genes across all samples. Subsequently, the specificity of the reference genes was evaluated across multiple tissue types, their constancy of expression was assessed using quantitative RT-PCR (qPCR), and their impact on differential expression analysis of microarray data was evaluated. Results: We show that (i) conventional references genes such as ACTB and GAPDH are highly variable between cancerous and non-cancerous samples, (ii) reference genes identified for lung cancer do not perform well for other cancer types (breast and brain), (iii) reference genes identified through SAGE show low variability using qPCR in a different cohort of samples, and (iv) normalization of a lung cancer gene expression microarray dataset with or without our reference genes, yields different results for differential gene expression and subsequent analyses. Specifically, key established pathways in lung cancer exhibit higher statistical significance using a dataset normalized with our reference genes relative to normalization without using our reference genes. Conclusions: Our analyses found NDUFA1, RPL19, RAB5C, and RPS18 to occupy the top ranking positions among 15 suitable reference genes optimal for normalization of lung tissue expression data. Significantly, the approach used in this study can be applied to data generated using new generation sequencing platforms for the identification of reference genes optimal within diverse contexts.
引用
收藏
页数:11
相关论文
共 35 条
[1]
Normalization of real-time quantitative reverse transcription-PCR data: A model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets [J].
Andersen, CL ;
Jensen, JL ;
Orntoft, TF .
CANCER RESEARCH, 2004, 64 (15) :5245-5250
[2]
[Anonymous], 2000, Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses
[3]
GAPDH as a housekeeping gene: analysis of GAPDH mRNA expression in a panel of 72 human tissues [J].
Barber, RD ;
Harmer, DW ;
Coleman, RA ;
Clark, BJ .
PHYSIOLOGICAL GENOMICS, 2005, 21 (03) :389-395
[4]
Utility of the housekeeping genes 18S rRNA, β-actin and glyceraldehyde-3-phosphate-dehydrogenase for normalization in real-time quantitative reverse transcriptase-polymerase chain reaction analysis of gene expression in human T lymphocytes [J].
Bas, A ;
Forsberg, G ;
Hammarström, S ;
Hammarström, ML .
SCANDINAVIAN JOURNAL OF IMMUNOLOGY, 2004, 59 (06) :566-573
[5]
Current smoking-specific gene expression signature in normal bronchial epithelium is enhanced in squamous cell lung cancer [J].
Boelens, Mirjam C. ;
van den Berg, Anke ;
Fehrmann, Rudolf S. N. ;
Geerlings, Marie ;
de Jong, Wouter K. ;
Meerman, Gerard J. te ;
Sietsma, Hannie ;
Timens, Winn ;
Postma, Dirkje S. ;
Groen, Harry J. M. .
JOURNAL OF PATHOLOGY, 2009, 218 (02) :182-191
[6]
An anatomy of normal and malignant gene expression [J].
Boon, K ;
Osório, EC ;
Greenhut, SF ;
Schaefer, CF ;
Shoemaker, J ;
Polyak, K ;
Morin, PJ ;
Buetow, KH ;
Strausberg, RL ;
de Souza, SJ ;
Riggins, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (17) :11287-11292
[7]
Effect of active smoking on the human bronchial epithelium transcriptome [J].
Chari, Raj ;
Lonergan, Kim M. ;
Ng, Raymond T. ;
MacAulay, Calum ;
Lam, Wan L. ;
Lam, Stephen .
BMC GENOMICS, 2007, 8 :297
[8]
Evidence Based Selection of Housekeeping Genes [J].
de Jonge, Hendrik J. M. ;
Fehrmann, Rudolf S. N. ;
de Bont, Eveline S. J. M. ;
Hofstra, Robert M. W. ;
Gerbens, Frans ;
Kamps, Willem A. ;
de Vries, Elisabeth G. E. ;
van der Zee, Ate G. J. ;
Meerman, Gerard J. te ;
ter Elst, Arja .
PLOS ONE, 2007, 2 (09)
[9]
Normalization of gene expression measurements in tumor tissues: comparison of 13 endogenous control genes [J].
de Kok, JB ;
Roelofs, RW ;
Giesendorf, BA ;
Pennings, JL ;
Waas, ET ;
Feuth, T ;
Swinkels, DW ;
Span, PN .
LABORATORY INVESTIGATION, 2005, 85 (01) :154-159
[10]
Eickhoff B, 1999, Nucleic Acids Res, V27, pe33, DOI 10.1093/nar/27.22.e33