Transformation of expression intensities across generations of affymetrix microarrays using sequence matching and regression modeling

被引:6
作者
Bhattacharya, S
Mariani, TJ
机构
[1] Brigham & Womens Hosp, Div Pulm Med, Boston, MA 02115 USA
[2] Harvard Univ, Sch Med, Lung Biol Ctr, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1093/nar/gni159
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The utility of previously generated microarray data is severely limited owing to small study size, leading to under-powered analysis, and failure of replication. Multiplicity of platforms and various sources of systematic noise limit the ability to compile existing data from similar studies. We present a model for transformation of data across different generations of Affymetrix arrays, developed using previously published datasets describing technical replicates performed with two generations of arrays. The transformation is based upon a probe set-specific regression model, generated from replicate measurements across platforms, performed using correlation coefficients. The model, when applied to the expression intensities of 5069 shared, sequence-matched probe sets in three different generations of Affymetrix Human oligonucleotide arrays, showed significant improvement in inter generation correlations between sample-wide means and individual probe set pairs. The approach was further validated by an observed reduction in Euclidean distance between signal intensities across generations for the predicted values. Finally, application of the model to independent, but related datasets resulted in improved clustering of samples based upon their biological, as opposed to technical, attributes. Our results suggest that this transformation method is a valuable tool for integrating microarray datasets from different generations of arrays.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 33 条
[1]  
[Anonymous], 2004, Guide to Analysis of DNA Microarray Data
[2]   Quantitative analysis of mRNA amplification by in vitro transcription [J].
Baugh, L. R. ;
Hill, A. A. ;
Brown, E. L. ;
Hunter, Craig P. .
NUCLEIC ACIDS RESEARCH, 2001, 29 (05)
[3]   Gene-expression profiles predict survival of patients with lung adenocarcinoma [J].
Beer, DG ;
Kardia, SLR ;
Huang, CC ;
Giordano, TJ ;
Levin, AM ;
Misek, DE ;
Lin, L ;
Chen, GA ;
Gharib, TG ;
Thomas, DG ;
Lizyness, ML ;
Kuick, R ;
Hayasaka, S ;
Taylor, JMG ;
Iannettoni, MD ;
Orringer, MB ;
Hanash, S .
NATURE MEDICINE, 2002, 8 (08) :816-824
[4]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[5]  
Bhattacharya Soumyaroop, 2003, Appl Bioinformatics, V2, P197
[6]   ArrayExpress - a public repository for microarray gene expression data at the EBI [J].
Brazma, A ;
Parkinson, H ;
Sarkans, U ;
Shojatalab, M ;
Vilo, J ;
Abeygunawardena, N ;
Holloway, E ;
Kapushesky, M ;
Kemmeren, P ;
Lara, GG ;
Oezcimen, A ;
Rocca-Serra, P ;
Sansone, SA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :68-71
[7]  
Chipman H., 2003, CLUSTERING MICROARRA
[8]   Gene Expression Omnibus: NCBI gene expression and hybridization array data repository [J].
Edgar, R ;
Domrachev, M ;
Lash, AE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :207-210
[9]  
Ghosh Debashis, 2003, Functional & Integrative Genomics, V3, P180
[10]   The Stanford Microarray Database: data access and quality assessment tools [J].
Gollub, J ;
Ball, CA ;
Binkley, G ;
Demeter, J ;
Finkelstein, DB ;
Hebert, JM ;
Hernandez-Boussard, T ;
Jin, H ;
Kaloper, M ;
Matese, JC ;
Schroeder, M ;
Brown, PO ;
Botstein, D ;
Sherlock, G .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :94-96