Spot shape modelling and data transformations for microarrays

被引:27
作者
Ekstrom, CT
Bak, S
Kristensen, C
Rudemo, M
机构
[1] Royal Vet & Agr Univ, Dept Math & Phys, DK-1871 Frederiksberg C, Denmark
[2] Royal Vet & Agr Univ, Dept Plant Biol, Plant Biochem Lab, Ctr Mol Plant Physiol PlaCe, DK-1871 Frederiksberg, Denmark
关键词
D O I
10.1093/bioinformatics/bth237
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: To study lowly expressed genes in microarray experiments, it is useful to increase the photometric gain in the scanning. However, a large gain may cause some pixels for highly expressed genes to become saturated. Spatial statistical models that model spot shapes on the pixel level may be used to infer information about the saturated pixel intensities. Other possible applications for spot shape models include data quality control and accurate determination of spot centres and spot diameters. Results: Spatial statistical models for spotted microarrays are studied including pixel level transformations and spot shape models. The models are applied to a dataset from 50mer oligonucleotide microarrays with 452 selected Arabidopsis genes. Logarithmic, Box-Cox and inverse hyperbolic sine transformations are compared in combination with four spot shape models: a cylindric plateau shape, an isotropic Gaussian distribution and a difference of two-scaled Gaussian distribution suggested in the literature, as well as a proposed new polynomial-hyperbolic spot shape model. A substantial improvement is obtained for the dataset studied by the polynomial-hyperbolic spot shape model in combination with the Box-Cox transformation. The spatial statistical models are used to correct spot measurements with saturation by extrapolating the censored data.
引用
收藏
页码:2270 / 2278
页数:9
相关论文
共 13 条
[1]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[2]   Expression profiling using cDNA microarrays [J].
Duggan, DJ ;
Bittner, M ;
Chen, YD ;
Meltzer, P ;
Trent, JM .
NATURE GENETICS, 1999, 21 (Suppl 1) :10-14
[3]  
Durbin B P, 2002, Bioinformatics, V18 Suppl 1, pS105
[4]   Combinatorial image analysis of DNA microarray features [J].
Glasbey, CA ;
Ghazal, P .
BIOINFORMATICS, 2003, 19 (02) :194-203
[5]  
Huber Wolfgang, 2002, Bioinformatics, V18 Suppl 1, pS96
[6]   Assessment of the sensitivity and specificity of oligonucleotide (50mer) microarrays [J].
Kane, MD ;
Jatkoe, TA ;
Stumpf, CR ;
Lu, J ;
Thomas, JD ;
Madore, SJ .
NUCLEIC ACIDS RESEARCH, 2000, 28 (22) :4552-4557
[7]   On the origin of family 1 plant glycosyltransferases [J].
Paquette, S ;
Moller, BL ;
Bak, S .
PHYTOCHEMISTRY, 2003, 62 (03) :399-413
[8]   Intron-exon organization and phylogeny in a large superfamily, the paralogous cytochrome P450 genes of Arabidopsis thaliana [J].
Paquette, SM ;
Bak, S ;
Feyereisen, R .
DNA AND CELL BIOLOGY, 2000, 19 (05) :307-317
[9]   Approximate variance-stabilizing transformations for gene-expression microarray data [J].
Rocke, DM ;
Durbin, B .
BIOINFORMATICS, 2003, 19 (08) :966-972
[10]   Resistance to an herbivore through engineered cyanogenic glucoside synthesis [J].
Tattersall, DB ;
Bak, S ;
Jones, PR ;
Olsen, CE ;
Nielsen, JK ;
Hansen, ML ;
Hoj, PB ;
Moller, BL .
SCIENCE, 2001, 293 (5536) :1826-1828