A new summarization method for affymetrix probe level data

被引:185
作者
Hochreiter, S [1 ]
Clevert, DA
Obermayer, K
机构
[1] Tech Univ Berlin, Dept Elect Engn & Comp Sci, D-10587 Berlin, Germany
[2] Johannes Kepler Univ Linz, Inst Bioinformat, A-4040 Linz, Austria
关键词
D O I
10.1093/bioinformatics/btl033
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: We propose a new model-based technique for summarizing high-density oligonucleotide array data at probe level for Affymetrix GeneChips. The new summarization method is based on a factor analysis model for which a Bayesian maximum a posteriori method optimizes the model parameters under the assumption of Gaussian measurement noise. Thereafter, the RNA concentration is estimated from the model. In contrast to previous methods our new method called 'Factor Analysis for Robust Microarray Summarization (FARMS)' supplies both P-values indicating interesting information and signal intensity values. Results: We compare FARMS on Affymetrix's spike-in and Gene Logic's dilution data to established algorithms like Affymetrix Microarray Suite (MAS) 5.0, Model Based Expression Index (MBEI), Robust Multi-array Average (RMA). Further, we compared FARMS with 43 other methods via the 'Affycomp II' competition. The experimental results show that FARMS with default parameters outperforms previous methods if both sensitivity and specificity are simultaneously considered by the area under the receiver operating curve (AUC). We measured two quantities through the AUC: correctly detected expression changes versus wrongly detected (fold change) and correctly detected significantly different expressed genes in two sets of arrays versus wrongly detected (P-value). Furthermore FARMS is computationally less expensive then RMA, MAS and MBEI.
引用
收藏
页码:943 / 949
页数:7
相关论文
共 24 条
  • [1] Affymetrix, 2001, MICR SUIT US GUID, V5th
  • [2] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [3] Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset
    Choe, SE
    Boutros, M
    Michelson, AM
    Church, GM
    Halfon, MS
    [J]. GENOME BIOLOGY, 2005, 6 (02)
  • [4] Chudin E, 2002, GENOME BIOL, V3
  • [5] A benchmark for affymetrix GeneChip expression measures
    Cope, LM
    Irizarry, RA
    Jaffee, HA
    Wu, ZJ
    Speed, TP
    [J]. BIOINFORMATICS, 2004, 20 (03) : 323 - 331
  • [6] Statistical tests for differential expression in cDNA microarray experiments
    Cui, XQ
    Churchill, GA
    [J]. GENOME BIOLOGY, 2003, 4 (04)
  • [7] DeGroot M., 1970, OPTIMAL STAT DECISIO
  • [8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [9] Dudoit S, 2002, STAT SINICA, V12, P111
  • [10] Freudenberg J, 2004, METHOD INFORM MED, V43, P434