Assessment of reliability of microarray data and estimation of signal thresholds using mixture modeling

被引:23
作者
Asyali, MH
Shoukri, MM
Demirkaya, O
Khabar, KSA
机构
[1] King Faisal Specialist Hosp & Res Ctr, Dept Biostat Epidemiol & Sci Comp, Riyadh 11211, Saudi Arabia
[2] King Faisal Specialist Hosp & Res Ctr, Dept Biol & Med Res, Riyadh 11211, Saudi Arabia
关键词
D O I
10.1093/nar/gkh544
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DNA microarray is an important tool for the study of gene activities but the resultant data consisting of thousands of points are error-prone. A serious limitation in microarray analysis is the unreliability of the data generated from low signal intensities. Such data may produce erroneous gene expression ratios and cause unnecessary validation or post-analysis follow-up tasks. In this study, we describe an approach based on normal mixture modeling for determining optimal signal intensity thresholds to identify reliable measurements of the microarray elements and subsequently eliminate false expression ratios. We used univariate and bivariate mixture modeling to segregate the microarray data into two classes, low signal intensity and reliable signal intensity populations, and applied Bayesian decision theory to find the optimal signal thresholds. The bivariate analysis approach was found to be more accurate than the univariate approach; both approaches were superior to a conventional method when validated against a reference set of biological data that consisted of true and false gene expression data. Elimination of unreliable signal intensities in microarray data should contribute to the quality of microarray data including reproducibility and reliability of gene expression ratios.
引用
收藏
页码:2323 / 2335
页数:13
相关论文
共 33 条
  • [1] A carbocyclic nucleoside analogue is a TNF-α inhibitor with immunosuppressive action:: Role of prostaglandin E2 and protein kinase C and comparison with pentoxifylline
    Al-Humidan, A
    Edwards, CK
    Al-Sofi, A
    Dzimiri, M
    Al-Sedairy, ST
    Khabar, ESA
    [J]. CELLULAR IMMUNOLOGY, 1998, 188 (01) : 12 - 18
  • [2] ARED 2.0: an update of AU-rich element mRNA database
    Bakheet, T
    Williams, BRG
    Khabar, KSA
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 421 - 423
  • [3] Defining signal thresholds in DNA microarrays: exemplary application for invasive cancer
    Bilban, M
    Buehler, LK
    Head, S
    Desoye, G
    Quaranta, V
    [J]. BMC GENOMICS, 2002, 3 (1)
  • [4] Significance and statistical errors in the analysis of DNA microarray data
    Brody, JP
    Williams, BA
    Wold, BJ
    Quake, SR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (20) : 12975 - 12978
  • [5] Gene expression signature of fibroblast serum response predicts human cancer progression: Similarities between tumors and wounds
    Chang, HY
    Sneddon, JB
    Alizadeh, AA
    Sood, R
    West, RB
    Montgomery, K
    Chi, JT
    van de Rijn, M
    Botstein, D
    Brown, PO
    [J]. PLOS BIOLOGY, 2004, 2 (02) : 206 - 214
  • [6] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
    COHEN, J
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
  • [7] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [8] Duda R. O., 2000, PATTERN CLASSIFICATI
  • [9] GP3: GenePix post-processing program for automated analysis of raw microarray data
    Fielden, MR
    Halgren, RG
    Dere, E
    Zacharewski, TR
    [J]. BIOINFORMATICS, 2002, 18 (05) : 771 - 773
  • [10] p38 mitogen-activated protein kinase-dependent and -independent signaling of mRNA stability of AU-rich element-containing transcripts
    Frevel, MAE
    Bakheet, T
    Silva, AM
    Hissong, JG
    Khabar, KSA
    Williams, BRG
    [J]. MOLECULAR AND CELLULAR BIOLOGY, 2003, 23 (02) : 425 - 436