Identification and handling of artifactual gene expression profiles emerging in microarray hybridization experiments

被引:6
作者
Brodsky, L
Leontovich, A
Shtutman, M
Feinstein, E
机构
[1] Quark Biotech Inc, QBI Enterprises Ltd, IL-70400 Ness Ziona, Israel
[2] Moscow MV Lomonosov State Univ, Belozersky Inst Physicochem Biol, Moscow, Russia
关键词
D O I
10.1093/nar/gnh043
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Mathematical methods of analysis of microarray hybridizations deal with gene expression profiles as elementary units. However, some of these profiles do not reflect a biologically relevant transcriptional response, but rather stem from technical artifacts. Here, we describe two technically independent but rationally interconnected methods for identification of such artifactual profiles. Our diagnostics are based on detection of deviations from uniformity, which is assumed as the main underlying principle of microarray design. Method 1 is based on detection of non-uniformity of microarray distribution of printed genes that are clustered based on the similarity of their expression profiles. Method 2 is based on evaluation of the presence of gene-specific microarray spots within the slides' areas characterized by an abnormal concentration of low/high differential expression values, which we define as 'patterns of differentials'. Applying two novel algorithms, for nested clustering (method 1) and for pattern detection (method 2), we can make a dual estimation of the profile's quality for almost every printed gene. Genes with artifactual profiles detected by method 1 may then be removed from further analysis. Suspicious differential expression values detected by method 2 may be either removed or weighted according to the probabilities of patterns that cover them, thus diminishing their input in any further data analysis.
引用
收藏
页数:12
相关论文
共 31 条
[1]   The lymphochip: A specialized cDNA microarray for the genomic-scale analysis of gene expression in normal and malignant lymphocytes [J].
Alizadeh, A ;
Eisen, M ;
Davis, RE ;
Ma, C ;
Sabet, H ;
Tran, T ;
Powell, JI ;
Yang, L ;
Marti, GE ;
Moore, DT ;
Hudson, JR ;
Chan, WC ;
Greiner, T ;
Weisenburger, D ;
Armitage, JO ;
Lossos, I ;
Levy, R ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 1999, 64 :71-78
[2]   A CLUSTERING TECHNIQUE FOR SUMMARIZING MULTIVARIATE DATA [J].
BALL, GH ;
HALL, DJ .
BEHAVIORAL SCIENCE, 1967, 12 (02) :153-&
[3]   Gene expression informatics - it's all in your mine [J].
Bassett, DE ;
Eisen, MB ;
Boguski, MS .
NATURE GENETICS, 1999, 21 (Suppl 1) :51-55
[4]   Calculation of the minimum number of replicate spots required for detection of significant gene expression fold change in microarray experiments [J].
Black, MA ;
Doerge, RW .
BIOINFORMATICS, 2002, 18 (12) :1609-1616
[5]   Significance and statistical errors in the analysis of DNA microarray data [J].
Brody, JP ;
Williams, BA ;
Wold, BJ ;
Quake, SR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (20) :12975-12978
[6]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[7]   Ratio statistics of gene expression levels and applications to microarray data analysis [J].
Chen, YD ;
Kamat, V ;
Dougherty, ER ;
Bittner, ML ;
Meltzer, PS ;
Trent, JM .
BIOINFORMATICS, 2002, 18 (09) :1207-1215
[8]   SNOMAD (Standardization and NOrmalization of MicroArray Data): web-accessible gene expression data analysis [J].
Colantuoni, C ;
Henry, G ;
Zeger, S ;
Pevsner, J .
BIOINFORMATICS, 2002, 18 (11) :1540-1541
[9]  
DOBBIN K, STAT DESIGN REVERSE
[10]  
Eisen MB, 1999, METHOD ENZYMOL, V303, P179