Consistency Analysis of Redundant Probe Sets on Affymetrix Three-Prime Expression Arrays and Applications to Differential mRNA Processing

被引:16
作者
Cui, Xiangqin [1 ]
Loraine, Ann E. [1 ]
机构
[1] Univ N Carolina, Dept Bioinformat & Genom, Charlotte, NC USA
来源
PLOS ONE | 2009年 / 4卷 / 01期
关键词
GENE-EXPRESSION; MICROARRAYS; NETAFFX; TESTS; CALL;
D O I
10.1371/journal.pone.0004229
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Affymetrix three-prime expression microarrays contain thousands of redundant probe sets that interrogate different regions of the same gene. Differential expression analysis methods rarely consider probe redundancy, which can lead to inaccurate inference about overall gene expression or cause investigators to overlook potentially valuable information about differential regulation of variant mRNA products. We investigated the behaviour and consistency of redundant probe sets in a publicly-available data set containing samples from mouse brain amygdala and hippocampus and asked how applying filtering methods to the data affected consistency of results obtained from redundant probe sets. A genome-based filter that screens and groups probe sets according to their overlapping genomic alignments significantly improved redundant probe set consistency. Screening based on qualitative Present-Absent calls from MAS5 also improved consistency. However, even after applying these filters, many redundant probe sets showed significant fold-change differences relative to each other, suggesting differential regulation of alternative transcript production. Visual inspection of these loci using an interactive genome visualization tool (igb.bioviz.org) exposed thirty putative examples of differential regulation of alternative splicing or polyadenylation across brain regions in mouse. This work demonstrates how P/A-call and genome-based filtering can improve consistency among redundant probe sets while at the same time exposing possible differential regulation of RNA processing pathways across sample types.
引用
收藏
页数:11
相关论文
共 26 条
[1]  
*AFF, 2002, STAT ALG DESCR DOC
[2]  
Agresti A., 1990, CATEGORICAL DATA ANA
[3]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[4]  
Benjamini Y, 2000, J EDUC BEHAV STAT, V25, P60, DOI 10.2307/1165312
[5]   ANOSVA: a statistical method for detecting splice variation from expression data [J].
Cline, MS ;
Blume, J ;
Cawley, S ;
Clark, TA ;
Hu, JS ;
Lu, G ;
Salomonis, N ;
Wang, H ;
Williams, A .
BIOINFORMATICS, 2005, 21 :I107-I115
[6]  
CUI X, 2006, GLOBAL CORRELATION A, P223
[7]   Improved statistical tests for differential gene expression by shrinking variance components estimates [J].
Cui, XG ;
Hwang, JTG ;
Qiu, J ;
Blades, NJ ;
Churchill, GA .
BIOSTATISTICS, 2005, 6 (01) :59-75
[8]  
D'mello Veera, 2006, Appl Bioinformatics, V5, P249, DOI 10.2165/00822942-200605040-00007
[9]   Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data [J].
Dai, MH ;
Wang, PL ;
Boyd, AD ;
Kostov, G ;
Athey, B ;
Jones, EG ;
Bunney, WE ;
Myers, RM ;
Speed, TP ;
Akil, H ;
Watson, SJ ;
Meng, F .
NUCLEIC ACIDS RESEARCH, 2005, 33 (20) :e175.1-e175.9
[10]   Alternative mapping of probes to genes for Affymetrix chips -: art. no. 111 [J].
Gautier, L ;
Moller, M ;
Friis-Hansen, L ;
Knudsen, S .
BMC BIOINFORMATICS, 2004, 5 (1)