Sequence biases in large scale gene expression profiling data

Cited by: 44
Authors
Siddiqui, Asim S. [1]
Delaney, Allen D. [1]
Schnerch, Angelique [1]
Griffith, Obi L. [1]
Jones, Steven J. M. [1]
Marra, Marco A. [1]
Affiliations
[1] British Columbia Cancer Agency, British Columbia Cancer Research Centre, Canada's Michael Smith Genome Sciences Centre, Vancouver, BC V5Z 4S6, Canada
Funding
Canadian Institutes of Health Research
DOI
10.1093/nar/gkl404
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
We present the results of a simple, statistical assay that measures the G+C content sensitivity bias of gene expression experiments without the requirement of a duplicate experiment. We analyse five gene expression profiling methods: Affymetrix GeneChip, Long Serial Analysis of Gene Expression (LongSAGE), LongSAGELite, 'Classic' Massively Parallel Signature Sequencing (MPSS) and 'Signature' MPSS. We demonstrate the methods have systematic and random errors leading to a different G+C content sensitivity. The relationship between this experimental error and the G+C content of the probe set or tag that identifies each gene influences whether the gene is detected and, if detected, the level of gene expression measured. LongSAGE has the least bias, while Signature MPSS shows a strong bias to G+C rich tags and Affymetrix data show different bias depending on the data processing method (MAS 5.0, RMA or GC-RMA). The bias in the Affymetrix data primarily impacts genes expressed at lower levels. Despite the larger sampling of the MPSS library, SAGE identifies significantly more genes (60% more RefSeq genes in a single comparison).
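The abstract does not spell out how the assay works, so the following is only an illustrative sketch of the quantity it is built around, not the authors' method. It computes the G+C fraction of a tag sequence and, as one possible way to look for a trend, averages observed tag counts within equal-width G+C bins. The function names, the binning scheme and the example tags are all assumptions made for illustration.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def mean_count_by_gc_bin(tag_counts: dict[str, int], n_bins: int = 4) -> dict[int, float]:
    """Average observed count of tags grouped into equal-width G+C bins.

    Hypothetical helper: bin 0 holds the most A+T rich tags and
    bin n_bins - 1 the most G+C rich ones. Empty bins are omitted.
    """
    totals = [0] * n_bins
    sizes = [0] * n_bins
    for tag, count in tag_counts.items():
        # A G+C fraction of exactly 1.0 is clamped into the last bin.
        b = min(int(gc_content(tag) * n_bins), n_bins - 1)
        totals[b] += count
        sizes[b] += 1
    return {b: totals[b] / sizes[b] for b in range(n_bins) if sizes[b]}
```

A systematic rise or fall of the per-bin average across bins would hint at a G+C dependent sensitivity, although a real analysis would also need to account for true biological differences in expression.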
Pages: 9
Related papers
33 in total
[1]   Reliability analysis of microarray data using fuzzy c-means and normal mixture modeling based classification methods [J].
Asyali, MH ;
Alci, M .
BIOINFORMATICS, 2005, 21 (05) :644-649
[2]   The significance of digital gene expression profiles [J].
Audic, S ;
Claverie, JM .
GENOME RESEARCH, 1997, 7 (10) :986-995
[3]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[4]   An anatomy of normal and malignant gene expression [J].
Boon, K ;
Osório, EC ;
Greenhut, SF ;
Schaefer, CF ;
Shoemaker, J ;
Polyak, K ;
Morin, PJ ;
Buetow, KH ;
Strausberg, RL ;
de Souza, SJ ;
Riggins, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (17) :11287-11292
[5]   Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays [J].
Brenner, S ;
Johnson, M ;
Bridgham, J ;
Golda, G ;
Lloyd, DH ;
Johnson, D ;
Luo, SJ ;
McCurdy, S ;
Foy, M ;
Ewan, M ;
Roth, R ;
George, D ;
Eletr, S ;
Albrecht, G ;
Vermaas, E ;
Williams, SR ;
Moon, K ;
Burcham, T ;
Pallas, M ;
DuBridge, RB ;
Kirchner, J ;
Fearon, K ;
Mao, J ;
Corcoran, K .
NATURE BIOTECHNOLOGY, 2000, 18 (06) :630-634
[6]   Redefinition of Affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements [J].
Carter, SL ;
Eklund, AC ;
Mecham, BH ;
Kohane, IS ;
Szallasi, Z .
BMC BIOINFORMATICS, 2005, 6 (1)
[7]   Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution [J].
Cheng, J ;
Kapranov, P ;
Drenkow, J ;
Dike, S ;
Brubaker, S ;
Patel, S ;
Long, J ;
Stern, D ;
Tammana, H ;
Helt, G ;
Sementchenko, V ;
Piccolboni, A ;
Bekiranov, S ;
Bailey, DK ;
Ganesh, M ;
Ghosh, S ;
Bell, I ;
Gerhard, DS ;
Gingeras, TR .
SCIENCE, 2005, 308 (5725) :1149-1154
[8]   Reproducibility, bioinformatic analysis and power of the SAGE method to evaluate changes in transcriptome [J].
Dinel, S ;
Bolduc, C ;
Belleau, P ;
Boivin, A ;
Yoshioka, M ;
Calvo, E ;
Piedboeuf, B ;
Snyder, EE ;
Labrie, F ;
St-Amand, J .
NUCLEIC ACIDS RESEARCH, 2005, 33 (03) :1-8
[9]   Transcript profiling of human platelets using microarray and serial analysis of gene expression [J].
Gnatenko, DV ;
Dunn, JJ ;
McCorkle, SR ;
Weissmann, D ;
Perrotta, PL ;
Bahou, WF .
BLOOD, 2003, 101 (06) :2285-2293
[10]   Assessment and integration of publicly available SAGE, cDNA microarray, and oligonucleotide microarray expression data for global coexpression analyses [J].
Griffith, OL ;
Pleasance, ED ;
Fulton, DL ;
Oveisi, M ;
Ester, M ;
Siddiqui, AS ;
Jones, SJM .
GENOMICS, 2005, 86 (04) :476-488