Identification and prevention of a GC content bias in SAGE libraries

被引:53
作者
Margulies, Elliott H. [1 ]
Kardia, Sharon L. R. [3 ]
Innis, Jeffrey W. [1 ,2 ]
机构
[1] Univ Michigan, Sch Med, Dept Human Genet, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Sch Med, Dept Pediat & Communicable Dis, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Sch Publ Hlth, Dept Epidemiol, Ann Arbor, MI 48109 USA
关键词
D O I
10.1093/nar/29.12.e60
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Serial Analysis of Gene Expression (SAGE) is becoming a widely used gene expression profiling method for the study of development, cancer and other human diseases. Investigators using SAGE rely heavily on the quantitative aspect of this method for cataloging gene expression and comparing multiple SAGE libraries. We have developed additional computational and statistical tools to assess the quality and reproducibility of a SAGE library. Using these methods, a critical variable in the SAGE protocol was identified that has the potential to bias the Tag distribution relative to the GC content of the 10 bp SAGE Tag DNA sequence. We also detected this bias in a number of publicly available SAGE libraries. It is important to note that the GC content bias went undetected by quality control procedures in the current SAGE protocol and was only identified with the use of these statistical analyses on as few as 750 SAGE Tags. In addition to keeping any solution of free DiTags on ice, an analysis of the GC content should be performed before sequencing large numbers of SAGE Tags to be confident that SAGE libraries are free from experimental bias.
引用
收藏
页数:8
相关论文
共 23 条
[1]   Improved NlaIII digestion of PAGE-purified 102 bp ditags by addition of a single purification step in both the SAGE and microSAGE protocols [J].
Angelastro, James M. ;
Klimaschewski, Lars P. ;
Vitolo, Ottavio V. .
NUCLEIC ACIDS RESEARCH, 2000, 28 (12) :62
[2]   The significance of digital gene expression profiles [J].
Audic, S ;
Claverie, JM .
GENOME RESEARCH, 1997, 7 (10) :986-995
[3]   The mouse brain transcriptome by SAGE: Differences in gene expression between P30 brains of the partial trisomy 16 mouse model of Down syndrome (Ts65Dn) and normals [J].
Chrast, R ;
Scott, HS ;
Papasavvas, MP ;
Rossier, C ;
Antonarakis, ES ;
Barras, C ;
Davisson, MT ;
Schmidt, C ;
Estivill, X ;
Dierssen, M ;
Pritchard, M ;
Antonarakis, SE .
GENOME RESEARCH, 2000, 10 (12) :2006-2021
[4]   MicroSAGE: a modified procedure for serial analysis of gene expression in limited amounts of tissue [J].
Datson, NA ;
van der Perk-de Jong, J ;
van den Berg, MP ;
de Kloet, ER ;
Vreugdenhil, E .
NUCLEIC ACIDS RESEARCH, 1999, 27 (05) :1300-1307
[5]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[6]   Dynamics of gene expression revealed by comparison of serial analysis of gene expression transcript profiles from yeast grown on two different carbon sources [J].
Kal, AJ ;
van Zonneveld, AJ ;
Benes, V ;
van den Berg, M ;
Koerkamp, MG ;
Albermann, K ;
Strack, N ;
Ruijter, JM ;
Richter, A ;
Dujon, B ;
Ansorge, W ;
Tabak, HF .
MOLECULAR BIOLOGY OF THE CELL, 1999, 10 (06) :1859-1872
[7]   Substantially enhanced cloning efficiency of SAGE (Serial Analysis of Gene Expression) by adding a heating step to the original protocol [J].
Kenzelmann, M ;
Mühlemann, K .
NUCLEIC ACIDS RESEARCH, 1999, 27 (03) :917-918
[8]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[9]   SAGEmap: A public gene expression resource [J].
Lash, AE ;
Tolstoshev, CM ;
Wagner, L ;
Schuler, GD ;
Strausberg, RL ;
Riggins, GJ ;
Altschul, SF .
GENOME RESEARCH, 2000, 10 (07) :1051-1060
[10]   POWER_SAGE: comparing statistical tests for SAGE experiments [J].
Man, MZ ;
Wang, XN ;
Wang, YX .
BIOINFORMATICS, 2000, 16 (11) :953-959