Incidence of "quasi-ditags" in catalogs generated by Serial Analysis of Gene Expression (SAGE)

被引:5
作者
Anisimov, SV [1 ]
Sharov, AA
机构
[1] Lund Univ, Wallenberg Neurosci Ctr, Sect Neuronal Survival, S-22184 Lund, Sweden
[2] NIA, Genet Lab, NIH, Baltimore, MD 21224 USA
关键词
D O I
10.1186/1471-2105-5-152
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Serial Analysis of Gene Expression (SAGE) is a functional genomic technique that quantitatively analyzes the cellular transcriptome. The analysis of SAGE libraries relies on the identification of ditags from sequencing files; however, the software used to examine SAGE libraries cannot distinguish between authentic versus false ditags ("quasi-ditags"). Results: We provide examples of quasi-ditags that originate from cloning and sequencing artifacts (i.e. genomic contamination or random combinations of nucleotides) that are included in SAGE libraries. We have employed a mathematical model to predict the frequency of quasi-ditags in random nucleotide sequences, and our data show that clones containing less than or equal to 2 ditags (which include chromosomal cloning artifacts) should be excluded from the analysis of SAGE catalogs. Conclusions: Cloning and sequencing artifacts contaminating SAGE libraries could be eliminated using simple pre-screening procedure to increase the reliability of the data.
引用
收藏
页数:9
相关论文
共 12 条
[1]   Blue-white selection step enhances the yield of SAGE concatemers [J].
Angelastro, JM ;
Ryu, EJ ;
Törocsik, B ;
Fiske, BK ;
Greene, LA .
BIOTECHNIQUES, 2002, 32 (03) :484-+
[2]   A quantitative and validated SAGE transcriptome reference for adult mouse heart [J].
Anisimov, SV ;
Tarasov, KV ;
Stern, MD ;
Lakatta, EG ;
Boheler, KR .
GENOMICS, 2002, 80 (02) :213-222
[3]   SAGE identification of gene transcripts with profiles unique to pluripotent mouse R1 embryonic stem cells [J].
Anisimov, SV ;
Tarasov, KV ;
Tweedie, D ;
Stern, MD ;
Wobus, AM ;
Boheler, KR .
GENOMICS, 2002, 79 (02) :169-176
[4]   SAGE identification of differentiation responsive genes in P19 embryonic cells induced to form cardiomyocytes in vitro [J].
Anisimov, SV ;
Tarasov, KV ;
Riordan, D ;
Wobus, AM ;
Boheler, KR .
MECHANISMS OF DEVELOPMENT, 2002, 117 (1-2) :25-74
[5]   Expression and functional characterization of the cardiac muscle ryanodine receptor Ca2+ release channel in Chinese hamster ovary cells [J].
Bhat, MB ;
Hayek, SM ;
Zhao, JY ;
Zang, WJ ;
Takeshima, H ;
Wier, WG ;
Ma, JJ .
BIOPHYSICAL JOURNAL, 1999, 77 (02) :808-816
[6]   The new role of SAGE in gene discovery [J].
Boheler, KR ;
Stern, MD .
TRENDS IN BIOTECHNOLOGY, 2003, 21 (02) :55-57
[7]   Substantially enhanced cloning efficiency of SAGE (Serial Analysis of Gene Expression) by adding a heating step to the original protocol [J].
Kenzelmann, M ;
Mühlemann, K .
NUCLEIC ACIDS RESEARCH, 1999, 27 (03) :917-918
[8]  
Potapova O, 2002, CANCER RES, V62, P3257
[9]   Complete genomes in WWW Entrez: data representation and analysis [J].
Tatusova, TA ;
Karsch-Mizrachi, I ;
Ostell, JA .
BIOINFORMATICS, 1999, 15 (7-8) :536-543
[10]  
van den Berg A, 1999, Nucleic Acids Res, V27, pe17, DOI 10.1093/nar/27.17.e17