Transcriptome analysis for Caenorhabditis elegans based on novel expressed sequence tags

被引:41
作者
Shin, Heesun [1 ,2 ]
Hirst, Martin [2 ]
Bainbridge, Matthew N. [2 ]
Magrini, Vincent [3 ]
Mardis, Elaine [3 ]
Moerman, Donald G. [4 ]
Marra, Marco A. [2 ]
Baillie, David L. [1 ]
Jones, Steven J. M. [2 ]
机构
[1] Simon Fraser Univ, Burnaby, BC V5A 1S6, Canada
[2] British Columbia Canc Agcy, British Columbia Canc Res Ctr, Canadas Michael Smith Genome Sci Ctr, Vancouver, BC V5Z 4E6, Canada
[3] Washington Univ, Sch Med, Genome Sequencing Ctr, St Louis, MO USA
[4] Univ British Columbia, Dept Zool, Vancouver, BC, Canada
基金
加拿大健康研究院;
关键词
D O I
10.1186/1741-7007-6-30
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: We have applied a high-throughput pyrosequencing technology for transcriptome profiling of Caenorhabditis elegans in its first larval stage. Using this approach, we have generated a large amount of data for expressed sequence tags, which provides an opportunity for the discovery of putative novel transcripts and alternative splice variants that could be developmentally specific to the first larval stage. This work also demonstrates the successful and efficient application of a next generation sequencing methodology. Results: We have generated over 30 million bases of novel expressed sequence tags from first larval stage worms utilizing high- throughput sequencing technology. We have shown that approximately 14% of the newly sequenced expressed sequence tags map completely or partially to genomic regions where there are no annotated genes or splice variants and therefore, imply that these are novel genetic structures. Expressed sequence tags, which map to intergenic ( around 1000) and intronic regions ( around 580), may represent novel transcribed regions, such as unannotated or unrecognized small protein- coding or non- protein- coding genes or splice variants. Expressed sequence tags, which map across intron-exon boundaries ( around 300), indicate possible alternative splice sites, while expressed sequence tags, which map near the ends of known transcripts ( around 600), suggest extension of the coding or untranslated regions. We have also discovered that intergenic and intronic expressed sequence tags, which are well conserved across different nematode species, are likely to represent non- coding RNAs. Lastly, we have incorporated available serial analysis of gene expression data generated from first larval stage worms, in order to predict novel transcripts that might be specifically or predominantly expressed in the first larval stage. Conclusion: We have demonstrated the use of a high- throughput sequencing methodology to efficiently produce a snap-shot of transcriptional activities occurring in the first larval stage of C. elegans development. Such application of this new sequencing technique allows for high- throughput, genome-wide experimental verification of known and novel transcripts. This study provides a more complete C. elegans transcriptome profile and, furthermore, gives insight into the evolutionary and biological complexity of this organism.
引用
收藏
页数:14
相关论文
共 31 条
[1]   Functions of the exosome in rRNA, snoRNA and snRNA synthesis [J].
Allmang, C ;
Kufel, J ;
Chanfreau, G ;
Mitchell, P ;
Petfalski, E ;
Tollervey, D .
EMBO JOURNAL, 1999, 18 (19) :5399-5410
[2]   Analysis of the prostate cancer cell line LNCaP transcriptome using a sequencing-by-synthesis approach [J].
Bainbridge, Matthew N. ;
Warren, Rene L. ;
Hirst, Martin ;
Romanuik, Tammy ;
Zeng, Thomas ;
Go, Anne ;
Delaney, Allen ;
Griffith, Malachi ;
Hickenbotham, Matthew ;
Magrini, Vincent ;
Mardis, Elaine R. ;
Sadar, Marianne D. ;
Siddiqui, Asim S. ;
Marra, Marco A. ;
Jones, Steven J. M. .
BMC GENOMICS, 2006, 7 (1)
[3]   The SWISS-PROT protein sequence data bank and its new supplement TREMBL [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 1996, 24 (01) :21-25
[4]   Crosstalk between RNA metabolic pathways: an RNOMICS approach [J].
Beggs, JD ;
Tollervey, D .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2005, 6 (05) :423-429
[5]   GOstat: find statistically overrepresented Gene Ontologies within a group of genes [J].
Beissbarth, T ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (09) :1464-1465
[6]   Human microRNAs are processed from capped, polyadenylated transcripts that can also function as mRNAs [J].
Cai, XZ ;
Hagedorn, CH ;
Cullen, BR .
RNA, 2004, 10 (12) :1957-1966
[7]   A computational approach to identify genes for functional RNAs in genomic sequences [J].
Carter, RJ ;
Dubchak, I ;
Holbrook, SR .
NUCLEIC ACIDS RESEARCH, 2001, 29 (19) :3928-3938
[8]   Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology [J].
Cheung, Foo ;
Haas, Brian J. ;
Goldberg, Susanne M. D. ;
May, Gregory D. ;
Xiao, Yongli ;
Town, Christopher D. .
BMC GENOMICS, 2006, 7 (1)
[9]   Organization of the Caenorhabditis elegans small non-coding transcriptome:: Genomic features, biogenesis, and expression [J].
Deng, W ;
Zhu, XP ;
Skogerbo, G ;
Zhao, Y ;
Fu, Z ;
Wang, YD ;
He, HS ;
Cai, L ;
Sun, H ;
Liu, CN ;
Li, B ;
Bai, BY ;
Wang, J ;
Jia, D ;
Sun, SW ;
He, H ;
Cui, Y ;
Wang, Y ;
Bu, DB ;
Chen, RS .
GENOME RESEARCH, 2006, 16 (01) :20-29
[10]   Computational analysis of RNAs [J].
Eddy, S. R. .
COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 2006, 71 :117-128