Large-scale RACE approach for proactive experimental definition of C. elegans ORFeome

Cited by: 11
Authors
Salehi-Ashtiani, Kourosh [1, 2, 3]
Lin, Chenwei [1, 2, 3]
Hao, Tong [1, 2, 3]
Shen, Yun [1, 2, 3]
Szeto, David [1, 2, 3]
Yang, Xinping [1, 2, 3]
Ghamsari, Lila [1, 2, 3]
Lee, HanJoo [1, 2, 3]
Fan, Changyu [1, 2, 3]
Murray, Ryan R. [1, 2, 3]
Milstein, Stuart [1, 2, 3]
Svrzikapa, Nenad [1, 2, 3]
Cusick, Michael E. [1, 2, 3]
Roth, Frederick P. [4]
Hill, David E. [1, 2, 3]
Vidal, Marc [1, 2, 3]
Affiliations
[1] Dana Farber Canc Inst, CCSB, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Dept Canc Biol, Boston, MA 02115 USA
[3] Harvard Univ, Dept Genet, Sch Med, Boston, MA 02115 USA
[4] Harvard Univ, Dept Biol Chem & Mol Pharmacol, Sch Med, Boston, MA 02115 USA
Keywords
HIGH-THROUGHPUT ANALYSIS; FULL-LENGTH CDNA; MESSENGER-RNA; GENOME ANNOTATION; LEADER SEQUENCE; VERSION 1.1; TRANSCRIPTOME; CLONING; GENE; AMPLIFICATION
DOI
10.1101/gr.098640.109
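Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
070307 [Chemical Biology]; 071010 [Biochemistry and Molecular Biology]
Abstract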
Although a highly accurate sequence of the Caenorhabditis elegans genome has been available for 10 years, the exact transcript structures of many of its protein-coding genes remain unsettled. Approximately two-thirds of the ORFeome has been verified reactively by amplifying and cloning computationally predicted transcript models; still a full third of the ORFeome remains experimentally unverified. To fully identify the protein-coding potential of the worm genome including transcripts that may not satisfy existing heuristics for gene prediction, we developed a computational and experimental platform adapting rapid amplification of cDNA ends (RACE) for large-scale structural transcript annotation. We interrogated 2000 unverified protein-coding genes using this platform. We obtained RACE data for approximately two-thirds of the examined transcripts and reconstructed ORF and transcript models for close to 1000 of these. We defined untranslated regions, identified new exons, and redefined previously annotated exons. Our results show that as much as 20% of the C. elegans genome may be incorrectly annotated. Many annotation errors could be corrected proactively with our large-scale RACE platform.
Pages: 2334-2342
Number of pages: 9
Related papers
35 in total
[1]
MAPPING THE 5'-TERMINUS OF RICE TUNGRO BACILLIFORM VIRAL GENOMIC RNA [J].
BAO, YM ;
HULL, R .
VIROLOGY, 1993, 197 (01) :445-448
[2]
A global analysis of Caenorhabditis elegans operons [J].
Blumenthal, T ;
Evans, D ;
Link, CD ;
Guffanti, A ;
Lawson, D ;
Thierry-Mieg, J ;
Thierry-Mieg, D ;
Chiu, WL ;
Duke, K ;
Kiraly, M ;
Kim, SK .
NATURE, 2002, 417 (6891) :851-854
[3]
Blumenthal Thomas, 2005, WormBook, P1, DOI 10.1895/wormbook.1.5.1
[4]
Genome sequence of the nematode C. elegans: A platform for investigating biology [J].
[Anonymous] .
SCIENCE, 1998, 282 (5396) :2012-2018
[5]
Chenchik A, 1996, BIOTECHNIQUES, V21, P526
[6]
Pathway aberrations of murine melanoma cells observed in Paired-End diTag transcriptomes [J].
Chiu, Kuo Ping ;
Ariyaratne, Pramila ;
Xu, Han ;
Tan, Adrian ;
Ng, Patrick ;
Liu, Edison Tak-Bun ;
Ruan, Yijun ;
Wei, Chia-Lin ;
Sung, Wing-Kin Ken .
BMC CANCER, 2007, 7
[7]
Stem cell transcriptome profiling via massive-scale mRNA sequencing [J].
Cloonan, Nicole ;
Forrest, Alistair R. R. ;
Kolle, Gabriel ;
Gardiner, Brooke B. A. ;
Faulkner, Geoffrey J. ;
Brown, Mellissa K. ;
Taylor, Darrin F. ;
Steptoe, Anita L. ;
Wani, Shivangi ;
Bethel, Graeme ;
Robertson, Alan J. ;
Perkins, Andrew C. ;
Bruce, Stephen J. ;
Lee, Clarence C. ;
Ranade, Swati S. ;
Peckham, Heather E. ;
Manning, Jonathan M. ;
McKernan, Kevin J. ;
Grimmond, Sean M. .
NATURE METHODS, 2008, 5 (07) :613-619
[8]
CONRAD R, 1995, RNA, V1, P164
[9]
Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses [J].
Fullwood, Melissa J. ;
Wei, Chia-Lin ;
Liu, Edison T. ;
Ruan, Yijun .
GENOME RESEARCH, 2009, 19 (04) :521-532
[10]
C. elegans sequences that control trans-splicing and operon pre-mRNA processing [J].
Graber, Joel H. ;
Salisbury, Jesse ;
Hutchins, Lucie N. ;
Blumenthal, Thomas .
RNA, 2007, 13 (09) :1409-1426