Mapping the C. elegans noncoding transcriptome with a whole-genome tiling microarray

Cited by: 45
Authors
He, Housheng
Wang, Jie
Liu, Tao
Liu, X. Shirley
Li, Tiantian
Wang, Yunfei
Qian, Zuwei
Zheng, Haixia
Zhu, Xiaopeng [1]
Wu, Tao
Shi, Baochen
Deng, Wei
Zhou, Wei
Skogerbo, Geir
Chen, Runsheng
Affiliations
[1] Chinese Acad Sci, Inst Biophys, Bioinformat Lab, Beijing 100101, Peoples R China
[2] Chinese Acad Sci, Inst Biophys, Natl Lab Biomacromol, Beijing 100101, Peoples R China
[3] Chinese Acad Sci, Grad Sch, Beijing 100080, Peoples R China
[4] Harvard Univ, Sch Med, Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
[5] Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USA
[6] Affymetrix Inc, Santa Clara, CA 95051 USA
[7] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Bioinformat Res Grp, Beijing 100080, Peoples R China
[8] Chinese Natl Human Genome Ctr, Beijing 100176, Peoples R China
DOI
10.1101/gr.6611807
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
The number of annotated protein-coding genes in the genome of Caenorhabditis elegans is similar to that of other animals, but the extent of its non-protein-coding transcriptome remains unknown. Expression profiling on whole-genome tiling microarrays applied to a mixed-stage C. elegans population verified the expression of 71% of all annotated exons. Only a small fraction (11%) of the polyadenylated transcription is non-annotated and appears to consist of ~3200 missed or alternative exons and 7800 small transcripts of unknown function (TUFs). Almost half (44%) of the detected transcriptional output is non-polyadenylated and probably not protein coding, and of this, 70% overlaps the boundaries of protein-coding genes in a complex manner. Specific analysis of small non-polyadenylated transcripts verified 97% of all annotated small ncRNAs and suggested that the transcriptome contains ~1200 small (<500 nt) unannotated noncoding loci. After combining overlapping transcripts, we estimate that at least 70% of the total C. elegans genome is transcribed.
Pages: 1471-1477
Page count: 7