Pseudo-messenger RNA: Phantoms of the transcriptome

被引:51
作者
Frith, Martin C.
Wilming, Laurens G.
Forrest, Alistair
Kawaji, Hideya
Tan, Sin Lam
Wahlestedt, Claes
Bajic, Vladimir B.
Kai, Chikatoshi
Kawai, Jun
Carninci, Piero
Hayashizaki, Yoshihide
Bailey, Timothy L.
Huminiecki, Lukasz [1 ]
机构
[1] Karolinska Inst, Ctr Genom & Bioinformat, Stockholm, Sweden
[2] RIKEN, Genom Sci Ctr, Yokohama Inst, Genome Explorat Res Grp,Genome Network Project Co, Yokohama, Kanagawa, Japan
[3] Univ Queensland, Inst Mol Biosci, Brisbane, Qld, Australia
[4] Wellcome Trust Sanger Inst, Hinxton, England
[5] Inst Infocomm Res, Singapore, Singapore
[6] Univ Western Cape, S African Natl Bioinformat Inst, ZA-7535 Bellville, South Africa
[7] Scripps Res Inst, Dept Biomed Sci, Jupiter, FL USA
[8] RIKEN, Genome Sci Lab, Discovery Res Inst, Wako Inst, Wako, Saitama 35101, Japan
[9] Uppsala Univ, Ludwig Inst Canc Res, Uppsala, Sweden
关键词
D O I
10.1371/journal.pgen.0020023
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The mammalian transcriptome harbours shadowy entities that resist classification and analysis. In analogy with pseudogenes, we define pseudo-messenger RNA to be RNA molecules that resemble protein- coding mRNA, but cannot encode full-length proteins owing to disruptions of the reading frame. Using a rigorous computational pipeline, which rules out sequencing errors, we identify 10,679 pseudo - messenger RNAs ( approximately half of which are transposonassociated) among the 102,801 FANTOM3 mouse cDNAs: just over 10% of the FANTOM3 transcriptome. These comprise not only transcribed pseudogenes, but also disrupted splice variants of otherwise protein- coding genes. Some may encode truncated proteins, only a minority of which appear subject to nonsense- mediated decay. The presence of an excess of transcripts whose only disruptions are opal stop codons suggests that there are more selenoproteins than currently estimated. We also describe compensatory frameshifts, where a segment of the gene has changed frame but remains translatable. In summary, we survey a large class of non- standard but potentially functional transcripts that are likely to encode genetic information and effect biological processes in novel ways. Many of these transcripts do not correspond cleanly to any identifiable object in the genome, implying fundamental limits to the goal of annotating all functional elements at the genome sequence level.
引用
收藏
页码:504 / 514
页数:11
相关论文
共 33 条
[1]   Nonsense-mediated RNA decay: a molecular system micromanaging individual gene activities and suppressing genomic noise [J].
Alonso, CR .
BIOESSAYS, 2005, 27 (05) :463-466
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Pseudogenes: Are they "Junk" or functional DNA? [J].
Balakirev, ES ;
Ayala, FJ .
ANNUAL REVIEW OF GENETICS, 2003, 37 :123-151
[4]   Targeting a complex transcriptome: The construction of the mouse full-length cDNA encyclopedia [J].
Carninci, P ;
Waki, K ;
Shiraki, T ;
Konno, H ;
Shibata, K ;
Itoh, M ;
Aizawa, K ;
Arakawa, T ;
Ishii, Y ;
Sasaki, D ;
Bono, H ;
Kondo, S ;
Sugahara, Y ;
Saito, R ;
Osato, N ;
Fukuda, S ;
Sato, K ;
Watahiki, A ;
Hirozane-Kishikawa, T ;
Nakamura, M ;
Shibata, Y ;
Yasunishi, A ;
Kikuchi, N ;
Yoshiki, A ;
Kusakabe, M ;
Gustincich, S ;
Beisel, K ;
Pavan, W ;
Aidinis, V ;
Nakagawara, A ;
Held, WA ;
Iwata, H ;
Kono, T ;
Nakauchi, H ;
Lyons, P ;
Wells, C ;
Hume, DA ;
Fagiolini, M ;
Hensch, TK ;
Brinkmeier, M ;
Camper, S ;
Hirota, J ;
Mombaerts, P ;
Muramatsu, M ;
Okazaki, Y ;
Kawai, J ;
Hayashizaki, Y .
GENOME RESEARCH, 2003, 13 (6B) :1273-1289
[5]   The transcriptional landscape of the mammalian genome [J].
Carninci, P ;
Kasukawa, T ;
Katayama, S ;
Gough, J ;
Frith, MC ;
Maeda, N ;
Oyama, R ;
Ravasi, T ;
Lenhard, B ;
Wells, C ;
Kodzius, R ;
Shimokawa, K ;
Bajic, VB ;
Brenner, SE ;
Batalov, S ;
Forrest, ARR ;
Zavolan, M ;
Davis, MJ ;
Wilming, LG ;
Aidinis, V ;
Allen, JE ;
Ambesi-Impiombato, X ;
Apweiler, R ;
Aturaliya, RN ;
Bailey, TL ;
Bansal, M ;
Baxter, L ;
Beisel, KW ;
Bersano, T ;
Bono, H ;
Chalk, AM ;
Chiu, KP ;
Choudhary, V ;
Christoffels, A ;
Clutterbuck, DR ;
Crowe, ML ;
Dalla, E ;
Dalrymple, BP ;
de Bono, B ;
Della Gatta, G ;
di Bernardo, D ;
Down, T ;
Engstrom, P ;
Fagiolini, M ;
Faulkner, G ;
Fletcher, CF ;
Fukushima, T ;
Furuno, M ;
Futaki, S ;
Gariboldi, M .
SCIENCE, 2005, 309 (5740) :1559-1563
[6]   Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution [J].
Cheng, J ;
Kapranov, P ;
Drenkow, J ;
Dike, S ;
Brubaker, S ;
Patel, S ;
Long, J ;
Stern, D ;
Tammana, H ;
Helt, G ;
Sementchenko, V ;
Piccolboni, A ;
Bekiranov, S ;
Bailey, DK ;
Ganesh, M ;
Ghosh, S ;
Bell, I ;
Gerhard, DS ;
Gingeras, TR .
SCIENCE, 2005, 308 (5725) :1149-1154
[7]   INFORMATION ENHANCEMENT METHODS FOR LARGE-SCALE SEQUENCE-ANALYSIS [J].
CLAVERIE, JM ;
STATES, DJ .
COMPUTERS & CHEMISTRY, 1993, 17 (02) :191-201
[8]   The amazing complexity of the human transcriptome [J].
Frith, MC ;
Pheasant, M ;
Mattick, JS .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2005, 13 (08) :894-897
[9]   Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability [J].
Harrison, PM ;
Zheng, DY ;
Zhang, ZL ;
Carriero, N ;
Gerstein, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 (08) :2374-2383
[10]   An expressed pseudogene regulates the messenger-RNA stability of its homologous coding gene [J].
Hirotsune, S ;
Yoshida, N ;
Chen, A ;
Garrett, L ;
Sugiyama, F ;
Takahashi, S ;
Yagami, K ;
Wynshaw-Boris, A ;
Yoshiki, A .
NATURE, 2003, 423 (6935) :91-96