Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome

被引:157
作者
Zavolan, M
Kondo, S
Schönbach, C
Adachi, J
Hume, DA
Hayashizaki, Y
Gaasterland, T
机构
[1] Rockefeller Univ, Lab Computat Genom, New York, NY 10021 USA
[2] RIKEN, Yokohama Inst, GSC, Bioinformat Grp,Biomed Knowledge Discovery Team, Yokohama, Kanagawa 2300045, Japan
[3] Univ Queensland, Inst Mol Biosci, ARC Special Res Ctr Funct & Appl Genom, Brisbane, Qld 4072, Australia
[4] RIKEN, Genome Sci Lab, Wako, Saitama 3510198, Japan
关键词
D O I
10.1101/gr.1017303
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We analyzed the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences. We developed a new computational procedure to identify and classify the forms of splice variation evident in this data set and organized the results into a publicly accessible database that can be used for future expression array construction, structural genomics, and analyses of the mechanism and regulation of alternative splicing. Statistical analysis shows that at least 41% and possibly as much as 60% of multiexon genes in mouse have multiple splice forms. Of the transcription units with multiple splice forms, 49% contain transcripts in which the apparent use of an alternative transcription start (stop) is accompanied by alternative splicing of the initial (terminal) exon. This implies that alternative transcription may frequently induce alternative splicing. The fact that 73% of all exons with splice variation fall within the annotated coding region indicates that most splice variation is likely to affect the protein form. Finally, we compared the set of constitutive (present in all transcripts) exons with the set of cryptic (present only in some transcripts) exons and found statistically significant differences in their length distributions, the nucleoticle distributions around their splice junctions, and the frequencies of occurrence of several short sequence motifs.
引用
收藏
页码:1290 / 1300
页数:11
相关论文
共 51 条
[1]   The intracisternal A-particle proximal enhancer-binding protein activates transcription and is identical to the RNA- and DNA-binding protein p54(nrb)/NonO [J].
Basu, A ;
Dong, BH ;
Krainer, AR ;
Howe, CC .
MOLECULAR AND CELLULAR BIOLOGY, 1997, 17 (02) :677-686
[2]   EXON RECOGNITION IN VERTEBRATE SPLICING [J].
BERGET, SM .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1995, 270 (06) :2411-2414
[3]   The SRm160/300 splicing coactivator subunits [J].
Blencowe, BJ ;
Baurén, G ;
Eldridge, AG ;
Issner, R ;
Nickerson, JA ;
Rosonina, E ;
Sharp, PA .
RNA, 2000, 6 (01) :111-120
[4]   Alternative splicing and genome complexity [J].
Brett, D ;
Pospisil, H ;
Valcárcel, J ;
Reich, J ;
Bork, P .
NATURE GENETICS, 2002, 30 (01) :29-30
[5]   EST comparison indicates 38% of human mRNAs contain possible alternative splice forms [J].
Brett, D ;
Hanke, J ;
Lehmann, G ;
Haase, S ;
Delbrück, S ;
Krueger, S ;
Reich, J ;
Bork, P .
FEBS LETTERS, 2000, 474 (01) :83-86
[6]   RNA-BINDING SPECIFICITY OF HNRNP A1 - SIGNIFICANCE OF HNRNP A1 HIGH-AFFINITY BINDING-SITES IN PRE-MESSENGER-RNA SPLICING [J].
BURD, CG ;
DREYFUSS, G .
EMBO JOURNAL, 1994, 13 (05) :1197-1204
[7]   Alternative gene form discovery and candidate gene selection from gene indexing projects [J].
Burke, J ;
Wang, H ;
Hide, W ;
Davison, DB .
GENOME RESEARCH, 1998, 8 (03) :276-290
[8]  
CACERES JF, 1997, EUKARYOTIC MRNA PROC, P174
[9]   High-efficiency full-length cDNA cloning by biotinylated CAP trapper [J].
Carninci, P ;
Kvam, C ;
Kitamura, A ;
Ohsumi, T ;
Okazaki, Y ;
Itoh, M ;
Kamiya, M ;
Shibata, K ;
Sasaki, N ;
Izawa, M ;
Muramatsu, M ;
Hayashizaki, Y ;
Schneider, C .
GENOMICS, 1996, 37 (03) :327-336
[10]   An intron element modulating 5' splice site selection in the hnRNP A1 pre-mRNA interacts with hnRNP A1 [J].
Chabot, B ;
Blanchette, M ;
Lapierre, I ;
LaBranche, H .
MOLECULAR AND CELLULAR BIOLOGY, 1997, 17 (04) :1776-1786