A simple physical model predicts small exon length variations

Cited by: 58
Authors
Chern, Tzu-Ming
van Nimwegen, Erik
Kai, Chikatoshi
Kawai, Jun
Carninci, Piero
Hayashizaki, Yoshihide
Zavolan, Mihaela [1]
Affiliations
[1] Univ Basel, Biozentrum, Div Bioinformat, Basel, Switzerland
[2] RIKEN, Genome Explorat Res Grp, Genome Network Project Core Grp, Genome Sci Ctr,Yokohama Inst, Yokohama, Kanagawa, Japan
[3] RIKEN, Genome Sci Lab, Discovery Res Inst, Wako Inst, Wako, Saitama 35101, Japan
DOI
10.1371/journal.pgen.0020045
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Among the most common splice variations are small exon length variations caused by the use of alternative donor or acceptor splice sites that lie in very close proximity on the pre-mRNA. Among these, three-nucleotide variations at so-called NAGNAG tandem acceptor sites have recently attracted considerable attention, and it has been suggested that these variations are regulated and serve to fine-tune protein forms by the addition or removal of a single amino acid. In this paper we first show that in-frame exon length variations are generally overrepresented and that this overrepresentation can be quantitatively explained by the effect of nonsense-mediated decay. Our analysis allows us to estimate that about 50% of frame-shifted coding transcripts are targeted by nonsense-mediated decay. Second, we show that a simple physical model, which assumes that the splicing machinery stochastically binds to nearby splice sites in proportion to the affinities of the sites, correctly predicts the relative abundances of different small length variations at both boundaries. Finally, using the same simple physical model, we show that for NAGNAG sites, the difference in affinities of the neighboring sites for the splicing machinery accurately predicts whether splicing will occur only at the first site, only at the second site, or whether three-nucleotide splice variants are likely to occur. Our analysis thus suggests that small exon length variations are the result of stochastic binding of the spliceosome at neighboring splice sites. Small exon length variations occur when there are nearby alternative splice sites that have similar affinity for the splicing machinery.
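The two quantitative ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the mapping from a site score to an affinity (`exp(score)`), and the 5% cutoff for calling a site unused are all assumptions made for illustration. Only the proportional-binding rule and the roughly 50% nonsense-mediated decay figure come from the abstract.

```python
import math


def site_usage_probabilities(scores):
    """Usage probability of each candidate splice site.

    Assumption: affinity = exp(score), where score is some log-scale
    measure of site strength. Usage is proportional to affinity.
    """
    top = max(scores)  # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def classify_tandem_site(score_first, score_second, min_minor_fraction=0.05):
    """Classify a tandem (e.g. NAGNAG) site from the affinity difference.

    min_minor_fraction is a hypothetical cutoff below which the weaker
    site is treated as effectively unused.
    """
    p_first, p_second = site_usage_probabilities([score_first, score_second])
    if p_second < min_minor_fraction:
        return "first site only"
    if p_first < min_minor_fraction:
        return "second site only"
    return "both sites"


def observed_frameshift_fraction(true_fraction, nmd_fraction=0.5):
    """Fraction of surviving transcripts that are frame-shifted.

    Assumes decay removes nmd_fraction of the frame-shifted transcripts
    and leaves in-frame transcripts untouched.
    """
    surviving_shifted = true_fraction * (1.0 - nmd_fraction)
    surviving_in_frame = 1.0 - true_fraction
    return surviving_shifted / (surviving_shifted + surviving_in_frame)
```

Under these assumptions, two sites with equal scores are each used half of the time, so both variants appear, while a large score difference leaves only the stronger site in use. The last function shows how removing about half of the frame-shifted transcripts raises the apparent share of in-frame variants.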
Pages: 606-613
Number of pages: 8