Logitlinear models for the prediction of splice sites in plant pre-mRNA sequences

被引:31
作者
Kleffe, J [1 ]
Hermann, K [1 ]
Vahrson, W [1 ]
Wittig, B [1 ]
Brendel, V [1 ]
机构
[1] STANFORD UNIV,DEPT MATH,STANFORD,CA 94305
关键词
D O I
10.1093/nar/24.23.4709
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Pre-mRNA splicing in plants, while generally similar to the processes in vertebrates and yeast, is thought to involve plant specific cis-acting elements, Both monocot and dicot introns are typically strongly enriched in U nucleotides, and AU- or U-rich segments are thought to be involved in intron recognition, splice site selection, and splicing efficiency. We have applied logit-linear models to find optimal combinations of splice site variables for the purpose of separating true splice sites from a large excess of potential sites, It is shown that plant splice site prediction from sequence inspection is greatly improved when compositional contrast between exons and introns is considered in addition to degree of matching to the splice site consensus (signal quality), The best model involves subclassification of splice sites according to the identity of the base immediately upstream of the GU and AG signals and gives substantial performance gains compared with conventional profile methods.
引用
收藏
页码:4709 / 4718
页数:10
相关论文
共 34 条