Modeling splicing sites with pairwise correlations

被引:86
作者
Arita, M [1 ]
Tsuda, K [1 ]
Asai, K [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, CBRC, Koto Ku, Tokyo 1350064, Japan
关键词
D O I
10.1093/bioinformatics/18.suppl_2.S27
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A new method for finding subtle patterns in sequences is introduced. It approximates the multiple correlations among residuals with pair-wise correlations, with the learning cost O(m(2)n) where n is the number of training sequences, each of length m. The method suits to model splicing sites in human DNA, which are reported to have higher-order dependencies. Results: By computational experiments, the prediction accuracy of our model was shown to surpass that of previously reported Markov models for the prediction of acceptor sites in human.
引用
收藏
页码:S27 / S34
页数:8
相关论文
共 19 条
[1]  
Agarwal P., 1998, Proceedings of the Second Annual International Conference on Computational Molecular Biology, RECOMB '98, P2
[2]  
Asai K, 1998, Pac Symp Biocomput, P228
[3]  
Bahadur R, 1961, STUDIES ITEM ANAL PR, V6, P158
[4]   PREDICTION OF HUMAN MESSENGER-RNA DONOR AND ACCEPTOR SITES FROM THE DNA-SEQUENCE [J].
BRUNAK, S ;
ENGELBRECHT, J ;
KNUDSEN, S .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 220 (01) :49-65
[5]  
Burge CB, 1998, N COMP BIOC, V32, P129
[6]   Modeling splice sites with Bayes networks [J].
Cai, DY ;
Delcher, A ;
Kao, B ;
Kasif, S .
BIOINFORMATICS, 2000, 16 (02) :152-158
[7]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[8]  
Durbin R., 1998, BIOL SEQUENCE ANAL P
[9]  
Hertz J., 1991, Introduction to the Theory of Neural Computation
[10]  
Humphreys K, 1999, ARTIFICIAL INTELLIGENCE AND STATISTICS 99, PROCEEDINGS, P209