Translation initiation sites prediction with mixture Gaussian models in human cDNA sequences

被引:21
作者
Li, GL
Leong, TY
Zhang, LX
机构
[1] Natl Univ Singapore, Med Comp Lab, Sch Comp, Singapore 117543, Singapore
[2] Natl Univ Singapore, Dept Math, Singapore 117543, Singapore
关键词
bioinformatics; classification; feature extraction; mixture Gaussian model; translation initiation sites;
D O I
10.1109/TKDE.2005.133
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Translation initiation sites (TISs) are important signals in cDNA sequences. Many research efforts have tried to predict TISs in cDNA sequences. In this paper, we propose to use mixture Gaussian models for TIS prediction. Using both local features and some features generated from global measures, the proposed method predicts TISs with a sensitivity of 98 percent and a specificity of 93.6 percent. Our method outperforms many other existing methods in sensitivity while keeping specificity high. We attribute the improvement in sensitivity to the nature of the global features and the mixture Gaussian models.
引用
收藏
页码:1152 / 1160
页数:9
相关论文
共 33 条
[1]  
Agarwal P., 1998, Proceedings of the Second Annual International Conference on Computational Molecular Biology, RECOMB '98, P2
[2]  
Bishop C. M., 1996, Neural networks for pattern recognition
[3]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[4]   TRANSFER RNA(IMET) FUNCTIONS IN DIRECTING THE SCANNING RIBOSOME TO THE START SITE OF TRANSLATION [J].
CIGAN, AM ;
FENG, L ;
DONAHUE, TF .
SCIENCE, 1988, 242 (4875) :93-97
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]  
Derst C., 2000, International Journal of Computers, Systems and Signals, V1, P169
[7]  
DEVAUL RW, 2004, CREATING GAUSSIAN MI
[8]   Gene-specific regulation by general translation factors [J].
Dever, TE .
CELL, 2002, 108 (04) :545-556
[9]   The gene identification problem: An overview for developers [J].
Fickett, JW .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :103-118
[10]   Translation initiation start prediction in human cDNAs with high accuracy [J].
Hatzigeorgiou, AG .
BIOINFORMATICS, 2002, 18 (02) :343-350