Extensive feature detection of N-terminal protein sorting signals

Cited by: 591
Authors
Bannai, H
Tamada, Y
Maruyama, O
Nakai, K
Miyano, S
Affiliations
[1] Univ Tokyo, Inst Med Sci, Ctr Human Genome, Minato Ku, Tokyo 1088639, Japan
[2] Tokai Univ, Dept Math Sci, Hiratsuka, Kanagawa 2591292, Japan
[3] Kyushu Univ, Fac Math, Fukuoka 8128581, Japan
DOI
10.1093/bioinformatics/18.2.298
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: The prediction of localization sites of various proteins is an important and challenging problem in the field of molecular biology. TargetP, by Emanuelsson et al. (J. Mol. Biol., 300, 1005-1016, 2000), is a neural network-based system which is currently the best predictor in the literature for N-terminal sorting signals. One drawback of neural networks, however, is that it is generally difficult to understand and interpret how and why they make their predictions. In this paper, we aim to generate simple and interpretable rules as predictors while still achieving a practical prediction accuracy. We adopt an approach consisting of an extensive search over simple rules and various attributes, partially guided by human intuition.
Results: We have succeeded in finding rules whose prediction accuracies come close to those of TargetP, while still retaining a very simple and interpretable form. We also discuss and interpret the discovered rules.
Pages: 298-305
Number of pages: 8
Related papers
21 in total
[1]  
Bannai H., 2001, Proceedings of the Fourteenth International Florida Artificial Intelligence Research Society Conference, P233
[2]   Chloroplast transit peptides: structure, function and evolution [J].
Bruce, BD .
TRENDS IN CELL BIOLOGY, 2000, 10 (10) :440-447
[3]   Using subsite coupling to predict signal peptides [J].
Chou, KC .
PROTEIN ENGINEERING, 2001, 14 (02) :75-79
[4]   Computational method to predict mitochondrially imported proteins and their targeting sequences [J].
Claros, MG ;
Vincens, P .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1996, 241 (03) :779-786
[5]   Solvation energy in protein folding and binding [J].
Eisenberg, D ;
McLachlan, AD .
NATURE, 1986, 319 (6050) :199-203
[6]   Predicting subcellular localization of proteins based on their N-terminal amino acid sequence [J].
Emanuelsson, O ;
Nielsen, H ;
Brunak, S ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) :1005-1016
[7]   ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites [J].
Emanuelsson, O ;
Nielsen, H ;
Von Heijne, G .
PROTEIN SCIENCE, 1999, 8 (05) :978-984
[8]   AAindex: Amino acid index database [J].
Kawashima, S ;
Kanehisa, M .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :374-374
[9]   A simple method for displaying the hydropathic character of a protein [J].
Kyte, J ;
Doolittle, RF .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 157 (01) :105-132
[10]  
Maruyama O, 1998, LECT NOTES ARTIF INT, V1532, P105