Particle Swarm Optimisation for Protein Motif Discovery

Cited by: 25
Authors
Bill C. H. Chang
Asanga Ratnaweera
Saman K. Halgamuge
Harry C. Watson
Affiliations
[1] University of Melbourne, Mechatronics Research Group, Mechanical and Manufacturing Engineering
[2] University of Melbourne, Thermofluids Research Group, Mechanical and Manufacturing Engineering
Keywords
particle swarm optimisation; protein sequence motif; motif discovery; symbolic data optimisation
DOI
10.1023/B:GENP.0000023688.42515.92
Abstract
In this paper, a modified particle swarm optimisation algorithm is proposed for protein sequence motif discovery. Protein sequences are represented as chains of symbols, and a protein sequence motif is a short sequence that exists in most of the protein sequence families. Protein sequence symbols are converted into numbers using a one-to-one amino acid translation table. The simulation uses the EGF and C2H2 Zinc Finger protein families obtained from the PROSITE database. Simulation results show that the modified particle swarm optimisation algorithm is effective in obtaining globally optimal sequence patterns, achieving 96.9% and 99.5% classification accuracy for the EGF and C2H2 Zinc Finger protein families, respectively. It also achieves a better true positive hit result than the motifs published in the PROSITE database.
Pages: 203-214
Number of pages: 11