Machine learning in bioinformatics

被引:498
作者
Larranaga, Pedro
Calvo, Borja
Santana, Roberto
Bielza, Concha
Galdiano, Josu
Inza, Inaki
Lozano, Jose A.
Armananzas, Ruben
Santafe, Guzman
Perez, Aritz
Robles, Victor
机构
[1] Univ Basque Country, Dept Comp Sci & Artificial Intelligence, Intelligent Syst Grp, San Sebastian 20018, Spain
[2] Madrid Tech Univ, Sch Comp Sci, Madrid, Spain
[3] Harvard Univ, Sch Med, Cambridge, MA 02138 USA
[4] Univ Politecn Madrid, Dept Comp Syst Architecture & Technol, E-28040 Madrid, Spain
关键词
machine learning; bioinformatics; supervised classification; clustering; probabilistic graphical models; optimisation; heuristic; genomics; proteomics; microarray; system biology; evolution; text mining;
D O I
10.1093/bib/bbk007
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This article reviews machine learning methods for bioinformatics. It presents modelling methods, such as supervised classification, clustering and probabilistic graphical models for knowledge discovery, as well as deterministic and stochastic heuristics for optimization. Applications in genomics, proteomics, systems biology, evolution and text mining are also shown.
引用
收藏
页码:86 / 112
页数:27
相关论文
共 273 条
[21]   Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information [J].
Bao, L ;
Cui, Y .
BIOINFORMATICS, 2005, 21 (10) :2185-2190
[22]   LVB: parsimony and simulated annealing in the search for phylogenetic trees [J].
Barker, D .
BIOINFORMATICS, 2004, 20 (02) :274-275
[23]   Supervised machine learning techniques for the classification of metabolic disorders in newborns [J].
Baumgartner, C ;
Böhm, C ;
Baumgartner, D ;
Marini, G ;
Weinberger, K ;
Olgemöller, B ;
Liebl, B ;
Roscher, AA .
BIOINFORMATICS, 2004, 20 (17) :2985-2996
[24]  
Ben-Bassat M., 1982, Handbook of statistics, V2, P773, DOI DOI 10.1016/S0169-7161(82)02038-0
[25]   Tissue classification with gene expression profiles [J].
Ben-Dor, A ;
Bruhn, L ;
Friedman, N ;
Nachman, I ;
Schummer, M ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :559-583
[26]   Parallel Monte Carlo methods for physical mapping of chromosomes [J].
Bhandarkar, SM ;
Huang, JL ;
Arnold, J .
CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, :64-75
[27]  
BLANCO R, 2001, P WORKSH BAY MOD MED, P29
[28]  
Blazewicz J, 2005, LECT NOTES COMPUT SC, V3449, P22
[29]   Application of tabu search strategy for finding low energy structure of protein [J].
Blazewicz, J ;
Lukasiak, P ;
Milostan, M .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 35 (1-2) :135-145
[30]   RNA tertiary structure determination: NOE pathways construction by tabu search [J].
Blazewicz, J ;
Szachniuk, M ;
Wojtowicz, A .
BIOINFORMATICS, 2005, 21 (10) :2356-2361