Finding kinetic parameters using text mining

被引:26
作者
Hakenberg, J
Schmeier, S
Kowald, A
Klipp, E
Leser, U
机构
[1] Humboldt Univ, Dept Comp Sci, D-10099 Berlin, Germany
[2] Max Planck Inst Mol Genet, Kinet Modeling Grp, Berlin, Germany
关键词
D O I
10.1089/1536231041388366
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The mathematical modeling and description of complex biological processes has become more and more important over the last years. Systems biology aims at the computational simulation of complex systems, up to whole cell simulations. An essential part focuses on solving a large number of parameterized differential equations. However, measuring those parameters is an expensive task, and finding them in the literature is very laborious. We developed a text mining system that supports researchers in their search for experimentally obtained parameters for kinetic models. Our system classifies full text documents regarding the question whether or not they contain appropriate data using a support vector machine. We evaluated our approach on a manually tagged corpus of 800 documents and found that it outperforms keyword searches in abstracts by a factor of five in terms of precision.
引用
收藏
页码:131 / 152
页数:22
相关论文
共 38 条
[1]  
AIZERMAN MA, 1965, AUTOMAT REM CONTR+, V25, P821
[2]  
[Anonymous], 1908, BIOMETRIKA, V6, P1
[3]  
[Anonymous], 1998, EUR C MACH LEARN
[4]  
Blaschke Christian, 2002, Brief Bioinform, V3, P154, DOI 10.1093/bib/3.2.154
[5]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction, DOI 10.1017/CBO9780511801389
[6]  
DEBRUIJN B, 2002, EFMI WORKSH NAT LANG
[7]   PreBIND and Textomy - mining the biomedical literature for protein-protein interactions using a support vector machine [J].
Donaldson, I ;
Martin, J ;
de Bruijn, B ;
Wolting, C ;
Lay, V ;
Tuekam, B ;
Zhang, SD ;
Baskin, B ;
Bader, GD ;
Michalickova, K ;
Pawson, T ;
Hogue, CWV .
BMC BIOINFORMATICS, 2003, 4 (1)
[8]  
Ewens W.J., 2001, STAT METHODS BIOINFO
[9]  
FAULSTICH L, 2003, WORKSH DAT MIN TEXT
[10]   The FlyBase database of the Drosophila genome projects and community literature [J].
Gelbart, W ;
Bayraktaroglu, L ;
Bettencourt, B ;
Campbell, K ;
Crosby, M ;
Emmert, D ;
Hradecky, P ;
Huang, Y ;
Letovsky, S ;
Matthews, B ;
Russo, S ;
Schroeder, A ;
Smutniak, F ;
Zhou, P ;
Zytkovicz, M ;
Ashburner, M ;
Drysdale, R ;
de Grey, A ;
Foulger, R ;
Millburn, G ;
Yamada, C ;
Kaufman, T ;
Matthews, K ;
Gilbert, D ;
Grumbling, G ;
Strelets, V ;
Shemen, C ;
Rubin, G ;
Berman, B ;
Frise, E ;
Gibson, M ;
Harris, N ;
Kaminker, J ;
Lewis, S ;
Marshall, B ;
Misra, S ;
Mungall, C ;
Prochnik, S ;
Richter, J ;
Smith, C ;
Shu, S ;
Tupy, J ;
Wiel, C .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :172-175