MutationFinder: a high-performance system for extracting point mutation mentions from text

被引:90
作者
Caporaso, J. Gregory [1 ]
Baumgartner, William A., Jr.
Randolph, David A.
Cohen, K. Bretonnel
Hunter, Lawrence
机构
[1] Univ Colorado, Hlth Sci Ctr, Dept Biochem & Mol Genet, Aurora, CO USA
[2] Univ Colorado, Hlth Sci Ctr, Ctr Computat Pharmacol, Aurora, CO USA
[3] Motorola Mobile Devices, Libertyville, IL USA
[4] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[5] Univ Colorado, Dept Linguist, Boulder, CO 80309 USA
关键词
D O I
10.1093/bioinformatics/btm235
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
ummary: Discussion of point mutations is ubiquitous in biomedical literature, and manually compiling databases or literature on mutations in specific genes or proteins is tedious. We present an open-source, rule-based system, MutationFincler, for extracting point mutation mentions from text. On blind test data, it achieves nearly perfect precision and a markedly improved recall over a baseline.
引用
收藏
页码:1862 / 1865
页数:4
相关论文
共 7 条
[1]   Mutation mining - A prospector's tale [J].
Baker, CJO ;
Witte, R .
INFORMATION SYSTEMS FRONTIERS, 2006, 8 (01) :47-57
[2]   OSIRIS: a tool for retrieving literature about sequence variants [J].
Bonis, Julio ;
Furlong, Laura Ines ;
Sanz, Ferran .
BIOINFORMATICS, 2006, 22 (20) :2567-2569
[3]  
Hatzivassiloglou V., 2001, Bioinformatics, V17, P97
[4]   Automated extraction of mutation data from the literature: application of MuteXt to G protein-coupled receptors and nuclear hormone receptors [J].
Horn, F ;
Lau, AL ;
Cohen, FE .
BIOINFORMATICS, 2004, 20 (04) :557-568
[5]  
OGREN PV, 2006, P 9 INT PROT C, P73
[6]   Automatic extraction of mutations from Medline and cross-validation with OMIM [J].
Rebholz-Schuhmann, D ;
Marcel, S ;
Albert, S ;
Tolle, R ;
Casari, G ;
Kirsch, H .
NUCLEIC ACIDS RESEARCH, 2004, 32 (01) :135-142
[7]  
YEH A, 2005, BMC BIOINFORMATIC S1, V6