Getting to the (c)ore of knowledge: mining biomedical literature

被引:79
作者
de Bruijn, B [1 ]
Martin, J [1 ]
机构
[1] Natl Res Council Canada, Inst Informat Technol, Ottawa, ON K1A 0R6, Canada
关键词
natural language processing; Medline; molecular biology; knowledge acquisition (computer); semantics; indexing and abstracting;
D O I
10.1016/S1386-5056(02)00050-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Literature mining is the process of extracting and combining facts from scientific publications. In recent years, many computer programs have been designed to extract various molecular biology findings from Medline abstracts or full-text articles. The present article describes the range of text mining techniques that have been applied to scientific documents. It divides 'automated reading' into four general subtasks: text categorization, named entity tagging, fact extraction, and collection-wide analysis. Literature mining offers powerful methods to support knowledge discovery and the construction of topic maps and ontologies. An overview is given of recent developments in medical language processing. Special attention is given to the domain particularities of molecular biology, and the emerging synergy between literature mining and molecular databases accessible through Internet. Crown Copyright (C) 2002 Published by Elsevier Science Ireland Ltd. All rights reserved.
引用
收藏
页码:7 / 18
页数:12
相关论文
共 79 条