A survey of current work in biomedical text mining

Cited by: 421
Authors
Cohen, AM [1]
Hersh, WR [1]
Affiliations
[1] Oregon Hlth & Sci Univ, Sch Med, Dept Med Informat & Clin Epidemiol, Portland, OR USA
Keywords
text-mining; bioinformatics; natural language processing;
DOI
10.1093/bib/6.1.57
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The volume of published biomedical research, and therefore the underlying biomedical knowledge base, is expanding at an increasing rate. Among the tools that can aid researchers in coping with this information overload are text mining and knowledge extraction. Significant progress has been made in applying text mining to named entity recognition, text classification, terminology extraction, relationship extraction and hypothesis generation. Several research groups are constructing integrated flexible text-mining systems intended for multiple uses. The major challenge of biomedical text mining over the next 5-10 years is to make these systems useful to biomedical researchers. This will require enhanced access to full text, better understanding of the feature space of biomedical literature, better methods for measuring the usefulness of systems to users, and continued cooperation with the biomedical research community to ensure that their needs are addressed.
Pages: 57-71
Page count: 15