Mining the biomedical literature in the genomic era: An overview

被引:139
作者
Shatkay, H [1 ]
Feldman, R
机构
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
[2] Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel
关键词
biomedical literature mining; information retrieval; information extraction; text mining; PubMed; genomics;
D O I
10.1089/106652703322756104
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The past decade has seen a tremendous growth in the amount of experimental and computational biomedical data, specifically in the areas of genomics and proteomics. This growth is accompanied by an accelerated increase in the number of biomedical publications discussing the findings. In the last few years, there has been a lot of interest within the scientific community in literature-mining tools to help sort through this abundance of literature and find the nuggets of information most relevant and useful for specific analysis tasks. This paper provides a road map to the various literature-mining methods, both in general and within bioinformatics. It surveys the disciplines involved in unstructured-text analysis, categorizes current work in biomedical literature mining with respect to these disciplines, and provides examples of text analysis methods applied towards meeting some of the current challenges in bioinformatics.
引用
收藏
页码:821 / 855
页数:35
相关论文
共 149 条
[1]  
ALLEN J, 1995, NATURAL LANGAUGE UND
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
ANDRADE MA, 1997, P AAAI C INT SYST MO
[4]  
[Anonymous], PSB 2000
[5]  
[Anonymous], P 3 INT C KNOWL DISC
[6]  
[Anonymous], 2005, EUR C MACH LEARN
[7]  
APPELT DE, 1999, INT JOINT C ART INT
[8]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[9]  
Aumann Y, 1999, LECT NOTES ARTIF INT, V1704, P277
[10]  
Bader GD, 2003, NUCLEIC ACIDS RES, V31, P248, DOI 10.1093/nar/gkg056