Mining the biomedical literature in the genomic era: An overview

被引:139
作者
Shatkay, H [1 ]
Feldman, R
机构
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
[2] Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel
关键词
biomedical literature mining; information retrieval; information extraction; text mining; PubMed; genomics;
D O I
10.1089/106652703322756104
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The past decade has seen a tremendous growth in the amount of experimental and computational biomedical data, specifically in the areas of genomics and proteomics. This growth is accompanied by an accelerated increase in the number of biomedical publications discussing the findings. In the last few years, there has been a lot of interest within the scientific community in literature-mining tools to help sort through this abundance of literature and find the nuggets of information most relevant and useful for specific analysis tasks. This paper provides a road map to the various literature-mining methods, both in general and within bioinformatics. It surveys the disciplines involved in unstructured-text analysis, categorizes current work in biomedical literature mining with respect to these disciplines, and provides examples of text analysis methods applied towards meeting some of the current challenges in bioinformatics.
引用
收藏
页码:821 / 855
页数:35
相关论文
共 149 条
[11]  
Bafna V, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P3
[12]   Gene expression informatics - it's all in your mine [J].
Bassett, DE ;
Eisen, MB ;
Boguski, MS .
NATURE GENETICS, 1999, 21 (Suppl 1) :51-55
[13]  
BEAR J, 1997, P 6 TEXT RETR C TREC, P367
[14]   Clustering gene expression patterns [J].
Ben-Dor, A ;
Shamir, R ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :281-297
[15]  
Blaschke C, 2002, IEEE INTELL SYST, V17, P14, DOI 10.1109/MIS.2002.999215
[16]  
Blaschke C, 1999, Proc Int Conf Intell Syst Mol Biol, P60
[17]  
BLASCHKE C, 2003, BIOCREATIVE CRITICAL
[18]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[19]  
BRILL E, 1992, P 3 ANN APPL NAT LAN
[20]  
BRILL E, 1999, NATURAL LANGAUGE PRO