MedlineRanker: flexible ranking of biomedical literature

Cited by: 67
Authors
Fontaine, Jean-Fred [1 ]
Barbosa-Silva, Adriano [1 ]
Schaefer, Martin [1 ]
Huska, Matthew R. [1 ]
Muro, Enrique M. [1 ]
Andrade-Navarro, Miguel A. [1 ]
Affiliations
[1] Max Delbruck Ctr Mol Med, Computat Biol & Data Min Grp, D-13125 Berlin, Germany
Keywords
DATABASE; TOOL;
DOI
10.1093/nar/gkp353
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The biomedical literature is represented by millions of abstracts available in the Medline database. These abstracts can be queried with the PubMed interface, which provides a keyword-based Boolean search engine. This approach shows limitations in the retrieval of abstracts related to very specific topics, as it is difficult for a nonexpert user to find all of the most relevant keywords related to a biomedical topic. Additionally, when searching for more general topics, the same approach may return hundreds of unranked references. To address these issues, text mining tools have been developed to help scientists focus on relevant abstracts. We have implemented the MedlineRanker webserver, which allows a flexible ranking of Medline for a topic of interest without expert knowledge. Given some abstracts related to a topic, the program deduces automatically the most discriminative words in comparison to a random selection. These words are used to score other abstracts, including those from not yet annotated recent publications, which can be then ranked by relevance. We show that our tool can be highly accurate and that it is able to process millions of abstracts in a practical amount of time. MedlineRanker is free for use and is available at http://cbdm.mdc-berlin.de/tools/medlineranker.
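The ranking idea described in the abstract (find the words that best discriminate topic abstracts from a random selection, then score other abstracts with them) can be illustrated with a minimal sketch. This is not the MedlineRanker implementation: the function names, the tokenizer, the log-odds weighting and the add-one smoothing are all assumptions chosen for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and return its set of distinct alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def word_weights(topic_abstracts, background_abstracts):
    """Return a log-odds weight per word (hypothetical scoring scheme).

    A weight is positive when the word occurs in a larger fraction of the
    topic abstracts than of the background (random) abstracts.
    """
    topic_df = Counter(w for a in topic_abstracts for w in tokenize(a))
    back_df = Counter(w for a in background_abstracts for w in tokenize(a))
    n_topic, n_back = len(topic_abstracts), len(background_abstracts)
    weights = {}
    for word in set(topic_df) | set(back_df):
        # Add-one smoothing (an assumption) keeps both fractions non-zero.
        p_topic = (topic_df[word] + 1) / (n_topic + 2)
        p_back = (back_df[word] + 1) / (n_back + 2)
        weights[word] = math.log(p_topic / p_back)
    return weights


def score(abstract, weights):
    """Sum the weights of the distinct words found in one abstract."""
    return sum(weights.get(w, 0.0) for w in tokenize(abstract))


def rank(candidates, weights):
    """Order candidate abstracts from highest to lowest score."""
    return sorted(candidates, key=lambda a: score(a, weights), reverse=True)
```

In such a sketch, `word_weights` would be called once with the user's topic abstracts and a random background sample, and `rank` would then order any set of candidate abstracts, including recent ones without annotations, since only the words of the text are used.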
Pages: W141-W146
Number of pages: 6
Related papers
18 items in total
[1]   MINT: the molecular INTeraction database [J].
Chatr-aryamontri, Andrew ;
Ceol, Arnaud ;
Palazzi, Luisa Montecchi ;
Nardelli, Giuliano ;
Schneider, Maria Victoria ;
Castagnoli, Luisa ;
Cesareni, Gianni .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D572-D574
[2]   GoPubMed: Exploring PubMed with the gene ontology [J].
Doms, A ;
Schroeder, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W783-W786
[3]   askMEDLINE: A free-text, natural language query tool for MEDLINE/PubMed [J].
Fontelo P. ;
Liu F. ;
Ackerman M. .
BMC Medical Informatics and Decision Making, 5 (1)
[4]   PubFinder: a tool for improving retrieval rate of relevant PubMed abstracts [J].
Goetz, T ;
von der Lieth, CW .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W774-W778
[5]   Lewis DD, 1998, European Conference on Machine Learning, P4
[6]   Text similarity: an alternative way to search MEDLINE [J].
Lewis, James ;
Ossowski, Stephan ;
Hicks, Justin ;
Errami, Mounir ;
Garner, Harold R. .
BIOINFORMATICS, 2006, 22 (18) :2298-2304
[7]   PubMed related articles: a probabilistic topic-based model for content similarity [J].
Lin, Jimmy ;
Wilbur, W. John .
BMC BIOINFORMATICS, 2007, 8 (1)
[8]   XplorMed: a tool for exploring MEDLINE abstracts [J].
Perez-Iratxeta, C ;
Bork, P ;
Andrade, MA .
TRENDS IN BIOCHEMICAL SCIENCES, 2001, 26 (09) :573-575
[9]   POULTER GL, 2008, THESIS U CAPE TOWN C
[10]   MScanner: a classifier for retrieving medline citations [J].
Poulter, Graham L. ;
Rubin, Daniel L. ;
Altman, Russ B. ;
Seoighe, Cathal .
BMC BIOINFORMATICS, 2008, 9 (1)