GeneView: a comprehensive semantic search engine for PubMed

被引:45
作者
Thomas, Philippe [1 ]
Starlinger, Johannes [1 ]
Vowinkel, Alexander [1 ]
Arzt, Sebastian [1 ]
Leser, Ulf [1 ]
机构
[1] Univ Berlin, Inst Comp Sci, D-10099 Berlin, Germany
关键词
SYSTEM; TEXT; IDENTIFICATION; TASK;
D O I
10.1093/nar/gks563
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Research results are primarily published in scientific literature and curation efforts cannot keep up with the rapid growth of published literature. The plethora of knowledge remains hidden in large text repositories like MEDLINE. Consequently, life scientists have to spend a great amount of time searching for specific information. The enormous ambiguity among most names of biomedical objects such as genes, chemicals and diseases often produces too large and unspecific search results. We present GeneView, a semantic search engine for biomedical knowledge. GeneView is built upon a comprehensively annotated version of PubMed abstracts and openly available PubMed Central full texts. This semi-structured representation of biomedical texts enables a number of features extending classical search engines. For instance, users may search for entities using unique database identifiers or they may rank documents by the number of specific mentions they contain. Annotation is performed by a multitude of state-of-the-art text-mining tools for recognizing mentions from 10 entity classes and for identifying protein-protein interactions. GeneView currently contains annotations for > 194 million entities from 10 classes for similar to 21 million citations with 271 000 full text bodies. GeneView can be searched at http://bc3.informatik.hu-berlin.de/.
引用
收藏
页码:W585 / W591
页数:7
相关论文
共 24 条
[1]   BioCreative III interactive task: an overview [J].
Arighi, Cecilia N. ;
Roberts, Phoebe M. ;
Agarwal, Shashank ;
Bhattacharya, Sanmitra ;
Cesareni, Gianni ;
Chatr-aryamontri, Andrew ;
Clematide, Simon ;
Gaudet, Pascale ;
Giglio, Michelle Gwinn ;
Harrow, Ian ;
Huala, Eva ;
Krallinger, Martin ;
Leser, Ulf ;
Li, Donghui ;
Liu, Feifan ;
Lu, Zhiyong ;
Maltais, Lois J. ;
Okazaki, Naoaki ;
Perfetto, Livia ;
Rinaldi, Fabio ;
Saetre, Rune ;
Salgado, David ;
Srinivasan, Padmini ;
Thomas, Philippe E. ;
Toldo, Luca ;
Hirschman, Lynette ;
Wu, Cathy H. .
BMC BIOINFORMATICS, 2011, 12
[2]   Manual curation is not sufficient for annotation of genomic databases [J].
Baumgartner, William A., Jr. ;
Cohen, K. Bretonnel ;
Fox, Lynne M. ;
Acquaah-Mensah, George ;
Hunter, Lawrence .
BIOINFORMATICS, 2007, 23 (13) :I41-I48
[3]   MutationFinder: a high-performance system for extracting point mutation mentions from text [J].
Caporaso, J. Gregory ;
Baumgartner, William A., Jr. ;
Randolph, David A. ;
Cohen, K. Bretonnel ;
Hunter, Lawrence .
BIOINFORMATICS, 2007, 23 (14) :1862-1865
[4]   Understanding PubMed® user search behavior through log analysis [J].
Dogan, Rezarta Islamaj ;
Murray, G. Craig ;
Neveol, Aurelie ;
Lu, Zhiyong .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2009,
[5]   GoPubMed: Exploring PubMed with the gene ontology [J].
Doms, A ;
Schroeder, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W783-W786
[6]   iHOP web services [J].
Fernandez, Jose M. ;
Hoffmann, Robert ;
Valencia, Alfonso .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W21-W26
[7]   LINNAEUS: A species name identification system for biomedical literature [J].
Gerner, Martin ;
Nenadic, Goran ;
Bergman, Casey M. .
BMC BIOINFORMATICS, 2010, 11
[8]  
Giuliano C., 2006, 11 C EUR CHAPT ASS C, P401
[9]   The GNAT library for local and remote gene mention normalization [J].
Hakenberg, Joerg ;
Gerner, Martin ;
Haeussler, Maximilian ;
Solt, Illes ;
Plake, Conrad ;
Schroeder, Michael ;
Gonzalez, Graciela ;
Nenadic, Goran ;
Bergman, Casey M. .
BIOINFORMATICS, 2011, 27 (19) :2769-2771
[10]  
Kola?rik C., 2008, P WORKSHOP BUILDING, P51