LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships

Cited by: 22
Authors
Barbosa-Silva, Adriano [1, 2, 3]
Soldatos, Theodoros G. [3, 4]
Magalhaes, Ivan L. F. [1]
Pavlopoulos, Georgios A. [3]
Fontaine, Jean-Fred [2]
Andrade-Navarro, Miguel A. [2]
Schneider, Reinhard [3]
Ortega, J. Miguel [1]
Affiliations
[1] Univ Fed Minas Gerais, ICB, Lab Biodados, Dpto Bioquim & Imunol, BR-31270901 Belo Horizonte, MG, Brazil
[2] Max Delbruck Ctr Mol Med, Computat Biol & Data Min Grp, D-13125 Berlin, Germany
[3] EMBL Heidelberg, D-69117 Heidelberg, Germany
[4] LIFE Biosyst GmbH, D-69115 Heidelberg, Germany
Source
BMC BIOINFORMATICS | 2010 / Vol. 11
Keywords
MOLECULAR-BIOLOGY; SALICYLIC-ACID; INFORMATION; STRESS; EXTRACTION; NETWORKS; VIEW;
DOI
10.1186/1471-2105-11-70
Chinese Library Classification
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Background: Biological knowledge is represented in scientific literature that often describes the function of genes/proteins (bioentities) in terms of their interactions (biointeractions). Such bioentities are often related to biological concepts of interest that are specific to a particular research field. Therefore, studying the current literature on a selected topic deposited in public databases facilitates the generation of novel hypotheses associating a set of bioentities with a common context. Results: We created a text mining system (LAITOR: Literature Assistant for Identification of Terms co-Occurrences and Relationships) that analyses co-occurrences of bioentities, biointeractions, and other biological terms in MEDLINE abstracts. The method accounts for the position of the co-occurring terms within sentences or abstracts. The system detected abstracts mentioning protein-protein interactions in a standard test (BioCreative II IAS test data) with a precision of 0.82-0.89 and a recall of 0.48-0.70. We illustrate the application of LAITOR to the detection of plant response genes in a dataset of 1000 abstracts relevant to the topic. Conclusions: Text mining tools that combine the extraction of interacting bioentities and biological concepts with network displays can be helpful in developing reasonable hypotheses in different scientific backgrounds.
Pages: 10