Beyond the clause: extraction of phosphorylation information from Medline abstracts

Cited by: 30
Authors
Narayanaswamy, M
Ravikumar, KE
Vijay-Shanker, K [1]
Affiliations
[1] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
[2] Anna Univ, AU KBC Res Ctr, Madras 600025, Tamil Nadu, India
DOI
10.1093/bioinformatics/bti1011
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Phosphorylation is an important biochemical reaction that plays a critical role in signal transduction pathways and cell-cycle processes. A text mining system to extract the phosphorylation relation from the literature is reported. The focus of this paper is on the new methods developed and implemented to connect and merge pieces of information about phosphorylation mentioned in different sentences in the text. The effectiveness and accuracy of the system as a whole, as well as those of the methods for extraction beyond a clause or sentence, are evaluated using an independently annotated dataset, the Phospho.ELM database. The new methods developed to merge pieces of information from different sentences are shown to raise recall significantly with little change in precision.
Pages: I319-I327
Number of pages: 9