An Unsupervised Graph Based Continuous Word Representation Method for Biomedical Text Mining

Cited by: 11
Authors
Jiang, Zhenchao [1]
Li, Lishuang [1]
Huang, Degen [1]
Affiliation
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Liaoning, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Natural language processing; machine learning; connectionism and neural nets; object representation; EXTRACTION;
DOI
10.1109/TCBB.2015.2478467
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
In biomedical text mining tasks, distributed word representations have succeeded in capturing semantic regularities, but most are shallow-window-based models, which are not sufficient for expressing the meaning of words. To represent words using deeper information, we make explicit the semantic regularities that emerge in word relations, including dependency relations and context relations, and propose a novel architecture for computing continuous vector representations by leveraging those relations. The performance of our model is measured on a word analogy task and a Protein-Protein Interaction Extraction (PPIE) task. Experimental results show that our method performs better overall than other word representation models on the word analogy task and has many advantages in biomedical text mining.
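For readers unfamiliar with the word analogy task named in the abstract, the sketch below shows one common way such a task is scored: the vector-offset method with cosine similarity. The record does not state which scoring rule the authors used, so this is an assumption, and the function names and the three-dimensional toy vectors are hypothetical values chosen only for illustration.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def solve_analogy(embeddings, a, b, c):
    """Answer 'a is to b as c is to ?' with the vector-offset method.

    Assumption: the answer is the word whose unit vector is closest (by
    cosine similarity) to b - a + c, excluding the three query words.
    """
    unit = {word: normalize(vec) for word, vec in embeddings.items()}
    target = normalize(
        [vb - va + vc for va, vb, vc in zip(unit[a], unit[b], unit[c])]
    )
    candidates = [word for word in unit if word not in (a, b, c)]
    return max(
        candidates,
        key=lambda word: sum(x * y for x, y in zip(unit[word], target)),
    )


# Hypothetical toy embeddings; real models use hundreds of dimensions.
toy_embeddings = {
    "gene": [1.0, 0.0, 0.0],
    "genes": [1.0, 0.0, 1.0],
    "protein": [0.0, 1.0, 0.0],
    "proteins": [0.0, 1.0, 1.0],
    "cell": [1.0, 1.0, 0.0],
}

answer = solve_analogy(toy_embeddings, "gene", "genes", "protein")
print(answer)
```

In an evaluation of this kind, accuracy is typically the fraction of analogy questions for which the returned word matches the expected one.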
Pages: 634-642
Number of pages: 9