Task Independent Fine Tuning for Word Embeddings

被引:9
作者
Yang, Xuefeng [1 ]
Mao, Kezhi [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Fine tuning; word embedding; word representation learning; NEURAL-NETWORKS; LANGUAGE;
D O I
10.1109/TASLP.2016.2644863
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Representation learning of words, also known as word embedding technique, is based on the distributional hypothesis that words with similar semantic meanings have similar context. The selection of context window naturally has an influence on word vectors learned. However, it is found that the word vectors are often very sensitive to the defined context window, and unfortunately there is no unified optimal context window for all words. One impact of this issues is that, under a predefined context window, the semantic meanings of some words may not be well represented by the learned vectors. To alleviate the problem and improve word embeddings, we propose a task-independent fine-tuning framework in this paper. The main idea of the task-independent fine tuning is to integrate multiple word embeddings and lexical semantic resources to fine tune a target word embedding. The effectiveness of the proposed framework is tested by tasks of semantic similarity prediction, analogical reasoning, and sentence completion. Experiments results on six word embeddings and eight datasets show that the proposed fine-tuning framework could significantly improve word embeddings.
引用
收藏
页码:885 / 894
页数:10
相关论文
共 42 条
[1]   Syntactic and Semantic Features For Code-Switching Factored Language Models [J].
Adel, Heike ;
Ngoc Thang Vu ;
Kirchhoff, Katrin ;
Telaar, Dominic ;
Schultz, Tanja .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (03) :431-440
[2]  
[Anonymous], 1957, Studies in Linguistic Analysis
[3]  
[Anonymous], P 31 INT C MACH LEAR
[4]  
[Anonymous], 2009, Advances in neural information processing systems
[5]  
[Anonymous], 2002, ADV NEURAL INFORM PR
[6]  
[Anonymous], P 52 ANN M ASS COMP, DOI DOI 10.3115/VL/P14-1146
[7]  
[Anonymous], 2005, INT WORKSHOP ARTIFIC
[8]  
[Anonymous], P ACL
[9]  
[Anonymous], 2001, Proceedings of the 10th international conference on World Wide Web
[10]  
[Anonymous], 2012, P 29 INT C MACH LEAR