The learning vector quantization algorithm applied to automatic text classification tasks

被引:41
作者
Martin-Valdivia, M. T. [1 ]
Urena-Lopez, L. A. [1 ]
Garcia-Vega, M. [1 ]
机构
[1] Univ Jaen, Dept Comp, E-23071 Jaen, Spain
关键词
Learning Vector Quantization (LVQ); Word Sense Disambiguation (WSD); Text Categorization (TC); SENSEVAL; Reuters-2 1578 text collection; Natural Language Processing (NLP);
D O I
10.1016/j.neunet.2006.12.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Automatic text classification is an important task for many natural language processing applications. This paper presents a neural approach to develop a text classifier based on the Learning Vector Quantization (LVQ) algorithm. The LVQ model is a classification method that uses a competitive Supervised learning algorithm. The proposed method has been applied to two specific tasks: text categorization and word sense disambiguation. Experiments were carried Out using the REUTERs-21578 text collection (for text categorization) and the SENSEVAL-3 Corpus (for word sense disambiguation). The results obtained are very promising and show that our neural approach based on the LVQ algorithm is an alternative to other classification systems. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:748 / 756
页数:9
相关论文
共 61 条
[1]
[Anonymous], 2005, INT J HYBRID INTELL, DOI DOI 10.3233/HIS-2004-13-402
[2]
[Anonymous], 2000, HDB NATURAL LANGUAGE
[3]
[Anonymous], 2000, FDN STAT NATURAL LAN
[4]
AUTOMATED LEARNING OF DECISION RULES FOR TEXT CATEGORIZATION [J].
APTE, C ;
DAMERAU, F ;
WEISS, SM .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1994, 12 (03) :233-251
[5]
Baeza-Yates R.A., 1999, Modern Information Retrieval
[6]
Brown Peter F., 1991, P 29 ANN M ASS COMP
[7]
Chen HC, 1998, J AM SOC INFORM SCI, V49, P582, DOI 10.1002/(SICI)1097-4571(1998)49:7<582::AID-ASI2>3.0.CO
[8]
2-V
[9]
DITTENBACH M, 2001, P ICANN01 INT C ART, P500
[10]
EYHERAMENDY S, 2003, SPARSE BAYESIAN CLAS