A sentence classification technique using intention association expressions

Cited by: 15
Authors
Kadoya, Y [1]
Morita, K [1]
Fuketa, M [1]
Oono, M [1]
Atlam, ES [1]
Sumitomo, T [1]
Aoe, JI [1]
Affiliations
[1] Univ Tokushima, Dept Informat Sci & Intelligent Syst, Tokushima 7708506, Japan
Keywords
sentence classification; intention association expressions; deterministic multi-attribute; pattern matching; intention understanding;
DOI
10.1080/00207160412331336071
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104 [Applied Mathematics];
Abstract
Although there are many text classification techniques based on the vector space model, it is difficult for them to detect meaning related to the user's intention (complaint, encouragement, request, invitation, etc.). The approach discussed in this paper is useful for understanding focus points in conversation. We present a technique for determining the speaker's intention for sentences in conversation. Intention association expressions are introduced, and formal descriptions with weights are defined using these expressions to construct an intention classification. A deterministic multi-attribute pattern-matching algorithm is used to determine the intention class efficiently. In simulation results for 681 email messages comprising 5859 sentences, the multi-attribute pattern-matching algorithm is about 44.5 times faster than the Aho and Corasick method. The precision and recall of intention classification of sentences are 91% and 95%, respectively. The precision and recall of extraction of unnecessary sentences are 98% and 96%, respectively. The precision and recall of the classification of each email are 88% and 89%, respectively.
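As a rough illustration of classifying a sentence by weighted intention association expressions, the following is a minimal sketch and not the authors' method: it uses naive sliding-window matching rather than the deterministic multi-attribute pattern-matching algorithm described in the abstract. The expression table, the weights, and the function names `tokenize` and `classify` are all hypothetical.

```python
from collections import defaultdict

# Hypothetical intention association expressions:
# (token sequence, intention class, weight). Not taken from the paper.
EXPRESSIONS = [
    (("could", "you"), "request", 2.0),
    (("please",), "request", 1.0),
    (("not", "working"), "complaint", 2.0),
    (("disappointed",), "complaint", 1.5),
    (("join", "us"), "invitation", 2.0),
]


def tokenize(sentence):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (t.strip(".,!?;:").lower() for t in sentence.split())
    return [t for t in tokens if t]


def classify(sentence, expressions=EXPRESSIONS):
    """Sum the weights of every matching expression per intention class.

    Returns (best_class, scores); best_class is None if nothing matches.
    """
    tokens = tokenize(sentence)
    scores = defaultdict(float)
    for pattern, intention, weight in expressions:
        n = len(pattern)
        # Naive scan: compare the pattern against every window of n tokens.
        for i in range(len(tokens) - n + 1):
            if tuple(tokens[i:i + n]) == pattern:
                scores[intention] += weight
    if not scores:
        return None, {}
    best = max(scores, key=scores.get)
    return best, dict(scores)


if __name__ == "__main__":
    print(classify("Could you please send the report?"))
    print(classify("The printer is not working."))
```

The naive scan checks each expression separately, so its cost grows with the number of expressions; a multi-pattern matcher such as the Aho and Corasick method processes the input in a single pass, which is the baseline the abstract compares against.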
Pages: 777-792
Page count: 16
Related papers
24 in total
[1]
EFFICIENT STRING MATCHING - AID TO BIBLIOGRAPHIC SEARCH [J].
AHO, AV ;
CORASICK, MJ .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :333-340
[2]
Efficient multi-attribute pattern matching [J].
Ando, K ;
Mizobuchi, S ;
Shishibori, M ;
Aoe, J .
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1998, 66 (1-2) :21-38
[3]
An improvement of the Aho-Corasick machine [J].
Ando, K ;
Kinoshita, T ;
Shishibori, M ;
Aoe, J .
INFORMATION SCIENCES, 1998, 111 (1-4) :139-151
[5]
AN EFFICIENT DIGITAL SEARCH ALGORITHM BY USING A DOUBLE-ARRAY STRUCTURE [J].
AOE, JI .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1989, 15 (09) :1066-1077
[6]
A new method for selecting English field association terms of compound words and its knowledge representation [J].
Atlam, E ;
Morita, K ;
Fuketa, M ;
Aoe, J .
INFORMATION PROCESSING & MANAGEMENT, 2002, 38 (06) :807-821
[7]
Documents similarity measurement using field association terms [J].
Atlam, ES ;
Fuketa, M ;
Morita, K ;
Aoe, J .
INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (06) :809-824
[8]
Fellbaum C, 1998, WORDNET ELECT LEXICA
[9]
A document classification method by using field association words [J].
Fuketa, M ;
Lee, S ;
Tsuji, T ;
Okada, M ;
Aoe, J .
INFORMATION SCIENCES, 2000, 126 (1-4) :57-70
[10]
KWON O, 1999, P 18 INT C COMP PROC, V1, P153