Detection and Interpretation of Opinion Expressions in Spoken Surveys

被引：4

作者：

Camelin, Nathalie ^{[1
]}

Bechet, Frederic ^{[1
]}

Damnati, Geraldine ^{[2
]}

De Mori, Renato ^{[1
]}

机构：

[1] Univ Avignon, LIA, F-84911 Avignon 09, France

[2] France Telecom R&D, F-22307 Lannion 07, France

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2010年 / 18卷 / 02期

关键词：

Automatic detection of in-domain speech data; automatic processing of telephone surveys; automatic speech recognition; spoken language understanding; spoken opinion analysis;

D O I：

10.1109/TASL.2009.2028918

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper describes a system for automatic opinion analysis from spoken messages collected in the context of a user satisfaction survey. Opinion analysis is performed from the perspective of opinion monitoring. A process is outlined for detecting segments expressing opinions in a speech signal. Methods are proposed for accepting or rejecting segments from messages that are not reliably analyzed due to the limitations of automatic speech recognition processes, for assigning opinion hypotheses to segments and for evaluating hypothesis opinion proportions. Specific language models are introduced for representing opinion concepts. These models are used for hypothesizing opinion carrying segments in a spoken message. Each segment is interpreted by a classifier based on the Adaboost algorithm which associates a pair of topic and polarity labels to each segment. The different processes are trained and evaluated on a telephone corpus collected in a deployed customer care service. The use of conditional random fields (CRFs) is also considered for detecting segments and results are compared for different types of data and approaches. By optimizing the choice of the strategy parameters, it is possible to estimate user opinion proportions with a Kullback-Leibler divergence of 0.047 bits with respect to the true proportions obtained with a manual annotation of the spoken messages. The proportions estimated with such a low divergence are accurate enough for monitoring user satisfaction over time.

引用

页码：369 / 381

页数：13

共 35 条

[1]

[Anonymous], P WORKSH SENT SUBJ T

[2]

[Anonymous], P 20 INT C COMPUTATI, DOI DOI 10.3115/1220355.1220555

[3]

[Anonymous], P INT PITTSB PA SEPT

[4]

BETHARD S, 2004, P AAAI SPRING S EXPL, P22

[5]

Bruce R. F., 1999, Natural Language Engineering, V5, P187, DOI 10.1017/S1351324999002181

[6]

CAMELIN N, 2008, P INT BRISB AUSTR, P475

[7]

CAMELIN N, 2009, P INT BRIGHT UK

[8]

CAMELIN N, 2006, P INT C SPOK LANG PR, P1041

[9]

Choi Y., 2005, P C HUMAN LANGUAGE T, P355

[10]

Dave K., 2003, Proceedings of the 12th international conference on world wide web, P519, DOI DOI 10.1145/775152.775226

← 1 2 3 4 →