Improving graph-based random walks for complex question answering using syntactic, shallow semantic and extended string subsequence kernels

被引：17

作者：

Chali, Yllias ^{[1
]}

Hasan, Sadid A. ^{[1
]}

Joty, Shafiq R. ^{[2
]}

机构：

[1] Univ Lethbridge, Lethbridge, AB T1K 3M4, Canada

[2] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada

来源：

INFORMATION PROCESSING & MANAGEMENT | 2011年 / 47卷 / 06期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Complex question answering; Graph-based method; Syntactic kernel; Shallow semantic kernel; Extended string subsequence kernel;

D O I：

10.1016/j.ipm.2010.10.002

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The task of answering complex questions requires inferencing and synthesizing information from multiple documents that can be seen as a kind of topic-oriented, informative multi-document summarization. In generic summarization the stochastic, graph-based random walk method to compute the relative importance of textual units (i.e. sentences) is proved to be very successful. However, the major limitation of the TF*IDF approach is that it only retains the frequency of the words and does not take into account the sequence, syntactic and semantic information. This paper presents the impact of syntactic and semantic information in the graph-based random walk method for answering complex questions. Initially, we apply tree kernel functions to perform the similarity measures between sentences in the random walk framework. Then, we extend our work further to incorporate the Extended String Subsequence Kernel (ESSK) to perform the task in a similar manner. Experimental results show the effectiveness of the use of kernels to include the syntactic and semantic information for this task. (C) 2010 Elsevier Ltd. All rights reserved.

引用

页码：843 / 855

页数：13

共 29 条

[1]

Amigo E., 2004, ACL 04 P 42 ANN M AS, P207

[2]

[Anonymous], P 5 C LANG RES EV GE

[3] Word-sequence kernels [J].

Cancedda, N ;

Gaussier, E ;

Goutte, C ;

Renders, JM .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1059-1082

[4]

Chali Y., 2008, P 20 IEEE INT C TOOL

[5]

Chali Y., 2007, P 4 INT C SEMANTIC E, P476

[6]

Chali Y., 2008, P 46 ANN M ASS COMP

[7]

CHARNIAK E, 1999, CS9912 BROWN U COMP

[8]

Collins M, 2002, ADV NEUR IN, V14, P625

[9] LexRank: Graph-based lexical centrality as salience in text summarization [J].

Erkan, G ;

Radev, DR .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :457-479

[10]

Fellbaum C., 1998, WordNet, DOI DOI 10.7551/MITPRESS/7287.001.0001

← 1 2 3 →