Category-specific models for ranking effective paraphrases in community Question Answering

Cited by: 27
Authors
Figueroa, Alejandro [1, 2]
Neumann, Guenter [3]
Affiliations
[1] Yahoo, Res Latin Amer, Santiago, Chile
[2] Univ Diego Port, Escuela Ingn Informat, Santiago, Chile
[3] DFKI GmbH, D-66123 Saarbrucken, Germany
Keywords
Community-based Question Answering; Learning to rank; Question paraphrases; Question categories
DOI
10.1016/j.eswa.2014.02.004
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Platforms for community-based Question Answering (cQA) are playing an increasing role in the synergy of information-seeking and social networks. Being able to categorize user questions is very important, since these categories are good predictors of the underlying question goal, viz., informational or subjective. Furthermore, an effective cQA platform should be capable of detecting similar past questions and relevant answers, because it is known that a high number of best answers are reusable. Therefore, question paraphrasing is not only a useful but also an essential ingredient for effective search in cQA. However, the generated paraphrases do not necessarily lead to the same answer set, and might differ in their expected quality of retrieval, for example, in their power of identifying and ranking best answers higher. We propose a novel category-specific learning to rank approach for effectively ranking paraphrases for cQA. We describe a number of different large-scale experiments using logs from Yahoo! Search and Yahoo! Answers, and demonstrate that the subjective and objective nature of cQA questions dramatically affects the recall and ranking of past answers when fine-grained category information is taken into account. Category-specific models are then able to adapt well to the different degrees of objectivity and subjectivity of each category, and the more specific the models are, the better the results, especially when benefiting from effective semantic and syntactic features. © 2014 Elsevier Ltd. All rights reserved.
Pages: 4730-4742
Number of pages: 13