A new model for linguistic summarization of heterogeneous data: an application to tourism web data sources

被引:18
作者
Carrasco, Ramon A. [1 ]
Villar, Pedro [1 ]
机构
[1] Univ Granada, Dept Software Engn, E-18071 Granada, Spain
关键词
Data summarization; Fuzzy linguistic modelling; Opinion aggregation; Heterogeneous data integration; GROUP DECISION-MAKING; INFORMATION-RETRIEVAL;
D O I
10.1007/s00500-011-0740-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present the problem of aggregating heterogeneous data from various websites with opinions about high end hotels into a database. We present the fuzzy model based on the semantic translation as a tool to obtain a linguistic summarization. The characteristics of this model (necessary to solve the problem) are not together on any of the existing linguistic models: the management of the input heterogeneous data (natural language included); the procurement of linguistic results with high precision and good interpretability; and the use of unbalanced linguistic term sets described by trapezoidal membership functions for defining the initial linguistic terms. We applied it to aggregate data from certain high end hotels websites and we show a case study using the high end hotels located in Granada (Spain) from such websites during a year. With this aggregated information, a data analyst can make several analyses with the benefit of easy linguistic interpretability and a high precision. The solution proposed here can be used to similar aggregation problems.
引用
收藏
页码:135 / 151
页数:17
相关论文
共 57 条
[1]  
[Anonymous], 2011, OFF WID SEL BEST PRI, P2011
[2]  
[Anonymous], 2011, PREM INT ONL SERV TR
[3]  
[Anonymous], 2011, TRAV AG PROM RECR AC
[4]  
[Anonymous], 2011, EUR LEAD ONL HOT RES
[5]  
[Anonymous], 2011, MULT MAG DED TOUR
[6]  
[Anonymous], P 12 INT C INF PROC
[7]  
[Anonymous], 2003, INTELL DATA ANAL, DOI DOI 10.3233/IDA-2003-7206
[8]  
[Anonymous], ORACLE TEXT REFERENC
[9]  
[Anonymous], 2011, LUX TRAV WEBS COND N
[10]  
[Anonymous], 2002, P 8 ACM SIGKDD INT C, DOI [DOI 10.1145/775047.775098, 10.1145/775047.775098]