Sentiment classification of online reviews to travel destinations by supervised machine learning approaches

被引:368
作者
Ye, Qiang [1 ,2 ]
Zhang, Ziqiong [1 ]
Law, Rob [2 ]
机构
[1] Harbin Inst Technol, Sch Management, Harbin 150006, Peoples R China
[2] Hong Kong Polytech Univ, Sch Hotel & Tourism Management, Hong Kong, Hong Kong, Peoples R China
关键词
Sentiment classification; Online reviews; Travel destinations; Supervised machine learning algorithm;
D O I
10.1016/j.eswa.2008.07.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid growth in Internet applications in tourism has lead to an enormous amount of personal reviews for travel-related information on the Web. These reviews can appear in different forms like BBS, blogs, Wiki or forum websites. More importantly, the information in these reviews is valuable to both travelers and practitioners for various understanding and planning processes. An intrinsic problem of the overwhelming information on the Internet, however, is information overloading as users are simply unable to read all the available information. Query functions in search engines like Yahoo and Google can help users find some of the reviews that they needed about specific destinations. The returned pages from these search engines are still beyond the visual capacity of humans. In this research, sentiment classification techniques were incorporated into the domain of mining reviews from travel blogs. Specifically, we compared three supervised machine learning algorithms of Naive Bayes, SVM and the character based N-gram model for sentiment classification of the reviews on travel blogs for seven popular travel destinations in the US and Europe. Empirical findings indicated that the SVM and N-gram approaches outperformed the Naive Bayes approach, and that when training datasets had a large number of reviews, all three approaches reached accuracies of at least 80%. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6527 / 6535
页数:9
相关论文
共 34 条
[1]  
*AL I, 2006, LINGPIPE NAT LANG TO
[2]  
Anderson EW, 1998, J. Service Res, V1, P5, DOI 10.1177/109467059800100102
[3]  
[Anonymous], 2007, P 11 INT C ART INT L
[4]  
Beineke P., 2004, P 12 ANN M ASS COMPU, P263, DOI DOI 10.3115/1218955.1218989
[5]  
Carpenter B., 2005, P 2005 ASS COMPUTATI, P1
[6]  
Chaovalit P., 2005, HAWAII INT C SYSTEM, V4, p112c, DOI DOI 10.1109/HICSS.2005.445
[7]  
Cheung C.M. Y., 2004, Proceedings of the 8th Pacific-Asia conference on information systems, P2100
[8]   Destination image representation on the web: Content analysis of Macau travel related websites [J].
Choi, Soojin ;
Lehto, Xinran Y. ;
Morrison, Alastair M. .
TOURISM MANAGEMENT, 2007, 28 (01) :118-129
[9]  
Dave K., 2003, holderProceedings Of The 12th International Conference On World Wide Web, P519, DOI [DOI 10.1145/775152.775226, 10.1145/775152.775226]
[10]   The digitization of word of mouth: Promise and challenges of online feedback mechanisms [J].
Dellarocas, C .
MANAGEMENT SCIENCE, 2003, 49 (10) :1407-1424