SentiHealth-Cancer: A sentiment analysis tool to help detecting mood of patients in online social networks

被引:82
作者
Rodrigues, Ramon Gouveia [1 ]
das Dores, Rafael Marques [1 ]
Camilo-Junior, Celso G. [1 ]
Rosa, Thierson Couto [1 ]
机构
[1] Univ Fed Goias, Inst Informat, BR-74001970 Goiania, Go, Brazil
关键词
Sentiment analysis; Opinion mining; Online social networks; Facebook; Cancer; BREAST-CANCER; SUPPORT NETWORK; INTERNET USE; INFORMATION; COMMUNICATION; HEALTH; DOMAIN; CLASSIFICATION; ADAPTATION; SYMPTOMS;
D O I
10.1016/j.ijmedinf.2015.09.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Background: Cancer is a critical disease that affects millions of people and families around the world. In 2012 about 14.1 million new cases of cancer occurred globally. Because of many reasons like the severity of some cases, the side effects of some treatments and death of other patients, cancer patients tend to be affected by serious emotional disorders, like depression, for instance. Thus, monitoring the mood of the patients is an important part of their treatment. Many cancer patients are users of online social networks and many of them take part in cancer virtual communities where they exchange messages commenting about their treatment or giving support to other patients in the community. Most of these communities are of public access and thus are useful sources of information about the mood of patients. Based on that, Sentiment Analysis methods can be useful to automatically detect positive or negative mood of cancer patients by analyzing their messages in these online communities. Objective: The objective of this work is to present a Sentiment Analysis tool, named SentiHealth-Cancer (SHC-pt), that improves the detection of emotional state of patients in Brazilian online cancer communities, by inspecting their posts written in Portuguese language. The SHC-pt is a sentiment analysis tool which is tailored specifically to detect positive, negative or neutral messages of patients in online communities of cancer patients. We conducted a comparative study of the proposed method with a set of general-purpose sentiment analysis tools adapted to this context. Methods: Different collections of posts were obtained from two cancer communities in Facebook. Additionally, the posts were analyzed by sentiment analysis tools that support the Portuguese language (Semantria and SentiStrength) and by the tool SHC-pt, developed based on the method proposed in this paper called SentiHealth. Moreover, as a second alternative to analyze the texts in Portuguese, the collected texts were automatically translated into English, and submitted to sentiment analysis tools that do not support the Portuguese language (AlchemyAPI and Textalytics) and also to Semantria and SentiStrength, using the English option of these tools. Six experiments were conducted with some variations and different origins of the collected posts. The results were measured using the following metrics: precision, recall, Fl-measure and accuracy Results: The proposed tool SHC-pt reached the best averages for accuracy and Fl-measure (harmonic mean between recall and precision) in the three sentiment classes addressed (positive, negative and neutral) in all experimental settings. Moreover, the worst accuracy value (58%) achieved by SHC-pt in any experiment is 11.53% better than the greatest accuracy (52%) presented by other addressed tools. Finally, the worst average Fl (48.46%) reached by SHC-pt in any experiment is 4.14% better than the greatest average Fl (46.53%) achieved by other addressed tools. Thus, even when we compare the SHC-pt results in complex scenario versus others in easier scenario the SHC-pt is better. Conclusions: This paper presents two contributions. First, it proposes the method SentiHealth to detect the mood of cancer patients that are also users of communities of patients in online social networks. Second, it presents an instantiated tool from the method, called SentiHealth-Cancer (SHC-pt), dedicated to automatically analyze posts in communities of cancer patients, based on SentiHealth. This contexttailored tool outperformed other general-purpose sentiment analysis tools at least in the cancer context. This suggests that the SentiHealth method could be instantiated as other disease-based tools during future works, for instance SentiHealth-HIV, SentiHealth-Stroke and SentiHealth-Sclerosis. (C) 2015 Published by Elsevier Ireland Ltd.
引用
收藏
页码:80 / 95
页数:16
相关论文
共 48 条
[1]
A. C. Society, 2014, WHAT IS CANCER
[2]
Abbasi A, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P823
[3]
Network-Based Modeling and Intelligent Data Mining of Social Media for Improving Care [J].
Akay, Altug ;
Dragomir, Andrei ;
Erlandsson, Bjorn-Erik .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (01) :210-218
[4]
AlchemyAPI, 2014, SENT AN API
[5]
Internet use by the public to search for health-related information [J].
AlGhamdi, Khalid M. ;
Moussa, Noura A. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2012, 81 (06) :363-373
[6]
Changes in Female Support Network Systems and Adaptation After Breast Cancer Diagnosis: Differences Between Older and Younger Patients [J].
Ashida, Sato ;
Palmquist, Aunchalee E. L. ;
Basen-Engquist, Karen ;
Singletary, S. Eva ;
Koehly, Laura M. .
GERONTOLOGIST, 2009, 49 (04) :549-559
[7]
Care more about customers: Unsupervised domain-independent aspect detection for sentiment analysis of customer reviews [J].
Bagheri, Ayoub ;
Saraee, Mohamad ;
de Jong, Franciska .
KNOWLEDGE-BASED SYSTEMS, 2013, 52 :201-213
[8]
Baojun Qiu, 2011, Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and IEEE Third International Conference on Social Computing (PASSAT/SocialCom 2011), P274, DOI 10.1109/PASSAT/SocialCom.2011.127
[9]
A systematic critique of diabetes on the world wide web for patients and their physicians [J].
Bedell, SE ;
Agrawal, A ;
Petersen, LE .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2004, 73 (9-10) :687-694
[10]
Seeking Support on Facebook: A Content Analysis of Breast Cancer Groups [J].
Bender, Jacqueline L. ;
Jimenez-Marroquin, Maria-Carolina ;
Jadad, Alejandro R. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2011, 13 (01) :165-175