"Right Time, Right Place" Health Communication on Twitter: Value and Accuracy of Location Information

Cited by: 75
Authors
Burton, Scott H. [1]
Tanner, Kesler W. [1]
Giraud-Carrier, Christophe G. [1]
West, Joshua H. [2]
Barnes, Michael D. [2]
Affiliations
[1] Brigham Young Univ, Dept Comp Sci, Computat Hlth Sci Res Grp, Provo, UT 84602 USA
[2] Brigham Young Univ, Dept Hlth Sci, Computat Hlth Sci Res Grp, Provo, UT 84602 USA
Keywords
Twitter; GPS Location; Infodemiology; Surveillance; Intervention; Social Media; Web; Interventions; Behavior; Age
DOI
10.2196/jmir.2121
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Background: Twitter provides various types of location data, including exact Global Positioning System (GPS) coordinates, which could be used for infoveillance and infodemiology (ie, the study and monitoring of online health information), health communication, and interventions. Despite its potential, Twitter location information is not well understood or well documented, limiting its public health utility.
Objective: The objective of this study was to document and describe the various types of location information available in Twitter. The different types of location data that can be ascertained from Twitter users are described. This information is key to informing future research on the availability, usability, and limitations of such location data.
Methods: Location data was gathered directly from Twitter using its application programming interface (API). The maximum tweets allowed by Twitter were gathered (1% of the total tweets) over 2 separate weeks in October and November 2011. The final dataset consisted of 23.8 million tweets from 9.5 million unique users. Frequencies for each of the location options were calculated to determine the prevalence of the various location data options by region of the world, time zone, and state within the United States. Data from the US Census Bureau were also compiled to determine population proportions in each state, and Pearson correlation coefficients were used to compare each state's population with the number of Twitter users who enable the GPS location option.
Results: The GPS location data could be ascertained for 2.02% of tweets and 2.70% of unique users. Using a simple text-matching approach, 17.13% of user profiles in the 4 continental US time zones were able to be used to determine the user's city and state. Agreement between GPS data and data from the text-matching approach was high (87.69%). Furthermore, there was a significant correlation between the number of Twitter users per state and the 2010 US Census state populations (r >= 0.97, P < .001).
Conclusions: Health researchers exploring ways to use Twitter data for disease surveillance should be aware that the majority of tweets are not currently associated with an identifiable geographic location. Location can be identified for approximately 4 times the number of tweets using a straightforward text-matching process compared to using the GPS location information available in Twitter. Given the strong correlation between both data gathering methods, future research may consider using more qualitative approaches with higher yields, such as text mining, to acquire information about Twitter users' geographical location. (J Med Internet Res 2012;14(6):e156) doi:10.2196/jmir.2121
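The abstract names three computations without giving details: text matching of profile locations to a city and state, agreement between GPS-derived and text-derived locations, and a Pearson correlation between per-state user counts and census populations. The sketch below is only an illustration of how such steps might look, not the authors' method. The "City, State" pattern, the tiny state table, and the function names `match_city_state`, `agreement_rate`, and `pearson_r` are all assumptions introduced here.

```python
import re
from math import sqrt

# Hypothetical, deliberately tiny lookup table (assumption for illustration);
# a real analysis would need all states and a city gazetteer.
US_STATES = {"UT": "Utah", "CA": "California", "NY": "New York", "TX": "Texas"}
STATE_NAME_TO_ABBR = {name.lower(): abbr for abbr, name in US_STATES.items()}

# Free-text profile locations of the form "City, ST" or "City, State name".
CITY_STATE = re.compile(r"^\s*([A-Za-z .'-]+?)\s*,\s*([A-Za-z ]+?)\s*$")


def match_city_state(profile_location):
    """Return (city, state_abbr) if the text looks like 'City, State', else None."""
    m = CITY_STATE.match(profile_location or "")
    if not m:
        return None
    city, state = m.group(1).strip(), m.group(2).strip()
    if state.upper() in US_STATES:
        abbr = state.upper()
    else:
        abbr = STATE_NAME_TO_ABBR.get(state.lower())
    if abbr is None:
        return None
    return city.title(), abbr


def agreement_rate(pairs):
    """Fraction of (gps_location, text_location) pairs that are identical."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("need at least one pair")
    return sum(1 for gps, text in pairs if gps == text) / len(pairs)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)
```

In use, `match_city_state` would be applied to each profile's free-text location, `agreement_rate` to users who have both a GPS-derived and a text-derived location, and `pearson_r` to per-state user counts paired with per-state census populations. A strict pattern like this one favors precision over yield: it rejects any location text that does not parse cleanly.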
Pages: 366-376
Number of pages: 11
Related papers
35 in total
[1] Adair, JG. The Hawthorne Effect - A Reconsideration of the Methodological Artifact [J]. Journal of Applied Psychology, 1984, 69(02): 334-345
[2] Backinger, Cathy L.; Pilsner, Alison M.; Augustson, Erik M.; Frydl, Andrea; Phillips, Todd; Rowden, Jessica. YouTube as a source of quitting smoking information [J]. Tobacco Control, 2011, 20(02): 119-122
[3] Backstrom L, 2010, 2010 P 19 INT WORLD, P61
[4] Brownstein JS, 2009, NEW ENGL J MED, V360, P2153, DOI [10.1056/NEJMp0900702, 10.1056/NEJMp0904012]
[5] Burton S, 2012, 2012 P 2 ACM INT HLT, P81
[6] Cheng Z, 2010, 2010 P 19 ACM INT C, P759
[7] Chunara, Rumi; Andrews, Jason R.; Brownstein, John S. Social and News Media Enable Estimation of Epidemiological Patterns Early in the 2010 Haitian Cholera Outbreak [J]. American Journal of Tropical Medicine and Hygiene, 2012, 86(01): 39-45
[8] Eke, P. I. Using Social Media for Research and Public Health Surveillance [J]. Journal of Dental Research, 2011, 90(09): 1045-1046
[9] Estrin, Deborah; Sim, Ida. Open mHealth Architecture: An Engine for Health Care Innovation [J]. Science, 2010, 330(6005): 759-760
[10] Eysenbach, Gunther. Infodemiology and Infoveillance: Framework for an Emerging Set of Public Health Informatics Methods to Analyze Search, Communication and Publication Behavior on the Internet [J]. Journal of Medical Internet Research, 2009, 11(01)