A Practical Guide to Big Data Research in Psychology

被引:96
作者
Chen, Eric Evan [1 ]
Wojcik, Sean P. [2 ]
机构
[1] Univ Calif Irvine, Dept Psychol & Social Behav, Irvine, CA 92697 USA
[2] Upworthy, Dept Data & Analyt, New York, NY USA
关键词
big data; text analysis; data analytics; data mining; machine learning; SELECTION; REGRESSION; LANGUAGE; COUPLE; TEXT;
D O I
10.1037/met0000111
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The massive volume of data that now covers a wide variety of human behaviors offers researchers in psychology an unprecedented opportunity to conduct innovative theory-and data-driven field research. This article is a practical guide to conducting big data research, covering data management, acquisition, processing, and analytics (including key supervised and unsupervised learning data mining methods). It is accompanied by walkthrough tutorials on data acquisition, text analysis with latent Dirichlet allocation topic modeling, and classification with support vector machines. Big data practitioners in academia, industry, and the community have built a comprehensive base of tools and knowledge that makes big data research accessible to researchers in a broad range of fields. However, big data research does require knowledge of software programming and a different analytical mindset. For those willing to acquire the requisite skills, innovative analyses of unexpected or previously untapped data sources can offer fresh ways to develop, test, and extend theories. When conducted with care and respect, big data research can become an essential complement to traditional research.
引用
收藏
页码:458 / 474
页数:17
相关论文
共 96 条
[81]  
Paluck EL, 2014, HANDBOOK OF RESEARCH METHODS IN SOCIAL AND PERSONALITY PSYCHOLOGY, SECOND EDITION, P81
[82]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[83]   Psychological aspects of natural language use: Our words, our selves [J].
Pennebaker, JW ;
Mehl, MR ;
Niederhoffer, KG .
ANNUAL REVIEW OF PSYCHOLOGY, 2003, 54 :547-577
[84]  
Raschka S., 2015, Python machine learning
[85]  
Raymond, 2000, CATHEDRAL BAZAAR
[86]  
Russell M. A., 2013, Mining the social web: Data mining Facebook, twitter, LinkedIn, google+, GitHub, and more
[87]   Social media for large studies of behavior [J].
Ruths, Derek ;
Pfeffer, Juergen .
SCIENCE, 2014, 346 (6213) :1063-+
[88]  
Sartor G, 2011, LAW GOV TECHNOL SER, V4, P21, DOI 10.1007/978-94-007-1887-6_3
[89]   STATISTICAL INTERPRETATION OF TERM SPECIFICITY AND ITS APPLICATION IN RETRIEVAL [J].
SPARCKJONES, K .
JOURNAL OF DOCUMENTATION, 1972, 28 (01) :11-+