Structural Topic Models for Open-Ended Survey Responses

被引:1148
作者
Roberts, Margaret E. [1 ]
Stewart, Brandon M. [2 ,3 ]
Tingley, Dustin [2 ,3 ]
Lucas, Christopher [2 ,3 ]
Leder-Luis, Jetson [4 ]
Gadarian, Shana Kushner [5 ]
Albertson, Bethany [6 ]
Rand, David G. [7 ,8 ]
机构
[1] Univ Calif San Diego, Dept Polit Sci, La Jolla, CA 92093 USA
[2] Harvard Univ, Dept Govt, Cambridge, MA 02138 USA
[3] Harvard Univ, Inst Quantitat Social Sci, Cambridge, MA 02138 USA
[4] CALTECH, Div Humanities & Social Sci, Pasadena, CA 91126 USA
[5] Syracuse Univ, Dept Polit Sci, Maxwell Sch Citizenship & Publ Affairs, Syracuse, NY 13244 USA
[6] Univ Texas Austin, Dept Govt, Austin, TX 78712 USA
[7] Yale Univ, Dept Psychol, New Haven, CT 06511 USA
[8] Yale Univ, Dept Econ, New Haven, CT 06511 USA
关键词
D O I
10.1111/ajps.12103
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
Collection and especially analysis of open-ended survey responses are relatively rare in the discipline and when conducted are almost exclusively done through human coding. We present an alternative, semiautomated approach, the structural topic model (STM) (Roberts, Stewart, and Airoldi 2013; Roberts et al. 2013), that draws on recent developments in machine learning based analysis of textual data. A crucial contribution of the method is that it incorporates information about the document, such as the author's gender, political affiliation, and treatment assignment (if an experimental study). This article focuses on how the STM is helpful for survey researchers and experimentalists. The STM makes analyzing open-ended responses easier, more revealing, and capable of being used to estimate treatment effects. We illustrate these innovations with analysis of text from surveys and experiments.
引用
收藏
页码:1064 / 1082
页数:19
相关论文
共 43 条
[41]  
Sontag D., 2009, ADV NEUR INF PROC WO
[42]  
Wang C., 2011, P 17 ACM SIGKDD INT, P448, DOI [10.1145/2020408.2020480, DOI 10.1145/2020408.2020480]
[43]  
Zou J.Y., 2012, NIPS, P3005