QUANTITATIVE ANALYSIS OF LARGE AMOUNTS OF JOURNALISTIC TEXTS USING TOPIC MODELLING

被引:342
作者
Jacobi, Carina [1 ]
van Atteveldt, Wouter [2 ]
Welbers, Kasper [2 ]
机构
[1] Univ Vienna, Dept Commun, A-1010 Vienna, Austria
[2] Vrije Univ Amsterdam, Dept Commun Sci, Amsterdam, Netherlands
关键词
automatic content analysis; journalism; nuclear energy; topic models; FRAMES;
D O I
10.1080/21670811.2015.1093271
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
The huge collections of news content which have become available through digital technologies both enable and warrant scientific inquiry, challenging journalism scholars to analyse unprecedented amounts of texts. We propose Latent Dirichlet Allocation (LDA) topic modelling as a tool to face this challenge. LDA is a cutting edge technique for content analysis, designed to automatically organize large archives of documents based on latent topics, measured as patterns of word (co-) occurrence. We explain how this technique works, how different choices by the researcher affect the results and how the results can be meaningfully interpreted. To demonstrate its usefulness for journalism research, we conducted a case study of the New York Times coverage of nuclear technology from 1945 to the present, partially replicating a study by Gamson and Modigliani. This shows that LDA is a useful tool for analysing trends and patterns in news content in large digital news archives relatively quickly.
引用
收藏
页码:89 / 106
页数:18
相关论文
共 22 条
[1]  
[Anonymous], 2014, POL CONT MATT CONT A
[2]  
[Anonymous], 2002, MALLET: A machine learning for language toolkit
[3]  
[Anonymous], 2006, PROC 5 INT C LANGUAG
[4]  
[Anonymous], 2013, P 7 INT C LANGUAGE R
[5]  
Blei D.M., 2006, INT C MACHINE LEARNI, DOI DOI 10.1145/1143844.1143859
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]   Teaching the Computer to Code Frames in News: Comparing Two Supervised Machine Learning Approaches to Frame Analysis [J].
Burscher, Bjoern ;
Odijk, Daan ;
Vliegenthart, Rens ;
de Rijke, Maarten ;
de Vreese, Claes H. .
COMMUNICATION METHODS AND MEASURES, 2014, 8 (03) :190-206
[8]  
Chang J., 2009, Adv. Neural Inf. Process. Syst., V22, DOI DOI 10.5555/2984093.2984126
[9]   Exploiting affinities between topic modeling and the sociological perspective on culture: Application to newspaper coverage of US government arts funding [J].
DiMaggio, Paul ;
Nag, Manish ;
Blei, David .
POETICS, 2013, 41 (06) :570-606
[10]   FRAMING - TOWARD CLARIFICATION OF A FRACTURED PARADIGM [J].
ENTMAN, RM .
JOURNAL OF COMMUNICATION, 1993, 43 (04) :51-58