Treating Words as Data with Error: Uncertainty in Text Statements of Policy Positions

被引:171
作者
Benoit, Kenneth [1 ]
Laver, Michael [2 ]
Mikhaylov, Slava [1 ]
机构
[1] Trinity Coll Dublin, Dept Polit Sci, Dublin 2, Ireland
[2] NYU, Dept Polit, New York, NY 10012 USA
关键词
PARTIES;
D O I
10.1111/j.1540-5907.2009.00383.x
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
Political text offers extraordinary potential as a source of information about the policy positions of political actors. Despite recent advances in computational text analysis, human interpretative coding of text remains an important source of text-based data, ultimately required to validate more automatic techniques. The profession's main source of cross-national, time-series data on party policy positions comes from the human interpretative coding of party manifestos by the Comparative Manifesto Project (CMP). Despite widespread use of these data, the uncertainty associated with each point estimate has never been available, undermining the value of the dataset as a scientific resource. We propose a remedy. First, we characterize processes by which CMP data are generated. These include inherently stochastic processes of text authorship, as well as of the parsing and coding of observed text by humans. Second, we simulate these error-generating processes by bootstrapping analyses of coded quasi-sentences. This allows us to estimate precise levels of nonsystematic error for every category and scale reported by the CMP for its entire set of 3,000-plus manifestos. Using our estimates of these errors, we show how to correct biased inferences, in recent prominently published work, derived from statistical analyses of error-contaminated CMP data.
引用
收藏
页码:495 / 513
页数:19
相关论文
共 34 条
[1]   Are niche parties fundamentally different from mainstream parties? - The causes and the electoral consequences of Western European parties' policy shifts, 1976-1998 [J].
Adams, James ;
Clark, Michael ;
Ezrow, Lawrence ;
Glasgow, Garrett .
AMERICAN JOURNAL OF POLITICAL SCIENCE, 2006, 50 (03) :513-529
[2]  
[Anonymous], 1994, Designing Social Inquiry: Scientific Inference in Qualitative Research
[3]  
[Anonymous], 2006, Monographs on Statistics and Applied Probability
[4]  
[Anonymous], 2001, Mapping policy preferences: Estimates for parties, electors, and governments 1945-1998, DOI DOI 10.1093/OSO/9780199244003.003.0005
[5]  
[Anonymous], 1993, INTRO BOOTSTRAP
[6]  
[Anonymous], 2004, ECONOMET J
[7]  
[Anonymous], 2001, MAPPING POLICY PREFE
[8]  
ARELLANO M, 2003, ADV TEXT ECONOMET, pR11
[9]   Benchmarks for text analysis: A response to Budge and Pennings [J].
Benoit, Kenneth ;
Laver, Michael .
ELECTORAL STUDIES, 2007, 26 (01) :130-135
[10]  
Benoit Kenneth., 2006, PARTY POLICY MODERN