When is best-worst best? A comparison of best-worst scaling, numeric estimation, and rating scales for collection of semantic norms

被引:34
作者
Hollis, Geoff [1 ]
Westbury, Chris [2 ]
机构
[1] Univ Alberta, Dept Comp Sci, 3-57 Athabasca Hall, Edmonton, AB T6G 2E8, Canada
[2] Univ Alberta, Dept Psychol, P217 Biol Sci Bldg, Edmonton, AB T6G 2E9, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Semantics; Semantic judgment; Best-worst scaling; Rating scales; Numeric estimation; ACQUISITION NORMS; LEXICAL DECISION; AGE; CONCRETENESS; FREQUENCY; EMOTION; VALENCE; AROUSAL; WORDS;
D O I
10.3758/s13428-017-1009-0
中图分类号
B841 [心理学研究方法];
学科分类号
040201 [基础心理学];
摘要
Large-scale semantic norms have become both prevalent and influential in recent psycholinguistic research. However, little attention has been directed towards understanding the methodological best practices of such norm collection efforts. We compared the quality of semantic norms obtained through rating scales, numeric estimation, and a less commonly used judgment format called best-worst scaling. We found that best-worst scaling usually produces norms with higher predictive validities than other response formats, and does so requiring less data to be collected overall. We also found evidence that the various response formats may be producing qualitatively, rather than just quantitatively, different data. This raises the issue of potential response format bias, which has not been addressed by previous efforts to collect semantic norms, likely because of previous reliance on a single type of response format for a single type of semantic judgment. We have made available software for creating best-worst stimuli and scoring best-worst data. We also made available new norms for age of acquisition, valence, arousal, and concreteness collected using best-worst scaling. These norms include entries for 1,040 words, of which 1,034 are also contained in the ANEW norms (Bradley & Lang, Affective norms for English words (ANEW): Instruction manual and affective ratings (pp. 1-45). Technical report C-1, the center for research in psychophysiology, University of Florida, 1999).
引用
收藏
页码:115 / 133
页数:19
相关论文
共 37 条
[1]
[Anonymous], 1990, MENTAL REPRESENTATIO, DOI DOI 10.1093/ACPROF:OSO/9780195066661.001.0001
[2]
Frequency in lexical processing [J].
Baayen, R. Harald ;
Milin, Petar ;
Ramscar, Michael .
APHASIOLOGY, 2016, 30 (11) :1174-1220
[3]
An Amorphous Model for Morphological Processing in Visual Comprehension Based on Naive Discriminative Learning [J].
Baayen, R. Harald ;
Milin, Petar ;
Durdevic, Dusica Filipovic ;
Hendrix, Peter ;
Marelli, Marco .
PSYCHOLOGICAL REVIEW, 2011, 118 (03) :438-481
[4]
The English Lexicon Project [J].
Balota, David A. ;
Yap, Melvin J. ;
Cortese, Michael J. ;
Hutchison, Keith A. ;
Kessler, Brett ;
Loftis, Bjorn ;
Neely, James H. ;
Nelson, Douglas L. ;
Simpson, Greg B. ;
Treiman, Rebecca .
BEHAVIOR RESEARCH METHODS, 2007, 39 (03) :445-459
[5]
Baroni M, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P238
[6]
Barsalou LW, 1999, BEHAV BRAIN SCI, V22, P637, DOI 10.1017/S0140525X99532147
[7]
CO2 reforming of CH4 [J].
Bradford, MCJ ;
Vannice, MA .
CATALYSIS REVIEWS-SCIENCE AND ENGINEERING, 1999, 41 (01) :1-42
[8]
Test-based age-of-acquisition norms for 44 thousand English word meanings [J].
Brysbaert, Marc ;
Biemiller, Andrew .
BEHAVIOR RESEARCH METHODS, 2017, 49 (04) :1520-1523
[9]
Concreteness ratings for 40 thousand generally known English word lemmas [J].
Brysbaert, Marc ;
Warriner, Amy Beth ;
Kuperman, Victor .
BEHAVIOR RESEARCH METHODS, 2014, 46 (03) :904-911
[10]
Norms of age of acquisition and concreteness for 30,000 Dutch words [J].
Brysbaert, Marc ;
Stevens, Michael ;
De Deyne, Simon ;
Voorspoels, Wouter ;
Storms, Gert .
ACTA PSYCHOLOGICA, 2014, 150 :80-84