Using Internet search engines to estimate word frequency

Cited by: 64
Authors
Blair, IV
Urland, GR
Ma, JE
Affiliations
[1] Univ Colorado, Dept Psychol, Boulder, CO 80309 USA
[2] Univ Kansas, Lawrence, KS 66045 USA
Source
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS | 2002 / Vol. 34 / No. 2
DOI
10.3758/BF03195456
Chinese Library Classification number
B841 [Research methods in psychology];
Discipline classification code
040201 ;
Abstract
The present research investigated Internet search engines as a rapid, cost-effective alternative for estimating word frequencies. Frequency estimates for 382 words were obtained and compared across four methods: (1) Internet search engines, (2) the Kucera and Francis (1967) analysis of a traditional linguistic corpus, (3) the CELEX English linguistic database (Baayen, Piepenbrock, & Gulikers, 1995), and (4) participant ratings of familiarity. The results showed that Internet search engines produced frequency estimates that were highly consistent with those reported by Kucera and Francis and those calculated from CELEX, highly consistent across search engines, and very reliable over a 6-month period of time. Additional results suggested that Internet search engines are an excellent option when traditional word frequency analyses do not contain the necessary data (e.g., estimates for forenames and slang). In contrast, participants' familiarity judgments did not correspond well with the more objective estimates of word frequency. Researchers are advised to use search engines with large databases (e.g., AltaVista) to ensure the greatest representativeness of the frequency estimates.
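The comparison described in the abstract can be illustrated with a small sketch: take the hit count a search engine reports for each word, take the same word's count in a corpus norm, and correlate the two. The abstract does not state which statistic or transformation was used, so the Pearson correlation and the log10(count + 1) transform below are assumptions, and the word list and all counts are invented placeholders rather than data from the study.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def log_counts(counts):
    """log10(count + 1): an assumed transform to tame skewed frequency counts."""
    return [math.log10(c + 1) for c in counts]


# Hypothetical placeholder data: NOT taken from the paper.
search_engine_hits = {"house": 5_200_000, "river": 1_900_000,
                      "candle": 310_000, "zephyr": 42_000}
corpus_counts = {"house": 591, "river": 165, "candle": 18, "zephyr": 1}

# Compare only the words present in both sources, in a fixed order.
words = sorted(set(search_engine_hits) & set(corpus_counts))
r = pearson(log_counts([search_engine_hits[w] for w in words]),
            log_counts([corpus_counts[w] for w in words]))
print(f"correlation over {len(words)} words: r = {r:.3f}")
```

The same function could be applied to two search engines, or to counts from the same engine collected months apart, to check the cross-engine consistency and test-retest reliability the abstract mentions.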
Pages: 286-290
Number of pages: 5
Related papers
21 in total
[1]  
[Anonymous], 1944, TEACHERS WORD BOOK 3
[2]  
[Anonymous], 2019, Corpus Linguistics
[3]  
Baayen RH., 1996, The celex lexical database (cd-rom)
[4]   Automatic and controlled processes in stereotype priming [J].
Blair, IV ;
Banaji, MR .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1996, 70 (06) :1142-1163
[5]  
BRYSBAERT M, 2000, EUR J COGNITIVE PSYCHOL, V12, P65
[6]   A naturalistic study of the word frequency effect in episodic recognition [J].
Chalmers, KA ;
Humphreys, MS ;
Dennis, S .
MEMORY & COGNITION, 1997, 25 (06) :780-784
[7]  
Chomsky N., 1965, Aspects of the Theory of Syntax
[8]   Automatic preference for white Americans: Eliminating the familiarity explanation [J].
Dasgupta, N ;
McGhee, DE ;
Greenwald, AG ;
Banaji, MR .
JOURNAL OF EXPERIMENTAL SOCIAL PSYCHOLOGY, 2000, 36 (03) :316-328
[9]   GOALS IN SOCIAL INFORMATION-PROCESSING - THE CASE OF ANTICIPATED INTERACTION [J].
DEVINE, PG ;
SEDIKIDES, C ;
FUHRMAN, RW .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1989, 56 (05) :680-690
[10]  
Francis W. N., 1982, FREQUENCY ANAL ENGLI