Producing high-dimensional semantic spaces from lexical co-occurrence

被引:1040
作者
Lund, K
Burgess, C
机构
[1] University of California, Riverside, CA
[2] Psychology Department, 1419 Life Sciences Bldg., University of California, Riverside
来源
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS | 1996年 / 28卷 / 02期
关键词
D O I
10.3758/BF03204766
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
A procedure that processes a corpus of text and produces numeric vectors containing information about its meanings for each word is presented. This procedure is applied to a large corpus of natural language text taken from Usenet, and the resulting vectors are examined to determine what information is contained within them. These vectors provide the coordinates in a high-dimensional space in which word relationships can be analyzed. Analyses of both vector similarity and multidimensional scaling demonstrate that there is significant semantic information carried in the vectors. A comparison of vector similarity with human reaction times in a single-word priming experiment is presented. These vectors provide the basis for a representational model of semantic memory, hyperspace analogue to language (HAL).
引用
收藏
页码:203 / 208
页数:6
相关论文
共 21 条
[1]  
[Anonymous], 1995, COGNITIVE SCI P LEA
[2]  
[Anonymous], 1991, Lexical Acquisition
[3]  
ARMSTRONG S, 1994, USING LARGE CORPORA
[4]  
BURGESS C, 1994, PROCEEDINGS OF THE SIXTEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, P90
[5]  
BURGESS C, 1995, 8 ANN CUNY SENT PROC
[6]  
BURGESS C, 1995, ANN M PSYCH SOC LOS
[7]  
BURGESS C, IN PRESS GETTING RIG
[8]  
BURGESS C, 1995, P 17 ANN C COGN SCI, P13
[9]   SEMANTIC AND ASSOCIATIVE PRIMING IN THE CEREBRAL HEMISPHERES - SOME WORDS DO, SOME WORDS DONT ... SOMETIMES, SOME PLACES [J].
CHIARELLO, C ;
BURGESS, C ;
RICHARDS, L ;
POLLOCK, A .
BRAIN AND LANGUAGE, 1990, 38 (01) :75-104
[10]  
Ervin-Tripp S. M., 1970, NORMS WORD ASS, P383, DOI [10.1016/B978-0-12-563050-4.50012-1., DOI 10.1016/B978-0-12-563050-4.50012-1]