Recognizing speech under a processing load: Dissociating energetic from informational factors

Cited by: 189
Authors
Mattys, Sven L. [1]
Brooks, Joanna [2]
Cooke, Martin [3, 4]
Affiliations
[1] Univ Bristol, Dept Expt Psychol, Bristol BS8 1TU, Avon, England
[2] Univ Edinburgh, Dept Psychol, Edinburgh EH8 9JZ, Midlothian, Scotland
[3] Ikerbasque Basque Sci Fdn, Bilbao 48011, Spain
[4] Univ Basque Country, Dept Elect & Elect, Fac Ciencias & Tecnol, Leioa 48940, Spain
Funding
Economic and Social Research Council (UK)
Keywords
Psycholinguistics; Spoken-word recognition; Speech segmentation; Processing load; Energetic masking; Informational masking;
DOI
10.1016/j.cogpsych.2009.04.001
Chinese Library Classification number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Effects of perceptual and cognitive loads on spoken-word recognition have so far largely escaped investigation. This study lays the foundations of a psycholinguistic approach to speech recognition in adverse conditions that draws upon the distinction between energetic masking, i.e., listening environments leading to signal degradation, and informational masking, i.e., listening environments leading to depletion of higher-order, domain-general processing resources, independent of signal degradation. We show that severe energetic masking, such as that produced by background speech or noise, curtails reliance on lexical-semantic knowledge and increases relative reliance on salient acoustic detail. In contrast, informational masking, induced by a resource-depleting competing task (divided attention or a memory load), results in the opposite pattern. Based on this clear dissociation, we propose a model of speech recognition that addresses not only the mapping between sensory input and lexical representations, as traditionally advocated, but also the way in which this mapping interfaces with general cognition and non-linguistic processes. (C) 2009 Elsevier Inc. All rights reserved.
Pages: 203-243
Page count: 41