Spectral and temporal changes to speech produced in the presence of energetic and informational maskers

被引：114

作者：

Cooke, Martin ^{[1
,2
]}

Lu, Youyi ^{[3
]}

机构：

[1] Ikerbasque Basque Sci Fdn, Vitoria 01006, Alava, Spain

[2] Univ Basque Country, Fac Letras, Language & Speech Lab, Vitoria 01006, Alava, Spain

[3] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2010年 / 128卷 / 04期

关键词：

ACOUSTIC-PHONETIC CHARACTERISTICS; SPEAKER INTELLIGIBILITY; AMBIENT NOISE; MASKING; COMMUNICATION; RECOGNITION; PERCEPTION; FREQUENCY; HEARING; CLEAR;

D O I：

10.1121/1.3478775

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Talkers change the way they speak in noisy conditions. For energetic maskers, speech production changes are relatively well-understood, but less is known about how informational maskers such as competing speech affect speech production. The current study examines the effect of energetic and informational maskers on speech production by talkers speaking alone or in pairs. Talkers produced speech in quiet and in backgrounds of speech-shaped noise, speech-modulated noise, and competing speech. Relative to quiet, speech output level and fundamental frequency increased and spectral tilt flattened in proportion to the energetic masking capacity of the background. In response to modulated backgrounds, talkers were able to reduce substantially the degree of temporal overlap with the noise, with greater reduction for the competing speech background. Reduction in foreground-background overlap can be expected to lead to a release from both energetic and informational masking for listeners. Passive changes in speech rate, mean pause length or pause distribution cannot explain the overlap reduction, which appears instead to result from a purposeful process of listening while speaking. Talkers appear to monitor the background and exploit upcoming pauses, a strategy which is particularly effective for backgrounds containing intelligible speech. (C) 2010 Acoustical Society of America. [DOI: 10.1121/1.3478775]

引用

页码：2059 / 2069

页数：11

共 67 条

[1]

[Anonymous], 1973, Speech Sounds and Features

[2]

[Anonymous], 1975, THESIS STANFORD U ST

[3]

[Anonymous], HDB MATH FUNCTIONS

[4]

ASSMANN PF, 2002, P 7 INT C SPOK LANG, P425

[5] Modelling speaker intelligibility in noise [J].

Barker, Jon ;

Cooke, Martin .

SPEECH COMMUNICATION, 2007, 49 (05) :402-417

[6]

Bird J., 1998, PSYCHOPHYSICAL PHYSL

[7] EQUALLY CONTRIBUTING FREQUENCY BANDS IN INTELLIGIBILITY TESTING [J].

BLACK, JW .

JOURNAL OF SPEECH AND HEARING RESEARCH, 1959, 2 (01) :81-83

[8]

Boersma P., 2020, Praat: doing phonetics by computer (Version 5.3.82) Computer software

[9]

Boersma P., 1993, Institute of Phonetic Sciences, University of Amsterdam, Proceedings 17 (1993) 97-110, P97

[10] ACOUSTIC PHONETIC CHARACTERISTICS OF SPEECH PRODUCED IN NOISE AND WHILE WEARING AN OXYGEN MASK [J].

BOND, ZS ;

MOORE, TJ ;

GABLE, B .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 85 (02) :907-912

← 1 2 3 4 5 6 7 →