Joint acoustic and modulation frequency

Cited by: 102
Authors
Atlas, L
Shamma, SA
Affiliations
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[3] Univ Maryland, Ctr Auditory & Acoust Res, Syst Res Inst, College Pk, MD 20742 USA
Keywords
digital signal processing; acoustics; audition; talker separation; modulation spectrum
DOI
10.1155/S1110865703305013
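Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract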
There is considerable evidence that our perception of sound uses important features which are related to underlying signal modulations. This topic has been studied extensively via perceptual experiments, yet there are few, if any, well-developed signal processing methods which capitalize on or model these effects. We begin by summarizing evidence of the importance of modulation representations from psychophysical, physiological, and other sources. The concept of a two-dimensional joint acoustic and modulation frequency representation is proposed. A simple single sinusoidal amplitude modulator of a sinusoidal carrier is then used to illustrate properties of an unconstrained and ideal joint representation. Added constraints are required to remove or reduce undesired interference terms and to provide invertibility. It is then noted that the constraints would also be applied to more general and complex cases of broader modulation and carriers. Applications in single-channel speaker separation and in audio coding are used to illustrate the applicability of this joint representation. Other applications in signal analysis and filtering are suggested.
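As a rough illustration of the sinusoidal-modulator example mentioned in the abstract, the sketch below builds a two-dimensional acoustic-by-modulation frequency magnitude for one amplitude-modulated sinusoid. It assumes a common two-stage approach (short-time Fourier magnitudes, followed by a second Fourier transform along time in each acoustic-frequency bin), which is not necessarily the representation the paper develops, and it does not address the interference terms or invertibility constraints the abstract refers to. The function name, window length, hop size, and test-signal parameters are all hypothetical choices made for this example.

```python
import numpy as np


def joint_frequency_magnitude(x, fs, win_len=128, hop=16):
    """Two-stage sketch: STFT magnitude, then an FFT along time per bin.

    Hypothetical helper for illustration only; not the paper's method.
    Returns (joint, modulation_freqs, acoustic_freqs), where joint has
    shape (n_modulation_bins, n_acoustic_bins).
    """
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    # Index matrix that slices the signal into overlapping frames.
    idx = np.arange(win_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = x[idx] * window
    # Stage 1: envelope of each acoustic-frequency bin over time.
    envelopes = np.abs(np.fft.rfft(frames, axis=1))
    # Remove each bin's mean so the zero-modulation term does not dominate.
    envelopes = envelopes - envelopes.mean(axis=0, keepdims=True)
    # Stage 2: Fourier transform of each envelope along the time axis.
    joint = np.abs(np.fft.rfft(envelopes, axis=0))
    acoustic_freqs = np.fft.rfftfreq(win_len, d=1.0 / fs)
    modulation_freqs = np.fft.rfftfreq(n_frames, d=hop / fs)
    return joint, modulation_freqs, acoustic_freqs


# Assumed test signal: a 1000 Hz carrier with a 20 Hz sinusoidal modulator.
fs = 8000
t = np.arange(2 * fs) / fs
x = (1.0 + 0.8 * np.cos(2 * np.pi * 20 * t)) * np.cos(2 * np.pi * 1000 * t)

joint, mod_f, ac_f = joint_frequency_magnitude(x, fs)
m_bin, a_bin = np.unravel_index(np.argmax(joint), joint.shape)
peak_acoustic_hz = ac_f[a_bin]
peak_modulation_hz = mod_f[m_bin]
print(f"peak: acoustic {peak_acoustic_hz:.1f} Hz, "
      f"modulation {peak_modulation_hz:.2f} Hz")
```

With these settings the largest value should sit at the carrier's acoustic frequency and close to the modulator's frequency. The analysis window must be shorter than the modulation period for the envelope to survive the first stage, which is why a 16 ms window is used for a 20 Hz modulator here.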
Pages: 668-675
Page count: 8