ENERGY SEPARATION IN SIGNAL MODULATIONS WITH APPLICATION TO SPEECH ANALYSIS

Cited by: 520
Authors
MARAGOS, P
KAISER, JF
QUATIERI, TF
Affiliations
[1] BELLCORE, MORRISTOWN, NJ 07962 USA
[2] MIT, LINCOLN LAB, SPEECH SYST TECHNOL GRP, LEXINGTON, MA 02173 USA
[3] HARVARD UNIV, DEPT APPL SCI, CAMBRIDGE, MA 02138 USA
Funding
National Science Foundation (USA)
Keywords
DOI
10.1109/78.277799
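Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes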
0808 ; 0809 ;
Abstract
Oscillatory signals that have both an amplitude-modulation (AM) and a frequency-modulation (FM) structure are encountered in almost all communication systems. We have also used these structures recently for modeling speech resonances, being motivated by previous work on investigating fluid dynamics phenomena during speech production that provide evidence for the existence of modulations in speech signals. In this paper, we use a nonlinear differential operator that can detect modulations in AM-FM signals by estimating the product of their time-varying amplitude and frequency. This operator essentially tracks the energy needed by a source to produce the oscillatory signal. To solve the fundamental problem of estimating both the amplitude envelope and instantaneous frequency of an AM-FM signal we develop a novel approach that uses nonlinear combinations of instantaneous signal outputs from the energy operator to separate its output energy product into its amplitude modulation and frequency modulation components. The theoretical analysis is done first for continuous-time signals. Then several efficient algorithms are developed and compared for estimating the amplitude envelope and instantaneous frequency of discrete-time AM-FM signals. These energy separation algorithms are then applied to search for modulations in speech resonances, which we model using AM-FM signals to account for time-varying amplitude envelopes and instantaneous frequencies. Our experimental results provide evidence that bandpass filtered speech signals around speech formants contain amplitude and frequency modulations within a pitch period. Overall, the energy separation algorithms, due to their very low computational complexity and instantaneously-adapting nature, are very useful in detecting modulation patterns in speech and other time-varying signals.
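The abstract does not state the operator or the separation formulas, so the following is only a minimal sketch under stated assumptions. It assumes the commonly cited discrete form of the Teager-Kaiser energy operator, Psi[x(n)] = x(n)^2 - x(n-1) x(n+1), and one discrete energy separation variant usually referred to as DESA-2. The function names `teager_energy` and `desa2` are illustrative and do not come from the record.

```python
import math


def teager_energy(x):
    """Assumed discrete energy operator: psi(n) = x(n)^2 - x(n-1) * x(n+1).

    Returns values for interior samples n = 1 .. len(x) - 2.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def desa2(x):
    """Sketch of a DESA-2 style separation (an assumption, not quoted from the record).

    Returns (amplitudes, frequencies) for samples n = 2 .. len(x) - 3,
    with frequencies in radians per sample. Entries are NaN where the
    energy values are not positive.
    """
    psi_x = teager_energy(x)  # psi_x[k] belongs to sample n = k + 1
    # Symmetric difference y(n) = x(n+1) - x(n-1); y[k] belongs to n = k + 1
    y = [x[n + 1] - x[n - 1] for n in range(1, len(x) - 1)]
    psi_y = teager_energy(y)  # psi_y[j] belongs to sample n = j + 2

    amplitudes, frequencies = [], []
    for j, py in enumerate(psi_y):
        px = psi_x[j + 1]  # align with sample n = j + 2
        if px <= 0.0 or py <= 0.0:
            amplitudes.append(float("nan"))
            frequencies.append(float("nan"))
            continue
        # Clamp to the valid arccos domain to absorb rounding error
        c = max(-1.0, min(1.0, 1.0 - py / (2.0 * px)))
        frequencies.append(0.5 * math.acos(c))
        amplitudes.append(2.0 * px / math.sqrt(py))
    return amplitudes, frequencies


if __name__ == "__main__":
    # Constant-amplitude, constant-frequency test tone: A = 2.0, Omega = 0.3 rad/sample
    tone = [2.0 * math.cos(0.3 * n + 0.1) for n in range(50)]
    amps, freqs = desa2(tone)
    print(amps[0], freqs[0])
```

For a pure cosine with frequency below a quarter of the sampling rate, these expressions recover the amplitude and frequency exactly up to rounding. For genuinely modulated signals they are approximations whose accuracy depends on how slowly the modulations vary.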
Pages: 3024-3051
Page count: 28