A COMPARISON OF THE ENERGY OPERATOR AND THE HILBERT TRANSFORM APPROACH TO SIGNAL AND SPEECH DEMODULATION

被引:168
作者
POTAMIANOS, A [1 ]
MARAGOS, P [1 ]
机构
[1] GEORGIA INST TECHNOL,SCH ELECT & COMP ENGN,ATLANTA,GA 30332
基金
美国国家科学基金会;
关键词
DEMODULATION; ENERGY OPERATOR; HILBERT TRANSFORM; SPEECH PROCESSING;
D O I
10.1016/0165-1684(94)90169-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The Hilbert transform together with Gabor's analytic sipal provides a standard linear integral approach to estimate the amplitude envelope and instantaneous frequency of signals with a combined amplitude modulation (AM) and frequency modulation (FM) structure. A recent alternative approach uses a nonlinear differential 'energy' operator to track the energy required to generate an AM-FM signal and separate it into amplitude and frequency components. In this paper, we compare these two fundamentally different approaches for demodulation of arbitrary signals and of speech resonances modeled by AM-FM signals. The comparison is done from several viewpoints: magnitude of estimation errors, computational complexity, and adaptability to instantaneous signal changes. We also propose a refinement of the energy operator approach that uses simple binomial convolutions to smooth the energy signals. This smoothed energy operator is compared to the Hilbert transform on tracking modulations in speech vowel signals, band-pass filtered around their formants. The effects of pitch periodicity and band-pass filtering on both demodulation approaches are examined and an application to formant tracking is presented. The results provide strong evidence that the estimation errors of the smoothed energy operator approach are similar to that of the Hilbert transform approach for speech applications, but smaller for communication applications. In addition, the smoothed energy operator approach has smaller computational complexity and faster adaptation due to its instantaneous nature.
引用
收藏
页码:95 / 120
页数:26
相关论文
共 27 条
[1]  
Boashash, Estimating and interpreting the instaneous frequency of a signal Part 1 Fundamentals, Proceedings of the IEEE, 80, pp. 520-538, (1992)
[2]  
Bovik, Maragos, Quatieri, Measuring amplitude and frequency modulations in noise using multiband energy operators, IEEE Internat. Symp. on Time-Frequency and Time-Scale Analysis, (1992)
[3]  
Flanagan, Speech Analysis Synthesis and Perception, (1972)
[4]  
Flanagan, Parametric coding of speech spectra, J. Acoust. Soc. Amer., 68, pp. 412-419, (1980)
[5]  
Foote, Mashao, Silverman, Stop classification using DESA-1 high resolution formant tracking, Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., (1993)
[6]  
Gabor, Theory of communication, J. IEE (London), 93, pp. 429-457, (1946)
[7]  
Hanson, Maragos, Potamianos, Finding speech formants and modulations via energy separation: With application to a vocoder, Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., (1993)
[8]  
Kaiser, On a simple algorithm to calculate the ‘energy’ of a signal, Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 381-384, (1990)
[9]  
Kaiser, On Teager's energy algorithm and its generalization to continuous signals, Proc. IEEE Digital Signal Process. Workshop, (1990)
[10]  
Mandel, Interpretation of instantaneous frequency, American Journal of Physics, 42, pp. 840-846, (1974)