Fine structure spectrography and its application in speech

被引:5
作者
Dajani, HR
Wong, W
Kunov, H
机构
[1] Univ Toronto, Inst Biomat & Biomed Engn, Toronto, ON M5S 3G9, Canada
[2] Univ Toronto, Edward S Rogers Sr Dept Elect & Comp Engn, Toronto, ON M5S 3G9, Canada
关键词
D O I
10.1121/1.1896365
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A filterbank-based algorithm for time-varying spectral analysis is proposed. The algorithm, which is an enhanced realization of the conventional spectrogram, consists of hundreds or thousands of highly overlapping wideband filter/detector stages, followed by a peak detector that probes the filter/detector outputs at very short time intervals. Analysis with synthetic modulated signals illustrates how the proposed method demodulates these signals. The resulting spectrogram-like display, referred to as a "fine structure spectrogram," shows the fine structure of the modulations in substantially higher detail than is possible with conventional spectrograms. Error evaluation is performed as a function of various parameters of a single- and two-component synthetic modulated signal, and of parameters of the analysis system. In speech, the fine structure spectrogram can. detect small frequency and amplitude modulations in the formants. It also appears to identify additional significant time-frequency components in speech that are not detected by other methods, making it potentially useful in speech processing applications. (c) 2005 Acoustical Society of America.
引用
收藏
页码:3902 / 3918
页数:17
相关论文
共 47 条
[1]  
ANDERSON JC, 1984, 707 MIT
[2]   ESTIMATING AND INTERPRETING THE INSTANTANEOUS FREQUENCY OF A SIGNAL .2. ALGORITHMS AND APPLICATIONS [J].
BOASHASH, B .
PROCEEDINGS OF THE IEEE, 1992, 80 (04) :540-568
[3]   ESTIMATING AND INTERPRETING THE INSTANTANEOUS FREQUENCY OF A SIGNAL .1. FUNDAMENTALS [J].
BOASHASH, B .
PROCEEDINGS OF THE IEEE, 1992, 80 (04) :520-538
[4]  
Carden Frank., 2002, ARTECH TEL
[5]   Multiridge detection and time-frequency reconstruction [J].
Carmona, RA ;
Hwang, WL ;
Torrésani, B .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (02) :480-492
[6]   Characterization of signals by the ridges of their wavelet transforms [J].
Carmona, RA ;
Hwang, WL ;
Torresani, B .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (10) :2586-2590
[7]  
Cohen L., 1995, TIME FREQUENCY ANAL, P44
[8]   ASYMPTOTIC WAVELET AND GABOR ANALYSIS - EXTRACTION OF INSTANTANEOUS FREQUENCIES [J].
DELPRAT, N ;
ESCUDIE, B ;
GUILLEMAIN, P ;
KRONLANDMARTINET, R ;
TCHAMITCHIAN, P ;
TORRESANI, B .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :644-664
[9]   PARAMETRIC CODING OF SPEECH SPECTRA [J].
FLANAGAN, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (02) :412-419
[10]   AUTOMATIC EXTRACTION OF FORMANT FREQUENCIES FROM CONTINUOUS SPEECH [J].
FLANAGAN, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1956, 28 (01) :110-118