Speech probability distribution

被引:206
作者
Gazor, S [1 ]
Zhang, W [1 ]
机构
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada
关键词
speech coding; speech processing;
D O I
10.1109/LSP.2003.813679
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
It is demonstrated that the distribution of speech samples. is, well described by Laplacian distribution, (LD). The widely known speech distributions, i.e., LD, Gaussian distribution (GD), generalized GD, and gamma distribution, are tested as four hypotheses, and it is proved that speech samples during voice activity intervals are Laplacian random variables. A decorrelation transformation is then applied to speech samples. to approximate their multivariate distribution. To do this, speech is decomposed using an adaptive Karhunen-Loeve transform or a discrete cosine transform. Then, the distributions of speech components in decorrelated domains are investigated. Experimental evaluations prove that the statistics of speech signals are like a multivariate LD. In brief, all marginal distributions of speech are accurately described by LD in decorrelated domains. While the energies of speech components are time-varying, their distribution shape remains Laplacian.
引用
收藏
页码:204 / 207
页数:4
相关论文
共 12 条
[1]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[2]   DESCRIPTION AND GENERATION OF SPHERICALLY INVARIANT SPEECH-MODEL SIGNALS [J].
BREHM, H ;
STAMMLER, W .
SIGNAL PROCESSING, 1987, 12 (02) :119-141
[3]   AN EXPERIMENTAL STUDY OF SPEECH-WAVE PROBABILITY DISTRIBUTIONS [J].
DAVENPORT, WB .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1952, 24 (04) :390-399
[4]   A DCT-based fast signal subspace technique for robust speech recognition [J].
Huang, J ;
Zhao, YX .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06) :747-751
[5]  
JANG GJ, 2001, P ICASSP
[6]   COMPARISON OF GENERALIZED GAUSSIAN AND LAPLACIAN MODELING IN DCT IMAGE-CODING [J].
JOSHI, RL ;
FISCHER, TR .
IEEE SIGNAL PROCESSING LETTERS, 1995, 2 (05) :81-82
[7]  
LeBlanc JP, 1998, INT CONF ACOUST SPEE, P1029, DOI 10.1109/ICASSP.1998.675443
[8]   DISTRIBUTION SHAPE OF 2-DIMENSIONAL DCT COEFFICIENTS OF NATURAL IMAGES [J].
MULLER, F .
ELECTRONICS LETTERS, 1993, 29 (22) :1935-1936
[9]   MINIMUM MEAN-SQUARED-ERROR QUANTIZATION IN SPEECH PCM AND DPCM SYSTEMS [J].
PAEZ, MD ;
GLISSON, TH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1972, CO20 (02) :225-&
[10]   An adaptive KLT approach for speech enhancement [J].
Rezayee, A ;
Gazor, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02) :87-95