Speech probability distribution

被引：206

作者：

Gazor, S ^{[1
]}

Zhang, W ^{[1
]}

机构：

[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada

来源：

IEEE SIGNAL PROCESSING LETTERS | 2003年 / 10卷 / 07期

关键词：

speech coding; speech processing;

D O I：

10.1109/LSP.2003.813679

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

It is demonstrated that the distribution of speech samples. is, well described by Laplacian distribution, (LD). The widely known speech distributions, i.e., LD, Gaussian distribution (GD), generalized GD, and gamma distribution, are tested as four hypotheses, and it is proved that speech samples during voice activity intervals are Laplacian random variables. A decorrelation transformation is then applied to speech samples. to approximate their multivariate distribution. To do this, speech is decomposed using an adaptive Karhunen-Loeve transform or a discrete cosine transform. Then, the distributions of speech components in decorrelated domains are investigated. Experimental evaluations prove that the statistics of speech signals are like a multivariate LD. In brief, all marginal distributions of speech are accurately described by LD in decorrelated domains. While the energies of speech components are time-varying, their distribution shape remains Laplacian.

引用

页码：204 / 207

页数：4

共 12 条

[1] AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].

BELL, AJ ;

SEJNOWSKI, TJ .

NEURAL COMPUTATION, 1995, 7 (06) :1129-1159

[2] DESCRIPTION AND GENERATION OF SPHERICALLY INVARIANT SPEECH-MODEL SIGNALS [J].

BREHM, H ;

STAMMLER, W .

SIGNAL PROCESSING, 1987, 12 (02) :119-141

[3] AN EXPERIMENTAL STUDY OF SPEECH-WAVE PROBABILITY DISTRIBUTIONS [J].

DAVENPORT, WB .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1952, 24 (04) :390-399

[4] A DCT-based fast signal subspace technique for robust speech recognition [J].

Huang, J ;

Zhao, YX .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06) :747-751

[5]

JANG GJ, 2001, P ICASSP

[6] COMPARISON OF GENERALIZED GAUSSIAN AND LAPLACIAN MODELING IN DCT IMAGE-CODING [J].

JOSHI, RL ;

FISCHER, TR .

IEEE SIGNAL PROCESSING LETTERS, 1995, 2 (05) :81-82

[7]

LeBlanc JP, 1998, INT CONF ACOUST SPEE, P1029, DOI 10.1109/ICASSP.1998.675443

[8] DISTRIBUTION SHAPE OF 2-DIMENSIONAL DCT COEFFICIENTS OF NATURAL IMAGES [J].

MULLER, F .

ELECTRONICS LETTERS, 1993, 29 (22) :1935-1936

[9] MINIMUM MEAN-SQUARED-ERROR QUANTIZATION IN SPEECH PCM AND DPCM SYSTEMS [J].

PAEZ, MD ;

GLISSON, TH .

IEEE TRANSACTIONS ON COMMUNICATIONS, 1972, CO20 (02) :225-&

[10] An adaptive KLT approach for speech enhancement [J].

Rezayee, A ;

Gazor, S .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02) :87-95

← 1 2 →