Telephony speech enhancement by data hiding

被引：22

作者：

Chen, Siyue ^{[1
]}

Leung, Henry

Ding, Heping

机构：

[1] Univ Calgary, Dept Elect & Comp Engn, Calgary, AB T2N 1N4, Canada

[2] Natl Res Council Canada, Inst Microstruct Sci, Ottawa, ON K1A 0R6, Canada

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2007年 / 56卷 / 01期

关键词：

auditory spectrum; data hiding; public switched telephone network (PSTN); speech coding; speech enhancement; spread spectrum (SS);

D O I：

10.1109/TIM.2006.887409

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 [电气工程]; 0809 [电子科学与技术];

摘要：

The current public switched telephone network (PSTN) is only able to deliver analog signals in a relatively narrow frequency band, about 200-3500 Hz. Such a limited bandwidth causes the typical sound of the narrowband telephone speech. In order to improve intelligibility and perceived quality of telephone speech, we propose using data hiding to extend the PSTN channel bandwidth. Based on the perceptual masking principle, the inaudible spectrum components within the telephone bandwidth can be removed without degrading the speech quality, providing a hidden channel to transmit extra information. The audible components outside the PSTN bandwidth, which are spread out by using orthogonal pseudo-noise codes, are embedded into this hidden channel and then transmitted through the PSTN channel. While this hidden signal is not audible to the human ear, it can be extracted at the receiver end. It results in a final speech signal with a wider bandwidth than the normal PSTN channel. Using both theoretical and simulation analysis, it is shown that the proposed approach is robust to quantization errors and channel noises. Although we cannot physically extend the transmission bandwidth of PSTN, the telephony speech quality can be significantly,improved by using the proposed data hiding technique.

引用

页码：63 / 74

页数：12

共 22 条

[1]

[Anonymous], 2000, ETSI 201 108 V112

[2]

[Anonymous], 129E ITU COM

[3]

AVENDANO C, 1995, EUR C SPEECH COMM TE

[4]

BEERENDS JG, 1992, J AUDIO ENG SOC, V40, P963

[5]

BRANDENBURG K, 1994, J AUDIO ENG SOC, V42, P780

[6]

Statistical Recovery of Wideband Speech from Narrowband Speech [J].

Cheng, Yan Ming ;

O'Shaughnessy, Douglas ;

Mermelstein, Paul .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :544-548

[7]

EPPS J, 1999, IEEE WORKSH SPEECH C

[8]

Fant G., 1973, Speech sounds and features

[9]

HARRIS FJ, 1979, P IEEE, V67, P1586

[10]

HASSAAN AA, 1998, PERSPECTIVES SPREAD

← 1 2 3 →