SPEECH CODING - A TUTORIAL REVIEW

被引:178
作者
SPANIAS, AS
机构
[1] Department of Electrical Engineering, Telecommunications Research Center, Arizona State University, Tempe
关键词
D O I
10.1109/5.326413
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to: represent the spectral properties of speech, provide for speech waveform matching, and ''optimize'' the coder's performance for the human ear. A number of these coders have already been adopted in national and international cellular telephony standards. The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications. Although the emphasis is on the new low-rate coders, we attempt to provide a comprehensive survey by covering some of the traditional methodologies as well. We feel that this approach will not only point out key references but will also provide valuable background to the beginner. The paper starts with a historical perspective and continues with a brief discussion on the speech properties and performance measures. We then proceed with descriptions of waveform coders, sinusoidal transform coders, linear predictive vocoders, and analysis-by-synthesis linear predictive coders. Finally, we present concluding remarks followed by a discussion of opportunities for future research.
引用
收藏
页码:1541 / 1582
页数:42
相关论文
共 326 条
[31]  
CHEN JH, 1987, APR P INT C AC SPEEC, P2185
[32]  
CHEN JH, 1991, MAY P IEEE INT C AC, P21
[33]  
CHEN JH, 1986, P ICASSP 86, P1693
[34]  
CHENG Y, 1990, APR P ICASSP90 NEW M, P649
[35]   CEPSTRUM - GUIDE TO PROCESSING [J].
CHILDERS, DG ;
SKINNER, DP ;
KEMERAIT, RC .
PROCEEDINGS OF THE IEEE, 1977, 65 (10) :1428-1443
[36]   WALSH-TRANSFORM CODING OF THE SPEECH RESIDUAL IN RELP CODERS [J].
CHING, PC ;
GOODYEAR, CC .
IEE PROCEEDINGS-G CIRCUITS DEVICES AND SYSTEMS, 1984, 131 (01) :29-34
[37]  
CHUNG J, 1990, APR P ICASSP 90 NEW, P25
[38]  
CHUNG J, 1989, P ICASSP89, P144
[39]  
COOLEY, 1968, IBM RC1743 TJ WATS R
[40]  
COPPERI M, 1985, MAR P ICASSP 85 TAMP, P252