SPEECH CODING - A TUTORIAL REVIEW

Cited by: 178
Author
SPANIAS, AS
Affiliation
[1] Department of Electrical Engineering, Telecommunications Research Center, Arizona State University, Tempe
DOI
10.1109/5.326413
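Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract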
The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to represent the spectral properties of speech, provide for speech waveform matching, and "optimize" the coder's performance for the human ear. A number of these coders have already been adopted in national and international cellular telephony standards. The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications. Although the emphasis is on the new low-rate coders, we attempt to provide a comprehensive survey by covering some of the traditional methodologies as well. We feel that this approach will not only point out key references but will also provide valuable background to the beginner. The paper starts with a historical perspective and continues with a brief discussion of speech properties and performance measures. We then proceed with descriptions of waveform coders, sinusoidal transform coders, linear predictive vocoders, and analysis-by-synthesis linear predictive coders. Finally, we present concluding remarks followed by a discussion of opportunities for future research.
Pages: 1541-1582
Page count: 42