SPEAKER-DEPENDENT-FEATURE EXTRACTION, RECOGNITION AND PROCESSING TECHNIQUES

Cited by: 38
Author
FURUI, S
Affiliation
[1] NTT Human Interface Laboratories, Musashino-shi, Tokyo, 180
Keywords
SPEAKER-DEPENDENT FEATURES; SPEAKER RECOGNITION; SPEAKER ADAPTATION; VOICE CONVERSION;
DOI
10.1016/0167-6393(91)90054-W
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
This paper discusses recent advances in and perspectives of research on speaker-dependent-feature extraction from speech waves, automatic speaker identification and verification, speaker adaptation in speech recognition, and voice conversion techniques. Speaker-dependent information exists both in the spectral envelope and in the supra-segmental features of speech. This individual information can be further classified into static and dynamic features. Speaker identification/verification methods can be divided into text-dependent and text-independent methods. Although text-dependent speaker verification techniques have almost reached the level suitable for practical implementation, text-independent techniques are still in the fundamental research stage. Both supervised and unsupervised speaker adaptation algorithms for speech recognition have recently been proposed, and remarkable progress has been achieved in this field. Improving synthesized speech quality by adding natural characteristics of voice individuality, and converting synthesized voice individuality from one speaker to another, are as yet little exploited research fields to be studied in the near future. Research on speaker-dependent information is one of the most important future directions for achieving advanced speech information processing systems.
Pages: 505-520
Page count: 16