Recent advances in speaker recognition

被引:120
作者
Furui, S [1 ]
机构
[1] Tokyo Inst Technol, Meguro Ku, Tokyo 152, Japan
关键词
speaker recognition; speaker verification; speaker identification; text-prompted method; HMM; likelihood normalization;
D O I
10.1016/S0167-8655(97)00073-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces recent advances in speaker recognition technology. The first part discusses general topics and issues. The second part is devoted to a discussion of more specific topics of recent interest that have led to interesting new approaches and techniques. They include VQ- and ergodic-HMM-based text-independent recognition methods, a text-prompted recognition method, parameter/distance normalization and model adaptation techniques, and methods of updating models and a priori thresholds in speaker verification. Although many recent advances and successes have been achieved in speaker recognition, there are still many problems for which good solutions remain to be found. The last part of this paper describes 16 open questions about speaker recognition. The paper concludes with a short discussion assessing the current status and future possibilities. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:859 / 872
页数:14
相关论文
共 59 条
[1]  
[Anonymous], P INT C SPOK LANG PR
[2]  
[Anonymous], 1989, DIGITAL SPEECH PROCE
[3]   EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION [J].
ATAL, BS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) :1304-1312
[4]   AUTOMATIC SPEAKER RECOGNITION BASED ON PITCH CONTOURS [J].
ATAL, BS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (06) :1687-1697
[5]  
Carey MJ, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1800, DOI 10.1109/ICSLP.1996.607979
[6]  
Carey MJ, 1992, P I ACOUSTICS 6, V14, P95
[7]   SPEAKER RECOGNITION - IDENTIFYING PEOPLE BY THEIR VOICES [J].
DODDINGTON, GR .
PROCEEDINGS OF THE IEEE, 1985, 73 (11) :1651-1664
[8]  
EATOCK J, 1990, P INT C SPOK LANG PR, P133
[9]  
Furui S., 1994, ESCA Workshop on Automatic Speaker Recognition Identification and Verification, P1
[10]   SPEAKER-DEPENDENT-FEATURE EXTRACTION, RECOGNITION AND PROCESSING TECHNIQUES [J].
FURUI, S .
SPEECH COMMUNICATION, 1991, 10 (5-6) :505-520