The NIST speaker recognition evaluation - Overview, methodology, systems, results, perspective

Cited by: 190
Authors
Doddington, GR
Przybocki, MA
Martin, AF
Reynolds, DA
Affiliations
[1] Natl Inst Stand & Technol, Gaithersburg, MD 20899 USA
[2] SRI Int, Menlo Pk, CA 94025 USA
[3] MIT, Lincoln Lab, Lexington, MA 02173 USA
Keywords
speaker recognition; identification; verification; performance evaluation; NIST evaluations; detection error trade-off (DET) curve
DOI
10.1016/S0167-6393(99)00080-1
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper, based on three presentations made in 1998 at the RLA2C Workshop in Avignon, discusses the evaluation of speaker recognition systems from several perspectives. A general discussion of the speaker recognition task and the challenges and issues involved in its evaluation is offered. The NIST evaluations in this area and specifically the 1998 evaluation, its objectives, protocols and test data, are described. The algorithms used by the systems that were developed for this evaluation are summarized, compared and contrasted. Overall performance results of this evaluation are presented by means of detection error trade-off (DET) curves. These show the performance trade-off of missed detections and false alarms for each system and the effects on performance of training condition, test segment duration, the speakers' sex and the match or mismatch of training and test handsets. Several factors that were found to have an impact on performance, including pitch frequency, handset type and noise, are discussed and DET curves showing their effects are presented. The paper concludes with some perspective on the history of this technology and where it may be going. (C) 2000 Elsevier Science B.V. All rights reserved.
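The trade-off between missed detections and false alarms that a DET curve displays can be illustrated with a minimal sketch. This is not the NIST scoring software or the authors' method. The function names `det_points` and `probit`, the accept-when-score-is-at-or-above-threshold convention, and the example scores are all assumptions made for illustration. The sketch sweeps a decision threshold over the observed scores, computes the miss rate on target trials and the false alarm rate on impostor trials, and maps a rate onto the normal-deviate scale commonly used for DET axes.

```python
from statistics import NormalDist


def det_points(target_scores, impostor_scores):
    """Return (threshold, miss_rate, false_alarm_rate) tuples.

    Assumption: a trial is accepted when score >= threshold.
    """
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    points = []
    for t in thresholds:
        # Target trials rejected at this threshold are missed detections.
        misses = sum(1 for s in target_scores if s < t)
        # Impostor trials accepted at this threshold are false alarms.
        false_alarms = sum(1 for s in impostor_scores if s >= t)
        points.append((t,
                       misses / len(target_scores),
                       false_alarms / len(impostor_scores)))
    return points


def probit(p, eps=1e-6):
    """Map a rate to the normal-deviate scale, clamped away from 0 and 1."""
    p = min(max(p, eps), 1.0 - eps)
    return NormalDist().inv_cdf(p)


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    targets = [2.0, 1.5, 0.5, 3.0]
    impostors = [0.0, 1.0, -1.0, 1.8]
    for t, p_miss, p_fa in det_points(targets, impostors):
        print(f"threshold={t:5.2f}  miss={p_miss:.2f}  false_alarm={p_fa:.2f}")
```

Plotting `probit(p_fa)` against `probit(p_miss)` for each threshold gives a curve in the DET style; a system whose scores are roughly Gaussian for both trial types would appear close to a straight line on those axes.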
Pages: 225-254
Page count: 30