AHUMADA: A large speech corpus in Spanish for speaker characterization and identification

被引:63
作者
Ortega-Garcia, J
Gonzalez-Rodriguez, J
Marrero-Aguiar, V
机构
[1] Univ Politecn Madrid, EUIT Telecomun, Dept Ingn Audiovisual & Comun, Madrid 23031, Spain
[2] Univ Nacl Educ Distancia, Dept Lengua Espanola, E-28040 Madrid, Spain
关键词
speech databases; speaker characterization; speaker recognition;
D O I
10.1016/S0167-6393(99)00081-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speaker recognition is an emerging task in both commercial and forensic applications. Nevertheless, while in certain applications we can estimate, adapt or hypothesize about our working conditions, most of the commercial applications and almost the whole of the forensic approaches to speaker recognition are still open problems, due to several reasons. Some of these reasons can be stated: environmental conditions are (usually) rapidly changing or highly degraded, acquisition processes are not always under control, incriminated people exhibit low degree of cooperativeness, etc., inducing a wide range of variability sources on speech utterances. In this sense, real approaches to speaker identification necessarily imply taking into account all these variability factors. In order to isolate, analyze and measure the effect of some of the main variability sources that can be found in real commercial and forensic applications, and their influence in automatic recognition systems, a specific large speech database in Castilian Spanish called AHUMADA (/aumada/) has been designed and acquired under controlled conditions. In this paper, together with a detailed description of the database, some experimental results including different speech variability factors are also presented. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:255 / 264
页数:10
相关论文
共 21 条
[1]  
A Reynolds D., 1992, GAUSSIAN MIXTURE MOD
[2]  
ACERO A, 1993, AC ENV ROB AUT SPEEC
[3]  
[Anonymous], P INT C SPOK LANG PR
[4]  
[Anonymous], 2012, ROBUSTNESS AUTOMATIC
[5]  
Boves L., 1994, ESCA Workshop on Automatic Speaker Recognition Identification and Verification, P43
[6]  
CHAMPOD C, 1998, ESCA WORKSH SPEAK RE, P125
[7]  
Furui S., 1994, ESCA Workshop on Automatic Speaker Recognition Identification and Verification, P1
[8]  
Gibbon D., 1997, Handbook of standards and resources for spoken language systems
[9]  
Godfrey J., 1994, ESCA Workshop on Automatic Speaker Recognition Identification and Verification, P39
[10]  
GonzalezRodriguez J, 1997, INT CONF ACOUST SPEE, P1103, DOI 10.1109/ICASSP.1997.596134