State-of-the-art in speaker recognition

被引:54
作者
Faundez-Zanuy, M
Monte-Moreno, E
机构
[1] Escola Univ Politecn Mataro, Barcelona 08303, Spain
[2] TALP Res Ctr, Barcelona, Spain
关键词
D O I
10.1109/MAES.2005.1432568
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Recent advances in speech technologies have produced new tools that can be used to improve the performance and flexibility of speaker recognition. While there are few degrees of freedom or alternative methods when using fingerprint or iris identification techniques, speech offers much more flexibility and different levels to perform recognition: the system can force the user to speak in a particular manner, different for each attempt to enter. Also, with voice input, the system has other degrees of freedom, such as the use of knowledge/codes that only the user knows, or dialectical/semantical traits that are difficult to forge. This paper offers an overview of the state-of-the-art in speaker recognition, with special emphasis on the pros and cons, and the current research lines. The current research lines include improved classification systems, and the use of high level information by means of probabilistic grammars. In conclusion, speaker recognition is far away from being a technology where all the possibilities have already been explored.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 11 条
[1]  
[Anonymous], EUR C SPEECH COMM TE
[2]  
[Anonymous], 1989, DIGITAL SPEECH PROCE
[3]  
CAMPBELL JP, 2003, FUSING HIGH LOW LEVE
[4]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction, DOI 10.1017/CBO9780511801389
[5]  
DODDINGTON G, 2001, EUROSPEECH, V4, P2521
[6]   On the vulnerability of biometric security systems [J].
Faúndez-Zanuy, M .
IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2004, 19 (06) :3-8
[7]  
FAUNDEZZANNY M, 2004, IN PRESS IEEE AEROSP
[8]  
FAUNDEZZANUY M, 2004, IN PRESS IEEE AEROSP
[9]  
Manning C., 1999, Foundations of Statistical Natural Language Processing
[10]   AHUMADA: A large speech corpus in Spanish for speaker characterization and identification [J].
Ortega-Garcia, J ;
Gonzalez-Rodriguez, J ;
Marrero-Aguiar, V .
SPEECH COMMUNICATION, 2000, 31 (2-3) :255-264