Application of combined temporal and spectral processing methods for speaker recognition under noisy, reverberant or multi-speaker environments

被引:9
作者
Krishnamoorthy, P. [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Commun Engn, Gauhati 781039, Assam, India
来源
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2009年 / 34卷 / 05期
关键词
Speaker recognition; speech enhancement; temporal and spectral processing; noisy speech; reverberant speech and multi-speaker speech; SPEECH ENHANCEMENT; FEATURES; IDENTIFICATION; SUPPRESSION; SEPARATION; ALGORITHM; DATABASE;
D O I
10.1007/s12046-009-0043-8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper presents an experimental evaluation of the combined temporal and spectral processing methods for speaker recognition task under noise, reverberation or multi-speaker environments. Automatic speaker recognition system gives good performance in controlled environments. Speech recorded in real environments by distant microphones is degraded by factors like background noise, reverberation and interfering speakers. This degradation strongly affects the performance of the speaker recognition system. Combined temporal and spectral processing (TSP) methods proposed in our earlier study are used for pre-processing to improve the speaker-specific features and hence the speaker recognition performance. Different types of degradation like background noise, reverberation and interfering speaker are considered for evaluation. The evaluation is carried out for the individual temporal processing, spectral processing and the combined TSP method. The experimental results show that the combined TSP methods give relatively higher recognition performance compared to either temporal or spectral processing alone.
引用
收藏
页码:729 / 754
页数:26
相关论文
共 40 条
[1]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[2]  
[Anonymous], 2002, P IEEE ICASSP
[3]   A tutorial on text-independent speaker verification [J].
Bimbot, F ;
Bonastre, JF ;
Fredouille, C ;
Gravier, G ;
Magrin-Chagnolleau, I ;
Meignier, S ;
Merlin, T ;
Ortega-García, J ;
Petrovska-Delacrétaz, D ;
Reynolds, DA .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :430-451
[4]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[5]   Speaker recognition: A tutorial [J].
Campbell, JP .
PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462
[6]  
EPHRAIM Y, 1984, ASSP3211091121
[7]   COMPARISON OF SPEAKER RECOGNITION METHODS USING STATISTICAL FEATURES AND DYNAMIC FEATURES [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (03) :342-350
[8]  
Greenberg S, 1997, INT CONF ACOUST SPEE, P1647, DOI 10.1109/ICASSP.1997.598826
[9]   Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments [J].
Habets, Emanuel A. P. ;
Gannot, Sharon ;
Cohen, Israel ;
Sommen, Piet C. W. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08) :1433-1451
[10]   Robustness to telephone handset distortion in speaker recognition by discriminative feature design [J].
Heck, LP ;
Konig, Y ;
Sönmez, MK ;
Weintraub, M .
SPEECH COMMUNICATION, 2000, 31 (2-3) :181-192