Predictive linear transforms for noise robust speech recognition

被引:12
作者
Gales, M. J. F. [1 ]
van Dalen, R. C. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2 | 2007年
关键词
noise robust speech recognition; Joint Uncertainty Decoding; precision matrix modelling;
D O I
10.1109/ASRU.2007.4430084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that the addition of background noise alters the correlations between the elements of, for example, the MFCC feature vector. However, standard model-based compensation techniques do not modify the feature-space in which the diagonal covariance matrix Gaussian mixture models are estimated. One solution to this problem, which yields good performance, is Joint Uncertainty Decoding (JUD) with full transforms. Unfortunately, this results in a high computational cost during decoding. This paper contrasts two approaches to approximating full JUD while lowering the computational cost. Both use predictive linear transforms to modify the feature-space: adaptation-based linear transforms, where the model parameters are restricted to be the same as the original clean system; and precision matrix modelling approaches, in particular semi-tied covariance matrices. These predictive transforms are estimated using statistics derived from the full JUD transforms rather than noisy data. The schemes are evaluated on AURORA 2 and a noise-corrupted Resource Management task.
引用
收藏
页码:59 / 64
页数:6
相关论文
共 17 条
[1]  
[Anonymous], 1996, THESIS CARNEGIE MELL
[2]  
DROPPO J, 2002, P ICASSP ORL FLOR MA
[3]  
DROPPO J, 2006, P ICASSP TOUL FRANC
[4]  
GALES M, 1996, IEEE T SPEECH AUDIO
[5]   Semi-tied covariance matrices for hidden Markov models [J].
Gales, MJF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03) :272-281
[6]  
GALES MJF, 1998, SPEECH COMMUNICATION, V25
[7]  
GALES MJF, 1998, COMPUTER SPEECH LANG, V12
[8]   MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
LEGGETTER, CJ ;
WOODLAND, PC .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :171-185
[9]  
LIAO H, 2006, P INT
[10]  
LIAO H, 2005, P INT LISB PORT SEPT