Acoustic modelling using continuous rational kernels

Cited by: 5
Authors
Layton, Martin [1]
Gales, Mark [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Source
Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology | 2007, Vol. 48, Issue 1-2
Keywords
augmented statistical models; rational kernels; speech recognition; TIMIT database
DOI
10.1007/s11265-006-0027-4
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many discriminative classification algorithms are designed for tasks where samples can be represented by fixed-length vectors. However, many examples in the fields of text processing, computational biology and speech recognition are best represented as variable-length sequences of vectors. Although several dynamic kernels have been proposed for mapping sequences of discrete observations into fixed-dimensional feature spaces, few kernels exist for sequences of continuous observations. This paper introduces continuous rational kernels, an extension of standard rational kernels, as a general framework for classifying sequences of continuous observations. In addition to allowing new task-dependent kernels to be defined, continuous rational kernels allow existing continuous dynamic kernels, such as Fisher and generative kernels, to be calculated using standard weighted finite-state transducer algorithms. Preliminary results on both a large vocabulary continuous speech recognition (LVCSR) task and the TIMIT database are presented.
Pages: 67-82
Page count: 16