Acoustic modelling using continuous rational kernels

被引:5
作者
Layton, Martin [1 ]
Gales, Mark [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2007年 / 48卷 / 1-2期
关键词
augmented statistical models; rational kernels; speech recognition; TIMIT database;
D O I
10.1007/s11265-006-0027-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many discriminative classification algorithms are designed for tasks where samples can be represented by fixed-length vectors. However, many examples in the fields of text processing, computational biology and speech recognition are best represented as variable-length sequences of vectors. Although several dynamic kernels have been proposed for mapping sequences of discrete observations into fixed-dimensional feature-spaces, few kernels exist for sequences of continuous observations. This paper introduces continuous rational kernels, an extension of standard rational kernels, as a general framework for classifying sequences of continuous observations. In addition to allowing new task-dependent kernels to be defined, continuous rational kernels allow existing continuous dynamic kernels, such as Fisher and generative kernels, to be calculated using standard weighted finite-state transducer algorithms. Preliminary results on both a large vocabulary continuous speech recognition (LVCSR) task and the TIMIT database are presented.
引用
收藏
页码:67 / 82
页数:16
相关论文
共 26 条
[1]  
[Anonymous], 2004, KERNEL METHODS PATTE
[2]  
[Anonymous], 2005, SPR S STAT
[3]   Modeling splicing sites with pairwise correlations [J].
Arita, M ;
Tsuda, K ;
Asai, K .
BIOINFORMATICS, 2002, 18 :S27-S34
[4]  
BAHL LR, 1986, P ICASSP TOK
[5]   AN INEQUALITY WITH APPLICATIONS TO STATISTICAL ESTIMATION FOR PROBABILISTIC FUNCTIONS OF MARKOV PROCESSES AND TO A MODEL FOR ECOLOGY [J].
BAUM, LE ;
EAGON, JA .
BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1967, 73 (03) :360-&
[6]  
Cortes C, 2004, J MACH LEARN RES, V5, P1035
[7]  
CORTES C, 2003, 16 ANN C COMP LEARN, P656
[8]  
EVERMANN G, 2005, P ICASSP, P209
[9]  
Garofolo JS, 1993, TIMIT Acoustic-Phonetic Continuous Speech Corpus
[10]  
GUNAWARDANA A, 2005, INTERSPEECH