Acoustic modelling using continuous rational kernels

Cited by: 5

Authors:

Layton, Martin [1]

Gales, Mark [1]

Affiliation:

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

Source:

Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology | 2007, Vol. 48, Issue 1-2

Keywords:

augmented statistical models; rational kernels; speech recognition; TIMIT database

DOI:

10.1007/s11265-006-0027-4

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Subject classification code:

0812

Abstract:

Many discriminative classification algorithms are designed for tasks where samples can be represented by fixed-length vectors. However, many examples in the fields of text processing, computational biology and speech recognition are best represented as variable-length sequences of vectors. Although several dynamic kernels have been proposed for mapping sequences of discrete observations into fixed-dimensional feature-spaces, few kernels exist for sequences of continuous observations. This paper introduces continuous rational kernels, an extension of standard rational kernels, as a general framework for classifying sequences of continuous observations. In addition to allowing new task-dependent kernels to be defined, continuous rational kernels allow existing continuous dynamic kernels, such as Fisher and generative kernels, to be calculated using standard weighted finite-state transducer algorithms. Preliminary results on both a large vocabulary continuous speech recognition (LVCSR) task and the TIMIT database are presented.

Pages: 67-82

Number of pages: 16
