Acoustic modelling using continuous rational kernels

被引：5

作者：

Layton, Martin ^{[1
]}

Gales, Mark ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

来源：

JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2007年 / 48卷 / 1-2期

关键词：

augmented statistical models; rational kernels; speech recognition; TIMIT database;

D O I：

10.1007/s11265-006-0027-4

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Many discriminative classification algorithms are designed for tasks where samples can be represented by fixed-length vectors. However, many examples in the fields of text processing, computational biology and speech recognition are best represented as variable-length sequences of vectors. Although several dynamic kernels have been proposed for mapping sequences of discrete observations into fixed-dimensional feature-spaces, few kernels exist for sequences of continuous observations. This paper introduces continuous rational kernels, an extension of standard rational kernels, as a general framework for classifying sequences of continuous observations. In addition to allowing new task-dependent kernels to be defined, continuous rational kernels allow existing continuous dynamic kernels, such as Fisher and generative kernels, to be calculated using standard weighted finite-state transducer algorithms. Preliminary results on both a large vocabulary continuous speech recognition (LVCSR) task and the TIMIT database are presented.

引用

页码：67 / 82

页数：16

共 26 条

[11]

Jaakkola TS, 1999, ADV NEUR IN, V11, P487

[12]

LAYTON MI, 2006, THESIS U CAMBRIDGE

[13] Text classification using string kernels [J].

Lodhi, H ;

Saunders, C ;

Shawe-Taylor, J ;

Cristianini, N ;

Watkins, C .

JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :419-444

[14]

MANGU L, 1999, P EUR, P495

[15]

Mohri M, 1997, COMPUT LINGUIST, V23, P269

[16] Weighted finite-state transducers in speech recognition [J].

Mohri, M ;

Pereira, F ;

Riley, M .

COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01) :69-88

[17]

Mohri M., 2002, Journal of Automata, Languages and Combinatorics, V7, P321

[18]

PEREIRA FCN, 1997, FINITE STATE DEVICES

[19]

POVEY D, 2004, THESIS U CAMBRIDGE

[20] A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].

RABINER, LR .

PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286

← 1 2 3 →