Decision tree state tying using cluster validity criteria

被引:5
作者
Chien, JT [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005年 / 13卷 / 02期
关键词
cluster validity; decision tree; F distribution; continuous speech recognition; Hubert's Gamma statistic; hypothesis test; T-2-statistle;
D O I
10.1109/TSA.2004.840941
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Decision tree state tying aims to perform divisive clustering, which can combine the phonetics and acoustics of speech signal for large vocabulary continuous speech recognition. A tree is built by successively splitting the observation frames of a phonetic unit according to the best phonetic questions. To prevent building over-large tree models, the stopping criterion is required to suppress tree growing. Accordingly, it is crucial to exploit the goodness-of-split criteria to choose the best questions for node splitting and test whether the splitting should be terminated or not. In this paper. we apply the Hubert's Gamma statistic as the node splitting criterion and the T-statistic as the stopping criterion. The Hubert's Gamma statistic sufficiently characterizes the clustering structure in the given data. This cluster validity criterion is adopted to select the best questions to unravel tree nodes. Further, we examine the population closeness of two split nodes with a significance level. The T-2-statistic expressed by an F distribution is determined to verify whether the mean vectors of two nodes are close together. The splitting is stopped when verified. In the experiments of Mandarin speech recognition, the proposed methods achieve better syllable recognition rates with smaller tree models compared to the conventional maximum likelihood and minimum description length criteria.
引用
收藏
页码:182 / 193
页数:12
相关论文
共 38 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Anderson TW., 1984, INTRO MULTIVARIATE S
[3]  
[Anonymous], P EUR 1997
[4]  
[Anonymous], 1992, P ICASSP
[5]  
BAHL LR, 1991, INT CONF ACOUST SPEE, P185, DOI 10.1109/ICASSP.1991.150308
[6]  
BAIN LJ, 1992, INTRO PROBABILTY MAT
[7]  
Beulen K, 1998, INT CONF ACOUST SPEE, P805, DOI 10.1109/ICASSP.1998.675387
[8]  
Chen SS, 1998, INT CONF ACOUST SPEE, P645, DOI 10.1109/ICASSP.1998.675347
[9]  
CHESTA C, 1997, P EUR C SPEECH COMM, V1, P11
[10]   Unsupervised hierarchical adaptation using reliable selection of cluster-dependent parameters [J].
Chien, JT ;
Junqua, JC .
SPEECH COMMUNICATION, 2000, 30 (04) :235-253