PHONETICALLY-BASED MULTILAYERED NEURAL NETWORKS FOR VOWEL CLASSIFICATION

Cited by: 7
Authors
COSI, P
BENGIO, Y
DEMORI, R
Affiliations
[1] CTR RECH INFORMAT MONTREAL, MONTREAL H3G 1N2, QUEBEC, CANADA
[2] MCGILL UNIV, SCH COMP SCI, MONTREAL H3A 2K6, QUEBEC, CANADA
Keywords
articulatory features; classification; ear model; multi-layered neural networks; recognition; speaker-independent system; vowels
DOI
10.1016/0167-6393(90)90041-7
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The vowel sub-component of a speaker-independent phoneme classification system will be described. The architecture of the vowel classifier is based on an ear model followed by a set of Multi-Layered Neural Networks (MLNN). MLNNs are trained to learn how to recognize articulatory features like the place of articulation and the manner of articulation related to tongue position. Experiments are performed on 10 English vowels showing a recognition rate higher than 95% on new speakers. When features are used for recognition, comparable results are obtained for vowels and diphthongs not used for training and pronounced by new speakers. This suggests that MLNNs suitably fed by the data computed by an ear model have good generalization capabilities over new speakers and new sounds. © 1990.
Pages: 15-29
Number of pages: 15