GLOVE-TALK - A NEURAL NETWORK INTERFACE BETWEEN A DATA-GLOVE AND A SPEECH SYNTHESIZER

被引:165
作者
FELS, SS [1 ]
HINTON, GE [1 ]
机构
[1] UNIV TORONTO,DEPT PSYCHOL,TORONTO M5S 1A1,ONTARIO,CANADA
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1993年 / 4卷 / 01期
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/72.182690
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To illustrate the potential of multilayer neural networks for adaptive interfaces, we used a VPL Data-Glove connected to a DECtalk speech synthesizer via five neural networks to implement a hand-gesture to speech system. Using minor variations of the standard back-propagation learning procedure, the complex mapping of hand movements to speech is learned using data obtained from a single ''speaker'' in a simple training phase. With a 203 gesture-to-word vocabulary, the wrong word is produced less than 1% of the time, and no word is produced about 5% of the time. Adaptive control of the speaking rate and word stress is also available. The training times and final performance speed are improved by using small, separate networks for each naturally defined subtask. The system demonstrates that neural networks can be used to develop the complex mappings required in a high bandwidth interface that adapts to the individual user.
引用
收藏
页码:2 / 8
页数:7
相关论文
共 8 条
[1]  
[Anonymous], NEUROCOMPUTING ALGOR
[2]  
FELS SS, 1990, CRGRT901 U TORONTO T
[3]   CONNECTIONIST LEARNING PROCEDURES [J].
HINTON, GE .
ARTIFICIAL INTELLIGENCE, 1989, 40 (1-3) :185-234
[4]  
KRAMER J, 1989, 12TH P RESNA ANN C N, P471
[5]  
OGDEN CK, 1968, BASIC ENGLISH INT 2N
[6]   LEARNING REPRESENTATIONS BY BACK-PROPAGATING ERRORS [J].
RUMELHART, DE ;
HINTON, GE ;
WILLIAMS, RJ .
NATURE, 1986, 323 (6088) :533-536
[7]  
WILBUR RB, 1979, AM SIGN LANGUAGE SIG
[8]  
1989, DATA GLOVE MODEL 2 O