SPEECH DATABASE DEVELOPMENT AT MIT - TIMIT AND BEYOND

Cited by: 384
Authors
ZUE, V
SENEFF, S
GLASS, J
Affiliation
[1] Spoken Language Systems Group, Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge
Keywords
Speech corpora; speech database; speech recognition
DOI
10.1016/0167-6393(90)90010-7
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Automatic speech recognition by computers can provide the most natural and efficient method of communication between humans and computers. While high-performance speech recognition systems have begun to emerge from research institutions in recent years, scientists unequivocally agree that deploying such systems in realistic operating environments will require many hours of speech data to help model the inherent variability in the speech signal. This paper describes the experiences of researchers at MIT in collecting two large speech databases with somewhat complementary objectives. The TIMIT database was designed to be task- and speaker-independent and is suitable for general acoustic-phonetic research. The VOYAGER database, on the other hand, was intended for the development and evaluation of a system that incorporates both speech and natural language processing. This database is particularly valuable as a source of spontaneous utterances elicited in a realistic goal-oriented environment. © 1990.
Pages: 351-356
Page count: 6