The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using Semantically Unpredictable Sentences

被引:108
作者
Benoit, C
Grice, M
Hazan, V
机构
[1] UNIV SAARLAND, D-6600 SAARBRUCKEN, GERMANY
[2] UNIV LONDON UNIV COLL, DEPT PHONET & LINGUIST, LONDON NW1 2HE, ENGLAND
关键词
Intelligibility; Multilinguality; Standardization; Synthesis assessment;
D O I
10.1016/0167-6393(96)00026-X
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes the experimental set-up used by the SAM (ESPRIT-BRA Project no. 2589: Multilingual Speech Input/Output: Assessment, Methodology and Standardisation) group for evaluating the intelligibility of text-to-speech systems at sentence level. The SUS test measures overall intelligibility of Semantically Unpredictable Sentences which can be automatically generated using five basic syntactic structures and a number of lexicons containing the most frequently occurring mini-syllabic words in each language. The sentence material has the advantage of not being fixed, as words can be extracted from the lexicons randomly to form a new set of sentences each time the test is run. Various text-to-speech systems in a number of languages have been evaluated using this test. Results have demonstrated that the SUS test is effective and that it allows for reliable comparison across synthesisers provided guidelines are followed carefully regarding the definition of the test material and actual running of the test. These recommendations are the result of experience gained during the SAM project and beyond. They are presented here so as to provide users with a standardized evaluation method which is flexible and easy to use and is applicable to a number of different languages.
引用
收藏
页码:381 / 392
页数:12
相关论文
共 19 条
[1]   AN INTELLIGIBILITY TEST USING SEMANTICALLY UNPREDICTABLE SENTENCES - TOWARDS THE QUANTIFICATION OF LINGUISTIC COMPLEXITY [J].
BENOIT, C .
SPEECH COMMUNICATION, 1990, 9 (04) :293-304
[2]  
Benoit C., 1995, P127
[3]  
BENOIT C, 1992, TALKING MACHINES THE, P435
[4]  
BENOIT C, 1989, P ESCA WORKSH SPEECH
[5]  
BENOIT C, 1992, P 2 INT C SPOK LANG, V2, P999
[6]  
BENOIT C, 1989, P EUROSPEECH 89 C ES, V2, P633
[7]  
Chomsky N., 1957, SYNTACTIC STRUCT
[8]  
FOUREIN AJF, 1992, TALKING MACHINES THE, P431
[9]  
GRICE M, 1989, SPEECH HEARING LANGU, V3, P107
[10]  
HAZAN V, 1993, P EUROSPEECH 93 C ES, V3, P1849