Noninvasive technique for detecting hypernasal speech using a nonlinear operator

Times cited: 51
Authors
Cairns, DA
Hansen, JHL
Riski, JE
Affiliations
[1] DUKE UNIV,DEPT ELECT ENGN,ROBUST SPEECH PROC LAB,DURHAM,NC 27708
[2] SCOTTISH RITE CHILDRENS MED CTR,ATLANTA,GA 30342
DOI
10.1109/10.477699
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Speakers with a defective velopharyngeal mechanism produce speech with inappropriate nasal resonance (hypernasal speech). It is of clinical interest to detect hypernasality, as it is indicative of an anatomical, neurological, or peripheral nervous system problem. Various clinical techniques are used to determine hypernasality, but the current techniques are physically invasive or intrusive to some extent. A preferred approach for detecting hypernasality would be noninvasive, to maximize patient comfort and naturalness of speaking. In this study, a noninvasive technique based on the Teager Energy operator is proposed. Utilizing a property of the Teager Energy operator and a model for normal and nasalized speech, a significant difference between the Teager Energy profiles for lowpass and bandpass filtered nasalized speech is shown. This difference is shown to be nonexistent for normal speech. A classification algorithm is formulated that detects the presence of hypernasality using a measure of the difference in the Teager Energy profiles. The classification algorithm was evaluated using a native English speaker population producing front (/i/) and mid (/A/) vowels. Results show that the presence of hypernasality in speech can be reliably detected using the proposed classification algorithm.
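The abstract does not spell out the operator or the property it relies on, so the following is only an illustrative sketch. It uses the standard discrete form of the Teager Energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], and shows one well-known behavior: a single sinusoid yields a flat energy profile, while a sum of two sinusoids yields a fluctuating one because of cross terms. Whether this is exactly the property used in the paper is an assumption. The helper names (`teager_energy`, `tone`, `spread`), the tone frequencies and amplitudes, and the use of synthetic tones in place of filtered speech are all hypothetical choices for illustration; the paper's filters, speech model, and classification measure are not reproduced here.

```python
import math


def teager_energy(x):
    """Discrete Teager Energy operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    The first and last samples are skipped because they lack a neighbor.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def tone(amplitude, omega, length):
    """Synthetic sinusoid with digital frequency omega (radians per sample)."""
    return [amplitude * math.cos(omega * n) for n in range(length)]


def spread(profile):
    """Peak-to-peak variation of an energy profile (illustrative measure only)."""
    return max(profile) - min(profile)


# Hypothetical stand-ins: one component versus two components.
single = tone(1.0, 0.2, 200)
double = [a + b for a, b in zip(tone(1.0, 0.2, 200), tone(0.5, 0.5, 200))]

single_profile = teager_energy(single)
double_profile = teager_energy(double)

# For A*cos(omega*n) the profile is the constant A^2 * sin(omega)^2.
print("single-tone spread:", spread(single_profile))
print("two-tone spread:   ", spread(double_profile))
```

In this toy setting the single-tone profile is constant up to rounding error, while the two-tone profile varies visibly. A comparison of profiles in this spirit could, under the stated assumptions, separate signals with one dominant component from signals with several.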
Pages: 35-45
Number of pages: 11