Noncausal all-pole modeling of voiced speech

被引：18

作者：

Gardner, WR ^{[1
]}

Rao, BD ^{[1
]}

机构：

[1] QUALCOMM INC, SAN DIEGO, CA USA

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1997年 / 5卷 / 01期

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/89.554263

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper introduces noncausal all-pole models that are capable of efficiently capturing both the magnitude and phase information of voiced speech, It is shown that noncausal all-pole filter models are better able to match both magnitude and phase information and are particularly appropriate for voiced speech due to the nature of the glottal excitation. By modeling speech in the frequency domain, the standard difficulties that occur when using noncausal all-pole filters are avoided. Several algorithms for determining the model parameters based on frequency-domain information and the masking effects of the ear are described. Our work suggests that high-quality voiced speech can be produced using a 14th-order noncausal all-pole model.

引用

页码：1 / 10

页数：10

共 28 条

[1]

[Anonymous], P ICASSP

[2]

BRANDENBURG K, 1992, AUD ENG SOC CONV

[3]

BUNCH JR, 1992, J NUMER LINEAR ALGEB, V1

[4] 2-SIDED FILTERS FOR FRAME-BASED PREDICTION [J].

DAVID, S ;

RAMAMURTHI, B .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) :789-794

[5]

Flanagan J. L., 1972, Speech Analysis Synthesis and Perception

[6]

GARDNER W, 1993, SPEECH AUDIO CODING

[7]

GARDNER WR, 1994, THESIS U CALIFORNIA

[8]

GARDNER WR, 1992, P IEEE AS C SIGN SYS

[9]

GERSON IA, 1990, P ICASSP

[10]

Gill M., 1981, Practical Optimization

← 1 2 3 →