Time normalization of voice signals using functional data analysis

被引:31
作者
Lucero, JC [1 ]
Koenig, LL
机构
[1] Univ Brasilia, Dept Math, BR-70910900 Brasilia, DF, Brazil
[2] Haskins Labs Inc, New Haven, CT 06511 USA
[3] Long Isl Univ, Brooklyn, NY 11201 USA
关键词
D O I
10.1121/1.1289206
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The harmonics-to-noise ratio (HNR) has been used to quantify the waveform irregularity of voice signals [Yumoto er nl., J. Acoust. Sec: Am. 71, 1544-1550 (1982)]. This measure assumes that the signal consists of two components: a harmonic component, which is the common pattern that repeats from cycle-to-cycle, and an additive noise component, which produces the cycle-to-cycle irregularity. It has been shown [J. Qi, J. Acoust. Sec. Am. 92, 2569-2576 (1992)] that a valid computation of the HNR requires a nonlinear time normalization of the cycle wavelets to remove phase differences between them. This paper shows the application of functional data analysis to perform an optimal nonlinear normalization and compute the HNR of voice signals. Results obtained for the same signals using zero-padding, linear normalization, and dynamic programming algorithms are presented for comparison. Functional data analysis offers certain advantages over other approaches: it preserves meaningful features of signal shape, produces differentiable results, and allows flexibility in selecting the optimization criteria for the wavelet alignment. An extension of the technique for the time normalization of simultaneous voice signals (such as acoustic, EGG, and airflow signals) is also shown. The general purpose of this article is to illustrate the potential of functional data analysis as a powerful analytical tool for studying aspects of the voice production process. (C) 2000 Acoustical Society of America. [S0001-4966(00)00310-6].
引用
收藏
页码:1408 / 1420
页数:13
相关论文
共 37 条
  • [31] Titze I. R., 1994, PRINCIPLES VOICE PRO
  • [32] COMPARISON OF F(O) EXTRACTION METHODS FOR HIGH-PRECISION VOICE PERTURBATION MEASUREMENTS
    TITZE, IR
    LIANG, HX
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1993, 36 (06): : 1120 - 1133
  • [33] TITZE IR, 1994, WORSH AC VOIC AN SUM
  • [34] Vetterling W. T, 2002, NUMERICAL RECIPES C
  • [35] Wang KM, 1997, ANN STAT, V25, P1251
  • [36] LABIAL COORDINATION IN CHILDREN - PRELIMINARY CONSIDERATIONS
    WATKIN, KL
    FROMM, D
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1984, 75 (02) : 629 - 632
  • [37] HARMONICS-TO-NOISE RATIO AS AN INDEX OF THE DEGREE OF HOARSENESS
    YUMOTO, E
    GOULD, WJ
    BAER, T
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 71 (06) : 1544 - 1550