A practical methodology for speech source localization with microphone arrays

被引:198
作者
Brandstein, MS [1 ]
Silverman, HF [1 ]
机构
[1] BROWN UNIV, DIV ENGN, LAB ENGN MAN MACHINE SYST, PROVIDENCE, RI 02912 USA
基金
美国国家科学基金会;
关键词
D O I
10.1006/csla.1996.0024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electronically steerable arrays of microphones have a variety of uses in speech data acquisition systems. Applications include teleconferencing, speech recognition and speaker identification, sound capture in adverse environments, and biomedical devices for the hearing impaired. An array of microphones has a number of advantages over a single-microphone system. It may be electronically aimed to provide a high-quality signal from a desired source location while simultaneously attenuating interfering talkers and ambient noise, does not necessitate local placement of transducers or encumber the talker with a hand-held or head-mounted microphone, and does not require physical movement to alter its direction of reception. Additionally, it has capabilities that a single microphone does not; namely automatic detection, localization and tracking of active talkers in its receptive area. This paper addresses the specific application of source localization algorithms for estimating the position of speech sources in a real-room environment given limited computational resources. The theoretical foundations of a speech source localization system are presented. This includes the development of a source-sensor geometry for talkers and sensors in the near-field environment as well as the evaluation of several error criteria available to the problem. Several practical algorithms necessary for real-time implementation are developed, specifically the derivation and evaluation of an appropriate time-delay estimator and a novel closed form locator. Finally, results obtained from a real system are presented to illustrate the effectiveness of the proposed source localization techniques as well as to confirm the practicality of the theoretical models. (C) 1997 Academic Press Limited.
引用
收藏
页码:91 / 126
页数:36
相关论文
共 59 条
[1]  
ADUGNA E, 1994, THESIS RUTGERS U NEW
[2]  
ALVARADO VM, 1990, THESIS BROWN U PROVI
[3]  
BANGS WJ, 1973, SIGNAL PROCESS, P577
[4]  
Bar-Shalom Y., 1990, Multitarget-Multisensor Tracking: Advanced Applications
[5]  
Bar-Shalom Y., 1988, Tracking and Data Association
[6]  
Bar-Shalom Yaakov., 1993, ESTIMATION TRACKING
[7]  
BEDARD S, 1994, INT CONF ACOUST SPEE, P261
[8]  
BRADY PT, 1965, AT&T TECH J, V44, P1
[9]  
BRANDSTEIN M, 1995, THESIS BROWN U PROVI
[10]   A closed-form location estimator for use with room environment microphone arrays [J].
Brandstein, MS ;
Adcock, JE ;
Silverman, HF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (01) :45-50