Three-dimensional sound localization from a compact non-coplanar array of microphones using tree-based learning

被引:23
作者
Wang, JY [1 ]
Guentchev, KY [1 ]
机构
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
关键词
D O I
10.1121/1.1377290
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
One of the various human sensory capabilities is to identify the direction of perceived sounds. The goal of this work is to study sound source localization in three dimensions using some of the most important cues the human uses. In an attempt to satisfy the requirements of portability and miniaturization in robotics, this approach employs a compact sensor structure that can be placed on a mobile platform. The objective is to estimate the relative sound source position in three-dimensional space without imposing excessive restrictions on its spatio-temporal characteristics and the environment structure. Two types of features are considered, interaural time and level differences. Their relative effectiveness for localization is studied, as well as a practical way of using these complementary parameters. A two-stage procedure was used. In the training stage, sound samples are produced from points with known coordinates and then are stored. In the recognition stage, unknown sounds are processed by the trained system to estimate the 3D location of the sound source. Results from the experiments showed under +/-3 degrees in average angular error and less than +/- 20% in average radial distance error. (C) 2001 Acoustical Society of America.
引用
收藏
页码:310 / 323
页数:14
相关论文
共 22 条
[1]  
BLAUERT J, 1969, ACUSTICA, V22, P205
[2]   A closed-form location estimator for use with room environment microphone arrays [J].
Brandstein, MS ;
Adcock, JE ;
Silverman, HF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (01) :45-50
[3]   A PRACTICAL TIME-DELAY ESTIMATOR FOR LOCALIZING SPEECH SOURCES WITH A MICROPHONE ARRAY [J].
BRANDSTEIN, MS ;
ADCOCK, JE ;
SILVERMAN, HF .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :153-169
[4]   A practical methodology for speech source localization with microphone arrays [J].
Brandstein, MS ;
Silverman, HF .
COMPUTER SPEECH AND LANGUAGE, 1997, 11 (02) :91-126
[5]  
BRANDSTEIN MS, 1997, P 1997 WORKSH APPL S
[6]  
BUB U, 1995, P 1995 ICASSP DETR M
[7]  
CAPEL V, 1978, MICROPHONES ACTION
[8]  
CARR HA, 1966, INTRO SPACE PERCEPTI
[9]  
CHAMPAGNE B, 1996, IEEE T SPEECH AUDIO, V4, P48
[10]   LEAST-SQUARES ESTIMATION OF TIME-DELAY AND ITS USE IN SIGNAL-DETECTION [J].
CHAN, YT ;
HATTIN, RV ;
PLANT, JB .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (03) :217-222