Vehicle classification by acoustic signature

Cited by: 43
Authors
Nooralahiyan, AY
Kirby, HR
McKeown, D
Affiliations
[1] Univ Leeds, Inst Transport Studies, Leeds LS2 9JT, W Yorkshire, England
[2] Univ Leeds, Dept Psychol, Leeds LS2 9JT, W Yorkshire, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
vehicle; classification; traffic sensor; TDNN; LPC;
DOI
10.1016/S0895-7177(98)00060-0
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
The aim of this research is to investigate the feasibility of developing a traffic monitoring detector for the purpose of reliable on-line vehicle classification to aid traffic management systems. The detector used was a directional microphone connected to a DAT (Digital Audio Tape) recorder. The digital signal was preprocessed by LPC (Linear Predictive Coding) parameter conversion based on autocorrelation analysis. A Time Delay Neural Network (TDNN) was chosen to classify individual travelling vehicles based on their speed-independent acoustic signature. The paper provides a description of the TDNN architecture and training algorithm, and an overview of the LPC preprocessing and feature extraction technique as applied to audio monitoring of road traffic. The performance of TDNN vehicle classification, convergence, and accuracy for the training patterns are fully illustrated. To establish the viability of this classification approach, initially, recordings were carried out on a strip of airfield for four types of vehicles under controlled conditions. A TDNN was successfully trained with 100% accuracy in classification for the training patterns, as well as the test patterns. The net was also robust to changes in the starting position of the acoustic waveforms with 86% accuracy for the same test data set. In the second phase of the experiment, roadside recordings were made at a two-way urban road site in the city of Leeds with no control over the environmental parameters such as background noise, interference from other travelling vehicles, or the speed of the recorded vehicle. A second TDNN was also successfully trained with 96% accuracy for the training patterns and 84% accuracy for the test patterns. © 1998 Elsevier Science Ltd. All rights reserved.
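As an illustration of the preprocessing step named in the abstract (LPC parameters obtained from autocorrelation analysis), the following is a minimal sketch of the standard autocorrelation method with the Levinson-Durbin recursion. It is not the authors' implementation: the function names, the frame contents, and the prediction order are all assumptions chosen for illustration, and the abstract does not state the frame length, windowing, or LPC order used in the paper.

```python
def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one signal frame."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def lpc_coefficients(frame, order):
    """Estimate LPC coefficients with the Levinson-Durbin recursion.

    Returns (a, err) where the predictor is
        x[n] ~= a[0]*x[n-1] + a[1]*x[n-2] + ... + a[order-1]*x[n-order]
    and err is the residual prediction-error energy.
    The name and signature are hypothetical, not taken from the paper.
    """
    r = autocorrelation(frame, order)
    if r[0] == 0.0:
        # A silent frame carries no information; return a zero predictor.
        return [0.0] * order, 0.0

    a = [0.0] * order
    err = r[0]
    for i in range(order):
        # Reflection coefficient for model order i + 1.
        acc = r[i + 1] - sum(a[j] * r[i - j] for j in range(i))
        k = acc / err

        # Update the lower-order coefficients, then append the new one.
        updated = a[:]
        updated[i] = k
        for j in range(i):
            updated[j] = a[j] - k * a[i - 1 - j]
        a = updated

        # Shrink the prediction-error energy.
        err *= 1.0 - k * k
    return a, err


if __name__ == "__main__":
    # Synthetic decaying signal x[n] = 0.5 * x[n-1], used only as a sanity check.
    frame = [0.5 ** n for n in range(50)]
    coeffs, residual = lpc_coefficients(frame, 2)
    print(coeffs, residual)
```

In a pipeline of the kind the abstract describes, one coefficient vector per short frame of the recording would form the sequence of feature vectors fed to the TDNN; the frame segmentation itself is left out of this sketch.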
Pages: 205-214
Page count: 10