Comparison of features for musical instrument recognition

Cited by: 63
Author
Eronen, A [1]
Affiliation
[1] Tampere Univ, Signal Proc Lab, FIN-33101 Tampere, Finland
Source
PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS | 2001
DOI
10.1109/ASPAA.2001.969532
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 [Acoustics]; 082403 [Underwater Acoustic Engineering];
Abstract
Several features were compared with regard to recognition performance in a musical instrument recognition system. Both mel-frequency and linear prediction cepstral and delta cepstral coefficients were calculated. Linear prediction analysis was carried out both on a uniform and a warped frequency scale, and reflection coefficients were also used as features. The performance of previously described features relating to the temporal development, modulation properties, brightness, and spectral synchrony of sounds was also analysed. The database consisted of 5286 acoustic and synthetic solo tones from 29 different Western orchestral instruments, out of which 16 instruments were included in the test set. The best performance for solo tone recognition, 35% for individual instruments and 77% for families, was obtained with a feature set consisting of two sets of mel-frequency cepstral coefficients and a subset of the other analysed features. The confusions made by the system were analysed and compared to results reported in a human perception experiment.
Pages: 19-22
Number of pages: 4