Nonnegative features of spectro-temporal sounds for classification

被引:47
作者
Cho, YC [1 ]
Choi, SJ [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci, Pohang 790784, South Korea
关键词
acoustic feature extraction; general sound recognition; nonnegative matrix factorization;
D O I
10.1016/j.patrec.2004.11.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A parts-based representation is a way of understanding object recognition in the brain. The nonnegative matrix factorization (NMF) is an algorithm which is able to learn a parts-based representation by allowing only non-subtractive combinations [Lee, D.D., Seung, H.S., 1999. Learning the parts of objects by non-negative matrix factorization. Nature 401, 788-791]. In this paper we incorporate a parts-based representation of spectro-temporal sounds into the acoustic feature extraction, which leads to nonnegative features. We present a method of inferring encoding variables in the framework of NMF and show that the method produces robust acoustic features in the presence of noise in the task of general sound classification.. Experimental results confirm that the proposed feature extraction method improves the classification performance, especially in the presence of noise, compared to independent component analysis (ICA) which produces holistic features. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:1327 / 1336
页数:10
相关论文
共 15 条