Visualizing music and audio using self-similarity

被引:145
作者
Foote, J [1 ]
机构
[1] FX Palo Alto Lab Inc, Palo Alto, CA 94304 USA
来源
ACM MULTIMEDIA 99, PROCEEDINGS | 1999年
关键词
music visualization; audio analysis; audio similarity;
D O I
10.1145/319463.319472
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel approach to visualizing the time structure of music and audio. The acoustic similarity between any two instants of an audio recording is displayed in a 2D representation, allowing identification of structural and rhythmic characteristics. Examples are presented for classical and popular music. Applications include content-based analysis and segmentation, as well as tempo and structure extraction.
引用
收藏
页码:77 / 80
页数:4
相关论文
共 7 条
[1]   A comparison of features for speech, music discrimination. [J].
Carey, MJ ;
Parris, ES ;
Lloyd-Thomas, H .
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, :149-152
[2]   Content-based retrieval of music and audio [J].
Foote, JT .
MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, 1997, 3229 :138-147
[3]  
JOHNSON P, 1999, SCI SKEPTIC FAQ
[4]  
KOENIG WK, JASA, V18, P19
[5]  
Potter Ralph, 1947, VISIBLE SPEECH
[6]  
Rabiner L., 1993, Fundamentals of Speech Recognition
[7]   A visualization of music [J].
Smith, SM ;
Williams, GN .
VISUALIZATION '97 - PROCEEDINGS, 1997, :499-503