Beat tracking by dynamic programming

被引:190
作者
Ellis, Daniel P. W. [1 ]
机构
[1] Columbia Univ, LabROSA, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
D O I
10.1080/09298210701653344
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Beat tracking - i.e. deriving from a music audio signal a sequence of beat instants that might correspond to when a human listener would tap his foot - involves satisfying two constraints. On the one hand, the selected instants should generally correspond to moments in the audio where a beat is indicated, for instance by the onset of a note played by one of the instruments. On the other hand, the set of beats should reflect a locally-constant inter-beat-interval, since it is this regular spacing between beat times that defines musical rhythm. These dual constraints map neatly onto the two constraints optimized in dynamic programming, the local match, and the transition cost. We describe a beat tracking system which first estimates a global tempo, uses this tempo to construct a transition cost function, then uses dynamic programming to find the best-scoring set of beat times that reflect the tempo as well as corresponding to moments of high 'onset strength' in a function derived from the audio. This very simple and computationally efficient procedure is shown to perform well on the MIREX-06 beat tracking training data, achieving an average beat accuracy of just under 60% on the development data. We also examine the impact of the assumption of a fixed target tempo, and show that the system is typically able to track tempo changes in a range of +/- 10% of the target tempo.
引用
收藏
页码:51 / 60
页数:10
相关论文
共 19 条
[1]  
Bellman R., 1957, DYNAMIC PROGRAMMING
[2]   Computational models of beat induction: The rule-based approach [J].
Desain, P ;
Honing, H .
JOURNAL OF NEW MUSIC RESEARCH, 1999, 28 (01) :29-42
[3]   Automatic extraction of tempo and beat from expressive performances [J].
Dixon, S .
JOURNAL OF NEW MUSIC RESEARCH, 2001, 30 (01) :39-58
[4]  
DIXON S, 2006, J NEW MUSIC RES, V23, P195
[5]  
Ellis D.P. W., 2005, PLP and RASTA (and MFCC, and inversion) in Matlab
[6]  
Ellis DPW, 2007, INT CONF ACOUST SPEE, P1429
[7]  
Ellis Daniel P. W., 2003, USPOP2002 POP MUSIC
[8]  
Goto M., 1994, Proceedings ACM Multimedia '94, P365, DOI 10.1145/192593.192700
[9]   An experimental comparison of audio tempo induction algorithms [J].
Gouyon, Fabien ;
Klapuri, Anssi ;
Dixon, Simon ;
Alonso, Miguel ;
Tzanetakis, George ;
Uhle, Christian ;
Cano, Pedro .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05) :1832-1844
[10]  
JEHAN T, 2005, THESIS MIT MEDIA LAB