Game-powered machine learning

被引:21
作者
Barrington, Luke [1 ]
Turnbull, Douglas [2 ]
Lanckriet, Gert [1 ]
机构
[1] Univ Calif, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[2] Ithaca Coll, Dept Comp Sci, Ithaca, NY 14850 USA
基金
美国国家科学基金会;
关键词
MUSIC;
D O I
10.1073/pnas.1014748109
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Searching for relevant content in a massive amount of multimedia information is facilitated by accurately annotating each image, video, or song with a large number of relevant semantic keywords, or tags. We introduce game-powered machine learning, an integrated approach to annotating multimedia content that combines the effectiveness of human computation, through online games, with the scalability of machine learning. We investigate this framework for labeling music. First, a socially-oriented music annotation game called Herd It collects reliable music annotations based on the "wisdom of the crowds." Second, these annotated examples are used to train a supervised machine learning system. Third, the machine learning system actively directs the annotation games to collect new data that will most benefit future model iterations. Once trained, the system can automatically annotate a corpus of music much larger than what could be labeled using human computation alone. Automatically annotated songs can be retrieved based on their semantic relevance to text-based queries (e.g., "funky jazz with saxophone," "spooky electronica," etc.). Based on the results presented in this paper, we find that actively coupling annotation games with machine learning provides a reliable and scalable approach to making searchable massive amounts of multimedia data.
引用
收藏
页码:6411 / 6416
页数:6
相关论文
共 22 条
[1]  
[Anonymous], CASUAL GAMES WHITE P
[2]  
[Anonymous], 2010, MACH LEARN
[3]  
[Anonymous], 22 INT C HUM FACT CO
[4]   The anatomy of a large-scale hypertextual Web search engine [J].
Brin, S ;
Page, L .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :107-117
[5]   Active learning with statistical models [J].
Cohn, DA ;
Ghahramani, Z ;
Jordan, MI .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145
[6]   Predicting protein structures with a multiplayer online game [J].
Cooper, Seth ;
Khatib, Firas ;
Treuille, Adrien ;
Barbero, Janos ;
Lee, Jeehyung ;
Beenen, Michael ;
Leaver-Fay, Andrew ;
Baker, David ;
Popovic, Zoran ;
Players, Foldit .
NATURE, 2010, 466 (7307) :756-760
[7]  
Coviello E, 2010, 11 INT SOC MUS INF R
[8]  
Dasgupta S, 2007, J MACH LEARN RES, V8, P203
[9]  
Downie JS, 2008, AUDIO TAG CLASSIFICA
[10]  
Eck D, 2007, 21 C NEUR INF PROC S