Single- vs. multiple-instance classification

被引:30
作者
Alpaydin, Ethem [1 ]
Cheplygina, Veronika [2 ]
Loog, Marco [2 ,3 ]
Tax, David M. J. [2 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[2] Delft Univ Technol, Pattern Recognit Lab, NL-2628 CD Delft, Netherlands
[3] Univ Copenhagen, Image Grp, DK-2100 Copenhagen, Denmark
关键词
Classification; Multiple-instance learning; Similarity-based representation; Bioinformatics;
D O I
10.1016/j.patcog.2015.04.006
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
In multiple-instance (MI) classification, each input object or event is represented by a set of instances, named a bag, and it is the bag that carries a label. MI learning is used in different applications where data is formed in terms of such bags and where individual instances in a bag do not have a label. We review MI classification from the point of view of label information carried in the instances in a bag, that is, their sufficiency for classification. Our aim is to contrast MI with the standard approach of single-instance (SI) classification to determine when casting a problem in the MI framework is preferable. We compare instance-level classification, combination by noisy-or, and bag-level classification, using the support vector machine as the base classifier. We define a set of synthetic MI tasks at different complexities to benchmark different MI approaches. Our experiments on these and two real-world bioinformatics applications on gene expression and text categorization indicate that depending on the situation, a different decision mechanism, at the instance- or bag-level, may be appropriate. If the instances in a bag provide complementary information, a bag-level MI approach is useful; but sometimes the bag information carries no useful information at all and an instance-level SI classifier works equally well, or better. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2831 / 2838
页数:8
相关论文
共 20 条
[1]
Multiple instance classification: Review, taxonomy and comparative study [J].
Amores, Jaume .
ARTIFICIAL INTELLIGENCE, 2013, 201 :81-105
[2]
[Anonymous], 2007, ICML '07
[3]
A note on learning from multiple-instance examples [J].
Blum, A ;
Kalai, A .
MACHINE LEARNING, 1998, 30 (01) :23-29
[4]
MILES: Multiple-Instance Learning via Embedded instance Selection [J].
Chen, Yixin ;
Bi, Jinbo ;
Wang, James Z. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :1931-1947
[5]
Multiple instance learning with bag dissimilarities [J].
Cheplygina, Veronika ;
Tax, David M. J. ;
Loog, Marco .
PATTERN RECOGNITION, 2015, 48 (01) :264-275
[6]
Cheplygina V, 2012, INT C PATT RECOG, P1156
[7]
Solving the multiple instance problem with axis-parallel rectangles [J].
Dietterich, TG ;
Lathrop, RH ;
LozanoPerez, T .
ARTIFICIAL INTELLIGENCE, 1997, 89 (1-2) :31-71
[8]
A review of multi-instance learning assumptions [J].
Foulds, James ;
Frank, Eibe .
KNOWLEDGE ENGINEERING REVIEW, 2010, 25 (01) :1-25
[9]
Gehler P.V., 2007, Artificial intelligence and statistics, P123
[10]
Multiple-instance learning as a classifier combining problem [J].
Li, Yan ;
Tax, David M. J. ;
Duin, Robert P. W. ;
Loog, Marco .
PATTERN RECOGNITION, 2013, 46 (03) :865-874