Greedy learning of multiple objects in images using robust statistics and factorial learning

被引:41
作者
Williams, CKI [1 ]
Titsias, MK [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH1 2QL, Midlothian, Scotland
关键词
D O I
10.1162/089976604773135096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider data that are images containing views of multiple objects. Our task is to learn about each of the objects present in the images. This task can be approached as a factorial learning problem, where each image must be explained by instantiating a model for each of the objects present with the correct instantiation parameters. A major problem with learning a factorial model is that as the number of objects increases, there is a combinatorial explosion of the number of configurations that need to be considered. We develop a method to extract object models sequentially from the data by making use of a robust statistical method, thus avoiding the combinatorial explosion, and present results showing successful extraction of objects from real images.
引用
收藏
页码:1039 / 1062
页数:24
相关论文
共 18 条
[1]  
[Anonymous], 1996, P EUROPEAN C COMPUTE
[2]  
[Anonymous], EM ALGORITHM
[3]   Unsupervised Learning [J].
Barlow, H. B. .
NEURAL COMPUTATION, 1989, 1 (03) :295-311
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]  
Frey BJ, 2002, ADV NEUR IN, V14, P721
[6]   Transformation-invariant clustering using the EM algorithm [J].
Frey, BJ ;
Jojic, N .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (01) :1-17
[7]  
FREY BJ, 1999, P IEEE C COMP VIS PA
[8]  
GHAHRAMANI Z, 1995, ADV NEURAL INFORMATI, V7, P617
[9]  
HINTON GE, 1994, ADV NEURAL INFORMATI, V6
[10]  
JOJIC N, 2001, P IEEE C COMP VIS PA