Greedy learning of multiple objects in images using robust statistics and factorial learning

Cited by: 41

Authors:

Williams, CKI [1]

Titsias, MK [1]

Affiliations:

[1] Univ Edinburgh, Sch Informat, Edinburgh EH1 2QL, Midlothian, Scotland

Source:

NEURAL COMPUTATION | 2004 / Vol. 16 / No. 05

Keywords:

DOI:

10.1162/089976604773135096

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We consider data that are images containing views of multiple objects. Our task is to learn about each of the objects present in the images. This task can be approached as a factorial learning problem, where each image must be explained by instantiating a model for each of the objects present with the correct instantiation parameters. A major problem with learning a factorial model is that as the number of objects increases, there is a combinatorial explosion of the number of configurations that need to be considered. We develop a method to extract object models sequentially from the data by making use of a robust statistical method, thus avoiding the combinatorial explosion, and present results showing successful extraction of objects from real images.


Pages: 1039-1062

Page count: 24

References (18 in total):

[1] [Anonymous], 1996, P EUROPEAN C COMPUTE

[2] [Anonymous], EM ALGORITHM

[3] Barlow, H. B. Unsupervised Learning [J]. NEURAL COMPUTATION, 1989, 1 (03): 295-311

[4] DEMPSTER, AP; LAIRD, NM; RUBIN, DB. MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): 1-38

[5] Frey BJ, 2002, ADV NEUR IN, V14, P721

[6] Frey, BJ; Jojic, N. Transformation-invariant clustering using the EM algorithm [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (01): 1-17

[7] FREY BJ, 1999, P IEEE C COMP VIS PA

[8] GHAHRAMANI Z, 1995, ADV NEURAL INFORMATI, V7, P617

[9] HINTON GE, 1994, ADV NEURAL INFORMATI, V6

[10] JOJIC N, 2001, P IEEE C COMP VIS PA
