Learning to Combine Bottom-Up and Top-Down Segmentation

被引:49
作者
Levin, Anat [1 ]
Weiss, Yair [1 ]
机构
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
关键词
Image segmentation; Object detection;
D O I
10.1007/s11263-008-0166-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bottom-up segmentation based only on low-level cues is a notoriously difficult problem. This difficulty has lead to recent top-down segmentation algorithms that are based on class-specific image information. Despite the success of top-down algorithms, they often give coarse segmentations that can be significantly refined using low-level cues. This raises the question of how to combine both top-down and bottom-up cues in a principled manner. In this paper we approach this problem using supervised learning. Given a training set of ground truth segmentations we train a fragment-based segmentation algorithm which takes into account both bottom-up and top-down cues simultaneously, in contrast to most existing algorithms which train top-down and bottom-up modules separately. We formulate the problem in the framework of Conditional Random Fields (CRF) and derive a feature induction algorithm for CRF, which allows us to efficiently search over thousands of candidate fragments. Whereas pure top-down algorithms often require hundreds of fragments, our simultaneous learning procedure yields algorithms with a handful of fragments that are combined with low-level cues to efficiently compute high quality segmentations.
引用
收藏
页码:105 / 118
页数:14
相关论文
共 25 条
[1]  
[Anonymous], 2004, P WORKSH STAT LEARN
[2]  
[Anonymous], P IEEE C COMP VIS PA
[3]  
[Anonymous], 2001, P 18 INT C MACH LEAR, DOI DOI 10.5555/645530.655813
[4]  
BARBU A, 2003, P IEEE INT C COMP VI
[5]  
BORENSTEIN E, 2002, P EUR C COMP VIS MAY
[6]  
He X., 2006, ECCV
[7]  
He XM, 2004, PROC CVPR IEEE, P695
[8]  
KUMAR MP, 2004, P IEEE C COMP VIS PA
[9]  
Kumar S., 2003, P IEEE INT C COMP VI
[10]  
Lafferty JohnD., 2004, ICML