Robust wide-baseline stereo from maximally stable extremal regions

Cited by: 3330
Authors
Matas, J
Chum, O
Urban, M
Pajdla, T
Affiliations
[1] Czech Tech Univ, Ctr Machine Percept, Dept Cybernet, CZ-12135 Prague, Czech Republic
[2] Univ Surrey, CVSSP, Guildford GU2 7XH, Surrey, England
Keywords
wide-baseline stereo; distinguished regions; maximally stable extremal regions; MSER; robust metric;
DOI
10.1016/j.imavis.2004.02.006
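Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 [Pattern recognition and intelligent systems]; 0812 [Computer science and technology]; 0835 [Software engineering]; 1405 [Intelligent science and technology];
Abstract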
The wide-baseline stereo problem, i.e. the problem of establishing correspondences between a pair of images taken from different viewpoints, is studied. A new set of image elements that are put into correspondence, the so-called extremal regions, is introduced. Extremal regions possess highly desirable properties: the set is closed under (1) continuous (and thus projective) transformation of image coordinates and (2) monotonic transformation of image intensities. An efficient (near linear complexity) and practically fast detection algorithm (near frame rate) is presented for an affinely invariant stable subset of extremal regions, the maximally stable extremal regions (MSER). A new robust similarity measure for establishing tentative correspondences is proposed. The robustness ensures that invariants from multiple measurement regions (regions obtained by invariant constructions from extremal regions), some that are significantly larger (and hence discriminative) than the MSERs, may be used to establish tentative correspondences. The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes. Significant change of scale (3.5×), illumination conditions, out-of-plane rotation, occlusion, locally anisotropic scale change and 3D translation of the viewpoint are all present in the test problems. Good estimates of epipolar geometry (average distance from corresponding points to the epipolar line below 0.09 of the inter-pixel distance) are obtained. © 2004 Elsevier B.V. All rights reserved.
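The closure of extremal regions under monotonic intensity transformations, stated in the abstract, can be illustrated with a minimal sketch. It is not the paper's detection algorithm: it brute-forces every threshold instead of running in near-linear time, it performs no maximal-stability selection, and it only considers dark regions (connected components of pixels at or below a threshold). The function names, the 4-connectivity choice, the toy image and the `brighten` mapping are all assumptions made for illustration.

```python
from collections import deque


def extremal_regions(image, threshold):
    """Return the 4-connected components of pixels with intensity <= threshold."""
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    regions = set()
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or image[r][c] > threshold:
                continue
            # Breadth-first flood fill from an unvisited pixel below the threshold.
            queue = deque([(r, c)])
            seen[r][c] = True
            component = []
            while queue:
                y, x = queue.popleft()
                component.append((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and image[ny][nx] <= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            regions.add(frozenset(component))
    return regions


def all_extremal_regions(image):
    """Collect the distinct regions found at every intensity level of the image."""
    levels = sorted({value for row in image for value in row})
    found = set()
    for level in levels:
        found |= extremal_regions(image, level)
    return found


def brighten(value):
    """Hypothetical photometric change; strictly increasing for value >= 0."""
    return 3 * value * value + 10


# Toy image (assumed for illustration): two dark blobs on a bright background.
image = [
    [9, 9, 9, 9, 9],
    [9, 1, 2, 9, 4],
    [9, 2, 1, 9, 4],
    [9, 9, 9, 9, 9],
]
transformed = [[brighten(value) for value in row] for row in image]

# The intensities differ, but the family of extremal regions is identical.
same_regions = all_extremal_regions(image) == all_extremal_regions(transformed)
```

Because a strictly increasing mapping preserves the ordering of pixel values, every threshold set of the original image is also a threshold set of the transformed one, so `same_regions` evaluates to `True` for any such mapping.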
Pages: 761-767
Page count: 7