3-d depth reconstruction from a single still image

被引：406

作者：

Saxena, Ashutosh ^{[1
]}

Chung, Sung H. ^{[1
]}

Ng, Andrew Y. ^{[1
]}

机构：

[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2008年 / 76卷 / 01期

关键词：

monocular vision; learning depth; 3D reconstruction; dense reconstruction; Markov Random Field; depth estimation; monocular depth; stereo vision; hand-held camera; visual modeling;

D O I：

10.1007/s11263-007-0071-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider the task of 3-d depth estimation from a single still image. We take a supervised learning approach to this problem, in which we begin by collecting a training set of monocular images (of unstructured indoor and outdoor environments which include forests, sidewalks, trees, buildings, etc.) and their corresponding ground-truth depthmaps. Then, we apply supervised learning to predict the value of the depthmap as a function of the image. Depth estimation is a challenging problem, since local features alone are insufficient to estimate depth at a point, and one needs to consider the global context of the image. Our model uses a hierarchical, multiscale Markov Random Field (MRF) that incorporates multiscale local- and global-image features, and models the depths and the relation between depths at different points in the image. We show that, even on unstructured scenes, our algorithm is frequently able to recover fairly accurate depthmaps. We further propose a model that incorporates both monocular cues and stereo (triangulation) cues, to obtain significantly more accurate depth estimates than is possible using either monocular or stereo cues alone.

引用

页码：53 / 69

页数：17

共 63 条

[1] SCAPE: Shape Completion and Animation of People [J].

Anguelov, D ;

Srinivasan, P ;

Koller, D ;

Thrun, S ;

Rodgers, J ;

Davis, J .

ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416

[2]

[Anonymous], NEURAL INFORM PROCES

[3]

[Anonymous], 2012, Computer Vision: A Modern Approach

[4] PERFORMANCE OF OPTICAL-FLOW TECHNIQUES [J].

BARRON, JL ;

FLEET, DJ ;

BEAUCHEMIN, SS .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :43-77

[5] Advances in computational stereo [J].

Brown, MZ ;

Burschka, D ;

Hager, GD .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (08) :993-1008

[6] Top-down influences on stereoscopic depth-perception [J].

Bulthoff, I ;

Bulthoff, H ;

Sinha, P .

NATURE NEUROSCIENCE, 1998, 1 (03) :254-257

[7]

Cornelis N., 2006, VID P CVPR VPCVPR

[8] Single view metrology [J].

Criminisi, A ;

Reid, I ;

Zisserman, A .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2000, 40 (02) :123-148

[9] PERFORMANCE ANALYSIS OF STEREO, VERGENCE, AND FOCUS AS DEPTH CUES FOR ACTIVE VISION [J].

DAS, S ;

AHUJA, N .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (12) :1213-1219

[10]

Davies E, 1997, MACHINE VISION THEOR

← 1 2 3 4 5 6 7 →