Modeling the world from Internet photo collections

被引:1499
作者
Snavely, Noah [1 ]
Seitz, Steven M. [1 ]
Szeliski, Richard [2 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Microsoft Res, Redmond, WA USA
基金
美国人文基金会;
关键词
structure from motion; 3D scene analysis; Internet imagery; photo browsers; 3D navigation;
D O I
10.1007/s11263-007-0107-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are billions of photographs on the Internet, comprising the largest and most diverse photo collection ever assembled. How can computer vision researchers exploit this imagery? This paper explores this question from the standpoint of 3D scene modeling and visualization. We present structure-from-motion and image-based rendering algorithms that operate on hundreds of images downloaded as a result of keyword-based image search queries like "Notre Dame" or "Trevi Fountain." This approach, which we call Photo Tourism, has enabled reconstructions of numerous well-known world sites. This paper presents these algorithms and results as a first step towards 3D modeling of the world's well-photographed sites, cities, and landscapes from Internet imagery, and discusses key open problems and challenges for the research community.
引用
收藏
页码:189 / 210
页数:22
相关论文
共 82 条
  • [1] AKBARZADEH A, 2006, P INT S 3D DAT PROC
  • [2] ALIAGA D, 2003, P SIGGRAPH S INT 3D, P163
  • [3] Sea of images - A dense sampling approach for rendering large indoor environments
    Aliaga, DG
    Funkhouser, T
    Yanovsky, D
    Carlbom, I
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2003, 23 (06) : 22 - 30
  • [4] Aloimonos Y., 1993, ACTIVE PERCEPTION
  • [5] Anandan, 2003, Proceedings of the eleventh ACM international conference on Multimedia, P156
  • [6] [Anonymous], P INT C COMP VIS
  • [7] [Anonymous], P EUR C COMP VIS ECC
  • [8] [Anonymous], 2004, 340 I COMP SCI FORTH
  • [9] [Anonymous], MITCSAILTR2005056
  • [10] [Anonymous], 2006, P CVPR 06 IE COMP SO