CAUSAL SCENE UNDERSTANDING

被引:6
作者
COOPER, PR
BIRNBAUM, LA
BRAND, ME
机构
[1] Intelligent Perception and Action Laboratory, Institute for the Learning Sciences, Northwestern University, Evanston, IL 60201
[2] Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA 01239
关键词
D O I
10.1006/cviu.1995.1051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most computer vision systems are concerned with computing the whats and wheres of a scene. We describe a set of programs concerned instead with computing the whys and hows-why the scene is the way it is, and how an agent can interact with it. The basis of our approach lies in the construction of a causal explanation of a scene - a representation that describes what affects what in the scene, how these elements affect each other, and why they affect each other the way they do. Such explanations, by definition and design, must encompass representations of the potentials for action in a scene, and thus form a natural basis for describing how scene elements serve purposes - i.e., functional descriptions. As a concrete case study in causal scene understanding, this paper focuses primarily on ways to exploit the causality of objects in static equilibrium, in particular, the causality of support. We describe three camera-to-commentary vision systems, operating in three different domains, that develop causal explanations of scenes from visual images of those scenes and, in the process, provide novel solutions to a number of traditional problems in vision and robotics, including occlusion, focus of attention, and grasp planning. We also show how the kinds of causal descriptions produced by these systems can be exploited to physically interact with the scene. (C) 1995 Academic Press, Inc.
引用
收藏
页码:215 / 231
页数:17
相关论文
共 35 条
[1]  
ALOIMONOS J, 1987, 1ST P INT C COMP VIS, P35
[2]   ACTIVE PERCEPTION [J].
BAJCSY, R .
PROCEEDINGS OF THE IEEE, 1988, 76 (08) :996-1005
[3]   ANIMATE VISION [J].
BALLARD, DH .
ARTIFICIAL INTELLIGENCE, 1991, 48 (01) :57-86
[4]  
Ballard DH, 1982, COMPUTER VISION
[5]  
BINFORD T, 1982, INT J ROB RES, V1
[6]  
Blake A., 1993, [1993] Proceedings Fourth International Conference on Computer Vision, P724, DOI 10.1109/ICCV.1993.378142
[7]  
Blake A., 1992, ACTIVE VISION
[8]  
BRAND M, 1993, PROCEEDINGS OF THE ELEVENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, P588
[9]  
BRAND M, 1992, PROCEEDINGS OF THE FOURTEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, P720
[10]  
BRAND M, 1992, P SPIE WORKSHOP INTE