AN OPTIMIZATION-BASED APPROACH TO THE INTERPRETATION OF SINGLE LINE DRAWINGS AS 3D WIRE FRAMES

被引:108
作者
LECLERC, YG [1 ]
FISCHLER, MA [1 ]
机构
[1] SRI INT,CTR ARTIFICIAL INTELLIGENCE,MENLO PK,CA 94025
关键词
D O I
10.1007/BF00129683
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Line drawings provide an effective means of communication about the geometry of 3D objects. An understanding of how to duplicate the way humans interpret line drawings is extremely important in enabling man-machine communication with respect to images, diagrams, and spatial constructs. In particular, such an understanding could be used to provide the human with the capability to create a line-drawing sketch of a polyhedral object that the machine can automatically convert into the intended 3D model. A recently published paper (Marill 1991) presented a simple optimization procedure supposedly able to duplicate human judgment in recovering the 3D "wire frame" geometry of objects depicted in line drawings. Marill provided some impressive examples, but no theoretical justification for his approach. Here, we introduce our own work by first critically examining Marill's algorithm. We provide an explanation for why Marill's algorithm was able to perform as well as it did on the examples he presented, discuss its weaknesses, and show very simple examples where it fails. We then provide an algorithm that improves on Marill's results. In particular, we show that an effective objective function must favor both symmetry and planarity-Marill deals only with the symmetry issue. By modifying Marill's objective function to explicitly favor planar-faced solutions, and by using a more competent optimization technique, we were able to demonstrate significantly improved performance in all of the examples Marill provided and those additional ones we constructed ourselves. Finally, we examine some questions relevant to the implications of this work for understanding the human ability to interpret line drawings.
引用
收藏
页码:113 / 136
页数:24
相关论文
共 23 条
[1]  
[Anonymous], 1986, NUMERICAL RECIPES
[2]  
BARNARD ST, 1983, 8TH P INT JOINT C AR
[3]   INTERPRETING LINE DRAWINGS AS 3-DIMENSIONAL SURFACES [J].
BARROW, HG ;
TENENBAUM, JM .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :75-116
[4]   SEEING THINGS [J].
CLOWES, MB .
ARTIFICIAL INTELLIGENCE, 1971, 2 (01) :79-116
[5]   THE USE OF GRADIENT AND DUAL-SPACE IN LINE-DRAWING INTERPRETATION [J].
DRAPER, SW .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :461-508
[6]   A QUANTITATIVE APPROACH TO FIGURAL GOODNESS [J].
HOCHBERG, J ;
MCALISTER, E .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1953, 46 (05) :361-364
[7]  
Huffman D. A., 1971, Machine Intelligence Volume 6, P295
[8]   A THEORY OF ORIGAMI WORLD [J].
KANADE, T .
ARTIFICIAL INTELLIGENCE, 1980, 13 (03) :279-311
[9]   CONSTRUCTING SIMPLE STABLE DESCRIPTIONS FOR IMAGE PARTITIONING [J].
LECLERC, YG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1989, 3 (01) :73-102
[10]  
MACKWORTH AK, 1973, ARTIF INTELL, V4, P121, DOI 10.1016/0004-3702(73)90003-9