A coarse-to-fine strategy for multiclass shape detection

被引:75
作者
Amit, Y [1 ]
Geman, D
Fan, XD
机构
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[3] Johns Hopkins Univ, Dept Appl Math & Stat, Baltimore, MD 21218 USA
[4] Johns Hopkins Univ, Whitaker Biomed Engn Inst, Baltimore, MD 21218 USA
[5] Johns Hopkins Univ, Dept Elect & Comp Engn, Baltimore, MD 21218 USA
基金
美国国家科学基金会;
关键词
shape detection; multiple classes; statistical model; spread edges; coarse-to-fine search; online competition;
D O I
10.1109/TPAMI.2004.111
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiclass shape detection, in the sense of recognizing and localizing instances from multiple shape classes, is formulated as a two-step process in which local indexing primes global interpretation. During indexing a list of instantiations ( shape identities and poses) is compiled, constrained only by no missed detections at the expense of false positives. Global information, such as expected relationships among poses, is incorporated afterward to remove ambiguities. This division is motivated by computational efficiency. In addition, indexing itself is organized as a coarse-to-fine search simultaneously in class and pose. This search can be interpreted as successive approximations to likelihood ratio tests arising from a simple ("naive Bayes") statistical model for the edge maps extracted from the original images. The key to constructing efficient "hypothesis tests" for multiple classes and poses is local ORing; in particular, spread edges provide imprecise but common and locally invariant features. Natural tradeoffs then emerge between discrimination and the pattern of spreading. These are analyzed mathematically within the model-based framework and the whole procedure is illustrated by experiments in reading license plates.
引用
收藏
页码:1606 / 1621
页数:16
相关论文
共 37 条