Discriminant Saliency, the Detection of Suspicious Coincidences, and Applications to Visual Recognition

被引：219

作者：

Gao, Dashan ^{[1
]}

Han, Sunhyoung ^{[2
]}

Vasconcelos, Nuno ^{[2
]}

机构：

[1] Gen Elect Global Res, Visualizat & Comp Vis Lab, Niskayuna, NY 12309 USA

[2] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2009年 / 31卷 / 06期

基金：

美国国家科学基金会;

关键词：

Visual saliency; interest point detection; coincidence detection; visual recognition; object detection from cluttered scenes; infomax feature selection; saliency measures; natural image statistics; OBJECT RECOGNITION; ATTENTION; FEATURES; MODEL; COMPRESSION; TEXTURE; CONTEXT; SCALE;

D O I：

10.1109/TPAMI.2009.27

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A discriminant formulation of top-down visual saliency, intrinsically connected to the recognition problem, is proposed. The new formulation is shown to be closely related to a number of classical principles for the organization of perceptual systems, including infomax, inference by detection of suspicious coincidences, classification with minimal uncertainty, and classification with minimum probability of error. The implementation of these principles with computational parsimony, by exploitation of the statistics of natural images, is investigated. It is shown that Barlow's principle of inference by the detection of suspicious coincidences enables computationally efficient saliency measures which are nearly optimal for classification. This principle is adopted for the solution of the two fundamental problems in discriminant saliency: feature selection and saliency detection. The resulting saliency detector is shown to have a number of interesting properties, and acts effectively as a focus of attention mechanism for the selection of interest points according to their relevance for visual recognition. Experimental evidence shows that the selected points have good performance with respect to 1) the ability to localize objects embedded in significant amounts of clutter, 2) the ability to capture information relevant for image classification, and 3) the richness of the set of visual attributes that can be considered salient.

引用

页码：989 / 1005

页数：17

共 67 条

[1]

[Anonymous], P IEEE C COMP VIS PA

[2]

[Anonymous], 1966, Textures: a photographic album for artists and designers

[3]

[Anonymous], 1994, J APPL STAT, DOI DOI 10.1080/757582976

[4]

[Anonymous], 1988, Proceedings of International Conference ofComputer Vision (ICCV'88), DOI [10.1109/CCV.1988.590008, DOI 10.1109/CCV.1988.590008]

[5] THE CURVATURE PRIMAL SKETCH [J].

ASADA, H ;

BRADY, M .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (01) :2-14

[6] SOME INFORMATIONAL ASPECTS OF VISUAL PERCEPTION [J].

ATTNEAVE, F .

PSYCHOLOGICAL REVIEW, 1954, 61 (03) :183-193

[7] Redundancy reduction revisited [J].

Barlow, H .

NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2001, 12 (03) :241-253

[8]

Barlow H.B., 1985, MODELS VISUAL CORTEX, P37

[9] USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].

BATTITI, R .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550

[10] INFERRING SURFACES FROM IMAGES [J].

BINFORD, TO .

ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :205-244

← 1 2 3 4 5 6 7 →