Automatic image annotation using adaptive color classification

被引:52
作者
Saber, E
Tekalp, AM
Eschbach, R
Knox, K
机构
[1] UNIV ROCHESTER,CTR ELECTR IMAGING SYST,ROCHESTER,NY 14627
[2] XEROX CORP,SUPPLIES DEV & MFG UNIT,WEBSTER,NY 14580
[3] XEROX CORP,JOSEPH C WILSON CTR RES & TECHNOL,WEBSTER,NY 14580
来源
GRAPHICAL MODELS AND IMAGE PROCESSING | 1996年 / 58卷 / 02期
基金
美国国家科学基金会;
关键词
D O I
10.1006/gmip.1996.0010
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We describe a system which automatically annotates images with a set of prespecified keywords, based on supervised color classification of pixels into N prespecified classes using simple pixelwise operations. The conditional distribution of the chrominance components of pixels belonging to each class is modeled by a two-dimensional Gaussian function, where the mean vector and the covariance matrix for each class are estimated from appropriate training sets. Then, a succession of binary hypothesis tests with image-adaptive thresholds has been employed to decide whether each pixel in a given image belongs to one of the predetermined classes. To this effect, a universal decision threshold is first selected for each class based on receiver operating characteristics (ROC) curves quantifying the optimum ''true positive'' vs ''false positive'' performance on the training set. Then, a new method is introduced for adapting these thresholds to the characteristics of individual input images based on histogram cluster analysis. If a particular pixel is found to belong to more than one class, a maximum a posteriori probability (MAP) rule is employed to resolve the ambiguity. The performance improvement obtained by the proposed adaptive hypothesis testing approach over using universal decision thresholds is demonstrated by annotating a database of 31 images. (C) 1996 Academic Press, Inc.
引用
收藏
页码:115 / 126
页数:12
相关论文
共 30 条
[1]   A VISUAL INFORMATION MANAGEMENT-SYSTEM FOR THE INTERACTIVE RETRIEVAL OF FACES [J].
BACH, JR ;
PAUL, S ;
JAIN, R .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (04) :619-628
[2]   A COLOR CLUSTERING TECHNIQUE FOR IMAGE SEGMENTATION [J].
CELENK, M .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1990, 52 (02) :145-170
[3]  
Chang M. M., 1994, Journal of Electronic Imaging, V3, P404, DOI 10.1117/12.183741
[4]  
CHANG SF, 1995, INT SOC OPT ENG, V2501, P995
[5]  
CHANG SK, 1981, IEEE COMPUT, V14
[6]  
Daily M. J., 1989, Proceedings CVPR '89 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.89CH2752-4), P304, DOI 10.1109/CVPR.1989.37865
[7]   KNOWLEDGE-BASED COMPUTER VISION - INTEGRATED PROGRAMMING LANGUAGE AND DATA MANAGEMENT-SYSTEM DESIGN [J].
GOODMAN, AM ;
HARALICK, RM ;
SHAPIRO, LG .
COMPUTER, 1989, 22 (12) :43-54
[8]   TOWARD A DATA MODEL FOR INTEGRATED PICTORIAL DATABASES [J].
GROSKY, WI .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1984, 25 (03) :371-382
[9]  
GROSKY WI, 1992, ADV COMPUT, V35, P237
[10]  
Hedley M., 1992, Journal of Electronic Imaging, V1, P374, DOI 10.1117/12.61158