Semantic image segmentation and object labeling

被引:68
作者
Athanasiadis, Thanos [1 ]
Mylonas, Phivos [1 ]
Avrithis, Yannis [1 ]
Kollias, Stefanos [1 ]
机构
[1] Natl Tech Univ Athens, GR-15773 Athens, Greece
关键词
fuzzy region labeling; semantic region growing; semantic segmentation; visual context;
D O I
10.1109/TCSVT.2007.890636
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a framework for simultaneous image segmentation and object labeling leading to automatic image annotation. Focusing on semantic analysis of images, it contributes to knowledge-assisted multimedia analysis and bridging the gap between semantics and low level visual features. The proposed framework operates at semantic level using possible semantic labels, formally represented as fuzzy sets, to make decisions on handling image regions instead of visual features used traditionally. In order to stress its independence of a specific image segmentation approach we have modified two well known region growing algorithms, i.e., watershed and recursive shortest spanning tree, and compared them to their traditional counterparts. Additionally, a visual context representation and analysis approach is presented, blending global knowledge in interpreting each object locally. Contextual information is based on a novel semantic processing methodology, employing fuzzy algebra and ontological taxonomic knowledge representation. In this process, utilization of contextual knowledge re-adjusts labeling results of semantic region growing, by means of fine-tuning membership degrees of detected concepts. The performance of the overall methodology is evaluated on a real-life still image dataset from two popular domains.
引用
收藏
页码:298 / 312
页数:15
相关论文
共 39 条
[1]  
AADER F, 2002, DESCRIPTION LOGIC HA
[2]  
ADAMEK T, 2005, WORKSH IM AN MULT IN
[3]   SEEDED REGION GROWING [J].
ADAMS, R ;
BISCHOF, L .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) :641-647
[4]   Semantic association of multimedia document descriptions through fuzzy relational algebra and fuzzy reasoning [J].
Akrivas, G ;
Stamou, GB ;
Kollias, S .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2004, 34 (02) :190-196
[5]   Context-sensitive semantic query expansion [J].
Akrivas, G ;
Wallace, M ;
Andreou, G ;
Stamou, G ;
Kollias, S .
2002 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE SYSTEMS, PROCEEDINGS, 2002, :109-114
[6]  
ATHANASIADIS T, 2005, 5 INT WORKSH KNOWL M
[7]  
BENITEZ A, 2001, P ICAIP WARS POL, V2124, P41
[8]   Object-based multimedia content description schemes and applications for MPEG-7 [J].
Benitez, AB ;
Paek, S ;
Chang, SF ;
Puri, A ;
Huang, Q ;
Smith, JR ;
Li, CS ;
Bergman, LD ;
Judice, CN .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2000, 16 (1-2) :235-269
[9]   Efficient matching and indexing of graph models in content-based retrieval [J].
Berretti, S ;
Del Bimbo, A ;
Vicario, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (10) :1089-1105
[10]  
Beucher S., 1993, MATH MORPHOLOGY IMAG