A framework for parsing colonoscopy videos for semantic units

被引:9
作者
Cao, Y [1 ]
Tavanapong, W [1 ]
Kim, K [1 ]
Wong, J [1 ]
Oh, J [1 ]
de Groen, PC [1 ]
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
来源
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3 | 2004年
关键词
content-based analysis; scene segmentation; medical image processing;
D O I
10.1109/ICME.2004.1394625
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable for endoscopic research and education. The first necessary step for the analysis is parsing for semantic units. Since the characteristics of colonoscopy videos differ from those of videos studied in the literature, we introduce a new video parsing framework that includes (i) a new scene definition and a new video parsing paradigm and (ii) a novel scene segmentation algorithm using audio analysis and finite state automata to recognize scenes and associated boundaries. Our experimental results show average precision and recall of 95% and 81% for parsing scenes, respectively. The framework is extensible to videos captured from other endoscopic procedures such as upper gastrointestinal endoscopy, enteroscopy, cystoscopy, and laparoscopy.
引用
收藏
页码:1879 / 1882
页数:4
相关论文
共 10 条
[1]  
Cao Y, 2003, LECT NOTES COMPUT SC, V2728, P446
[2]   Cancer statistics, 2000 [J].
Greenlee, RT ;
Murray, T ;
Bolden, S ;
Wingo, PA .
CA-A CANCER JOURNAL FOR CLINICIANS, 2000, 50 (01) :7-33
[3]  
HAMILTON PW, 1997, J PATHOL JUL, P68
[4]  
LAKARE S, 2002, P SPIE 2002 S MED IM
[5]  
LI L, 2002, P SPIE 2002 S MED IM
[6]  
PHEE SJ, 1998, IEEE ENG MED BIOL MA
[7]   Constructing table-of-content for videos [J].
Rui, Y ;
Huang, TS ;
Mehrotra, S .
MULTIMEDIA SYSTEMS, 1999, 7 (05) :359-368
[8]  
Sundaram H., 2000, Proceedings ACM Multimedia 2000, P95, DOI 10.1145/354384.354440
[9]  
TODMAN A, 2000, P MED IMAGE UNDERSTA
[10]   An integrated system for content-based video retrieval and browsing [J].
Zhang, HJ ;
Wu, JH ;
Zhong, D ;
Smoliar, SW .
PATTERN RECOGNITION, 1997, 30 (04) :643-658