Video OCR for digital news archive

被引:106
作者
Sato, T [1 ]
Kanade, T [1 ]
Hughes, EK [1 ]
Smith, MA [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
来源
1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE, PROCEEDINGS | 1998年
关键词
D O I
10.1109/CAIVD.1998.646033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video OCR is a technique that can greatly help to locate topics of interest in a large digital news video archive via the automatic extraction and reading of captions and annotations. News captions generally provide vital search information about the video being presented -the names of people and places or descriptions of objects. In this paper, two difficult problems of character recognition Sor videos are addressed: low resolution characters and extremely complex back-grounds. We apply an interpolation filter, multi-frame integration and a combination of four filters to solve these problems. Segmenting characters is done by a recognition-based segmentation method and intermediate character recognition results are used to improve the segmentation. The overall recognition results are good enough for use in news indexing. Performing Video OCR an news video and combining its results with Other video understanding techniques will improve the overall understanding of the news video content.
引用
收藏
页码:52 / 60
页数:9
相关论文
empty
未找到相关数据