机构:
Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Sato, T
[1
]
论文数: 引用数:
h-index:
机构:
Kanade, T
[1
]
Hughes, EK
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Hughes, EK
[1
]
Smith, MA
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Smith, MA
[1
]
机构:
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
来源:
1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE, PROCEEDINGS
|
1998年
关键词:
D O I:
10.1109/CAIVD.1998.646033
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Video OCR is a technique that can greatly help to locate topics of interest in a large digital news video archive via the automatic extraction and reading of captions and annotations. News captions generally provide vital search information about the video being presented -the names of people and places or descriptions of objects. In this paper, two difficult problems of character recognition Sor videos are addressed: low resolution characters and extremely complex back-grounds. We apply an interpolation filter, multi-frame integration and a combination of four filters to solve these problems. Segmenting characters is done by a recognition-based segmentation method and intermediate character recognition results are used to improve the segmentation. The overall recognition results are good enough for use in news indexing. Performing Video OCR an news video and combining its results with Other video understanding techniques will improve the overall understanding of the news video content.