Detection and location of multicharacter sequences in lines of imaged text

被引:5
作者
Chen, FR
Bloomberg, DS
Wilcox, LD
机构
[1] Xerox Palo Alto Research Center, Palo Alto, CA 94304
关键词
D O I
10.1117/12.228768
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A system for detecting and locating user-specified search strings, or phrases, in lines of imaged text is described. The phrases may be single words or multiple words, and may contain a partially specified word. The imaged text can be composed of a number of different fonts and graphics. Textlines in a deskewed image are hypothesized using multiresolution morphology. For each textline, the baseline, topline and x-height are identified by simple statistical methods and then used to normalize each textline bounding box. Columns of pixels in the resulting bounding box serve as feature vectors. One hidden Markov model is created for each user specified phrase and another represents all text and graphics other than the user-specified phrases. Phrases are identified using Viterbi decoding on a spotting network created from the models. The operating point of the system can be varied to trade off the percentage of words correctly spotted and the percentage of false alarms. Results are given using a subset of the UW English Document lmage Database I. (C) 1996 SPIE and lS&T.
引用
收藏
页码:37 / 49
页数:13
相关论文
共 19 条
[1]  
AGAZZI O, 1993, P IEEE INT C AC SPEE, V5, P113
[2]  
ANIGBOGU J, 1991, 1ST P INT C DOC ANAL, P785
[3]  
BLOOMBERG DS, 1992, P SOC PHOTO-OPT INS, V1818, P648
[4]  
BLOOMBERG DS, 1995, P SOC PHOTO-OPT INS, V2422, P302, DOI 10.1117/12.205832
[5]   OMNIDOCUMENT TECHNOLOGIES [J].
BOKSER, M .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1066-1078
[6]  
CHEN FR, 1993, 2ND P INT C DOC AN R, P133
[7]  
CHEN MY, 1994, IEEE IMAGE PROC, P174, DOI 10.1109/ICIP.1994.413298
[8]  
ELMS AJ, 1995, 3RD P INTL C DOC AN, P504
[9]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[10]  
GILLIES A, 1992, P US POST SERV ADV T, P557