Automatic detection and recognition of signs from natural scenes

被引:166
作者
Chen, XL [1 ]
Yang, J
Zhang, J
Waibel, A
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Mobile Technol LLC, Pittsburgh, PA 15213 USA
关键词
affine rectification; optical character recognition (OCR); sign detection; sign recognition; text detection;
D O I
10.1109/TIP.2003.819223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an approach to automatic detection and recognition of signs from natural scenes, and its application to a sign translation task. The proposed approach embeds multiresolution and multiscale edge detection, adaptive searching, color analysis, and affine rectification in a hierarchical framework for sign detection, with,different emphases at each phase to handle the text in different sizes, orientations, color distributions and backgrounds. We use affine rectification to recover deformation of the text regions caused by an inappropriate camera view angle. The procedure can significantly improve text detection rate and optical character recognition (OCR) accuracy. Instead of using binary information for OCR, we extract features from an intensity image directly. We propose a local intensity normalization method to effectively handle lighting variations, followed by a Gabor transform to obtain local features, and finally a linear discriminant analysis (LDA) method for feature selection. We have applied the approach in developing a Chinese sign translation system, which can automatically detect and recognize Chinese signs as input from a camera, and translate the recognized text into English.
引用
收藏
页码:87 / 99
页数:13
相关论文
共 35 条
[1]  
[Anonymous], 1999, P ISWC SAN FRANC US
[2]  
BARNHILL RE, 1995, COMP SUPPL, V10, P1
[3]  
Brown MS, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P367, DOI 10.1109/ICCV.2001.937649
[4]   Character extraction of license plates from video [J].
Cui, YT ;
Huang, Q .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :502-507
[5]  
DENG D, 1994, IEEE IMAGE PROC, P940, DOI 10.1109/ICIP.1994.413707
[6]  
Gao J, 2001, PROC CVPR IEEE, P84
[7]  
Gonzalez R.C., 2007, DIGITAL IMAGE PROCES, V3rd
[8]  
HAMAMOTO Y, 1995, P 3 ICDAR, V2, P819
[9]  
HOU Q, 2001, P ICASSP, V3, P1517
[10]  
HUA XS, 2001, P INT C DOC AN REC, P545