FASText: Efficient Unconstrained Scene Text Detector

被引:97
作者
Busta, Michal [1 ]
Neumann, Lukas [1 ]
Matas, Jiri [1 ]
机构
[1] Czech Tech Univ, Ctr Machine Percept, Dept Cybernet, Prague, Czech Republic
来源
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年
关键词
D O I
10.1109/ICCV.2015.143
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
We propose a novel easy-to-implement stroke detector based on an efficient pixel intensity comparison to surrounding pixels. Stroke-specific keypoints are efficiently detected and text fragments are subsequently extracted by local thresholding guided by keypoint properties. Classification based on effectively calculated features then eliminates non-text regions. The stroke-specific keypoints produce 2 times less region segmentations and still detect 25% more characters than the commonly exploited MSER detector and the process is 4 times faster. After a novel efficient classification step, the number of regions is reduced to 7 times less than the standard method and is still almost 3 times faster. All stages of the proposed pipeline are scale-and rotation-invariant and support a wide variety of scripts ( Latin, Hebrew, Chinese, etc.) and fonts. When the proposed detector is plugged into a scene text localization and recognition pipeline, a state-of-the-art text localization accuracy is maintained whilst the processing time is significantly reduced.
引用
收藏
页码:1206 / 1214
页数:9
相关论文
共 31 条
[1]
[Anonymous], 2012, COMPUTER VISION PATT
[2]
[Anonymous], 2009, VISAPP
[3]
[Anonymous], 2013, ICCV
[4]
GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES [J].
BALLARD, DH .
PATTERN RECOGNITION, 1981, 13 (02) :111-122
[6]
Epshtein B., CVPR 2010, P2963
[7]
Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[8]
Huang WL, 2014, LECT NOTES COMPUT SC, V8692, P497, DOI 10.1007/978-3-319-10593-2_33
[9]
Jaderberg M, 2014, LECT NOTES COMPUT SC, V8692, P512, DOI 10.1007/978-3-319-10593-2_34
[10]
Orientation Robust Text Line Detection in Natural Images [J].
Kang, Le ;
Li, Yi ;
Doermann, David .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :4034-4041