Recognition of CAPTCHA Characters by Supervised Machine Learning Algorithms

被引:17
作者
Bostik, Ondrej [1 ]
Klecka, Jan [1 ]
机构
[1] Brno Univ Technol, Dept Control & Instrumentat, Brno, Czech Republic
来源
IFAC PAPERSONLINE | 2018年 / 51卷 / 06期
关键词
CAPTCHA; OCR; Supervised Learning; Template Matching; Decision Trees; k-NN; SVM; Neural Network;
D O I
10.1016/j.ifacol.2018.07.155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The focus of this paper is to compare several common machine learning classification algorithms for Optical Character Recognition of CAPTCHA codes. The main part of a research focuses on the comparative study of Neural Networks, k-Nearest Neighbour, Support Vector Machines and Decision Trees implemented in MATLAB Computing environment. Achieved success rates of all analyzed algorithms overcome 89%. The main difference in results of used algorithms is within the learning times. Based on the data found, it is possible to choose the right algorithm for the particular task. (C) 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
引用
收藏
页码:208 / 213
页数:6
相关论文
共 23 条
[1]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[2]  
[Anonymous], 1998, PSYCHODIAGNOSTICS DI
[3]  
Bishop Christopher M, 2016, Pattern recognition and machine learning
[4]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[5]  
Bostik Ondrejb, 2017, MENDEL 2017, V23, P57
[6]  
Bursztein E, 2011, PROCEEDINGS OF THE 18TH ACM CONFERENCE ON COMPUTER & COMMUNICATIONS SECURITY (CCS 11), P125
[7]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[8]  
Deb S, 2004, 18TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1 (LONG PAPERS), PROCEEDINGS, P59
[9]  
Dietterich Thomas G., 1994, J ARTIF INTELL RES
[10]  
Horak K, 2010, TSP 2010: 33RD INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, P204