A fast recognition system for isolated Arabic characters

被引:13
作者
Cowell, J [1 ]
Hussain, F [1 ]
机构
[1] De Montfort Univ, Dept Comp Sci, Leicester LE1 9BH, Leics, England
来源
SIXTH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS | 2002年
关键词
Arabic; fonts; normalisation; OCR; pattern recognition; confusion matrix; image signatures;
D O I
10.1109/IV.2002.1028844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a very fast mufti-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. The approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. This technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular Confusion Matrix.
引用
收藏
页码:650 / 654
页数:5
相关论文
共 29 条
[1]   RECOGNITION OF HANDWRITTEN CURSIVE ARABIC CHARACTERS [J].
ABUHAIBA, ISI ;
MAHMOUD, SA ;
GREEN, RJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) :664-672
[2]   SURVEY AND BIBLIOGRAPHY OF ARABIC OPTICAL TEXT RECOGNITION [J].
ALBADR, B ;
MAHMOUD, SA .
SIGNAL PROCESSING, 1995, 41 (01) :49-77
[3]   ONLINE RECOGNITION OF HANDWRITTEN ARABIC CHARACTERS [J].
ALEMAMI, S ;
USHER, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (07) :704-710
[4]  
ALYOUSEFI, 1992, IEEE T PATTERN ANAL, V14
[5]   Off-line Arabic character recognition: The state of the art [J].
Amin, A .
PATTERN RECOGNITION, 1998, 31 (05) :517-530
[6]  
Amin A, 1994, P 12 IAPR INT C PATT, V2
[7]   A heuristic algorithm for optical character recognition of Arabic script [J].
Atici, AA ;
YarmanVural, FT .
SIGNAL PROCESSING, 1997, 62 (01) :87-99
[8]   A HIGH-ACCURACY ALGORITHM FOR RECOGNITION OF HANDWRITTEN NUMERALS [J].
BAPTISTA, G ;
KULKARNI, KM .
PATTERN RECOGNITION, 1988, 21 (04) :287-291
[9]   An omnifont open-vocabulary OCR system for English and Arabic [J].
Bazzi, I ;
Schwartz, R ;
Makhoul, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (06) :495-504
[10]  
COWELL J, CGIM2000 COMP GRAPH