READING CHESS

Cited by: 23
Authors
BAIRD, HS
THOMPSON, K
Affiliation
[1] AT&T Bell Laboratories, Murray Hill, NJ 07974
Keywords
Character recognition; chess; document image analysis; layout analysis; semantics
DOI
10.1109/34.56191
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In an application of semantic analysis to images of extended passages of text, several volumes of a chess encyclopedia have been read with high accuracy. Although carefully proofread, the books were poorly printed and posed a severe challenge to conventional page layout analysis and character-recognition methods. An experimental page reader system carried out strictly top-down layout analysis for identification of columns, lines, words, and characters. This proceeded rapidly and reliably thanks to a recently-developed skew-estimation technique. Resegmentation of broken, touching, and dirty characters was handled in an efficient and integrated manner by a heuristic search operating on isolated words. By analyzing the syntax of game descriptions and applying the rules of chess, the error rate was reduced by a factor of 30 from what was achievable through shape analysis alone. Of the games with no typographical errors, 98% have been assigned a legal interpretation, for an effective success rate of 99.995% on approximately one million characters (2850 games, 945 pages). We discuss several computer vision systems-integration issues suggested by this experience. © 1990 IEEE
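The abstract's central idea, using the rules of chess to reject readings that shape analysis alone would accept, can be illustrated with a deliberately tiny sketch. This is not the authors' system: the function names, the candidate format, and the restriction to a single knight moving on an empty board are all assumptions made for illustration. Each move token has several recognizer candidates with shape scores, and only sequences in which every move is legal are kept.

```python
from itertools import product

def knight_move_ok(src: str, dst: str) -> bool:
    """Toy legality rule (assumption): a lone knight on an empty board."""
    dx = abs(ord(src[0]) - ord(dst[0]))   # file distance, e.g. 'g' -> 'f' is 1
    dy = abs(int(src[1]) - int(dst[1]))   # rank distance
    return {dx, dy} == {1, 2}             # L-shaped jump

def best_legal_reading(start, candidates):
    """Pick the highest-scoring candidate sequence whose moves are all legal.

    candidates: one list per move token, each holding (square, shape_score)
    pairs as a hypothetical recognizer might report them.
    Returns the list of squares, or None if no legal sequence exists.
    """
    best, best_score = None, float("-inf")
    for combo in product(*candidates):     # exhaustive; fine for a toy example
        pos, legal = start, True
        for square, _ in combo:
            if not knight_move_ok(pos, square):
                legal = False
                break
            pos = square
        score = sum(s for _, s in combo)
        if legal and score > best_score:
            best, best_score = [sq for sq, _ in combo], score
    return best

# Shape analysis alone prefers "f8" (a plausible 3/8 confusion), but a knight
# on g1 cannot reach f8, so the legal reading "f3" is chosen instead.
reading = best_legal_reading(
    "g1",
    [[("f8", 0.6), ("f3", 0.4)], [("e5", 0.9), ("e6", 0.1)]],
)
print(reading)
```

A real reader would need full move generation and the notation grammar of the encyclopedia, and would search far more selectively than this exhaustive product; the sketch only shows why a legality check can overrule the top-scoring shape candidate.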
Pages: 552-559
Page count: 8