A Bayesian framework for deformable pattern recognition with application to handwritten character recognition

被引:37
作者
Cheung, KW [1 ]
Yeung, DY [1 ]
Chin, RT [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci, Clear Water Bay, Peoples R China
关键词
deformable models; Bayesian inference; handwriting recognition; expectation-maximization; NIST database;
D O I
10.1109/34.735813
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deformable models have recently been proposed for many pattern recognition applications due to their ability to handle large shape variations. These proposed approaches represent patterns or shapes as deformable models, which deform themselves to match with the input image, and subsequently feed the extracted information into a classifier. The three components-modeling, matching, and classification-are often treated as independent tasks, in this paper, we study how to integrate deformable models into a Bayesian framework as a unified approach for modeling, matching, and classifying shapes. Handwritten character recognition serves as a testbed for evaluating the approach. With the use of our system, recognition is invariant to affine transformation as well as other handwriting variations. In addition, no preprocessing or manual setting of hyperparameters (e.g., regularization parameter and character width) is required. Besides, issues on the incorporation of constraints on model flexibility detection of subparts, and speed-up are investigated. Using a model set with only 23 prototypes without any discriminative training, we can achieve an accuracy of 94.7 percent with no rejection on a subset (11,791 images by 100 writers) of handwritten digits from the NIST SD-1 dataset.
引用
收藏
页码:1382 / 1388
页数:7
相关论文
共 9 条
[1]  
[Anonymous], P 2 INT C COMP VIS
[2]   Competitive mixture of deformable models for pattern classification [J].
Cheung, KW ;
Yeung, DY ;
Chin, RT .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :613-618
[3]  
CHEUNG KW, 1998, P 6 INT WORKSH FRONT
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]  
GEIST J, 1994, 2 CENS OPT CHAR REC
[6]   Representation and recognition of handwritten digits using deformable templates [J].
Jain, AK ;
Zongker, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (12) :1386-1391
[7]   A PRACTICAL BAYESIAN FRAMEWORK FOR BACKPROPAGATION NETWORKS [J].
MACKAY, DJC .
NEURAL COMPUTATION, 1992, 4 (03) :448-472
[8]   Using generative models for handwritten digit recognition [J].
Revow, M ;
Williams, CKI ;
Hinton, GE .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (06) :592-606
[9]   SHAPE-MATCHING USING LAT AND ITS APPLICATION TO HANDWRITTEN NUMERAL RECOGNITION [J].
WAKAHARA, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) :618-629