Training set size requirements for the classification of a specific class

Cited by: 259
Authors
Foody, Giles M. [1]
Mathur, Ajay
Sanchez-Hernandez, Carolina
Boyd, Doreen S.
Affiliations
[1] Univ Southampton, Sch Geog, Southampton SO17 1BJ, Hants, England
[2] Punjab Remote Sensing Ctr, Ludhiana 141004, Punjab, India
[3] Ordnance Survey, Res & Innovat, Southampton SO16 4GU, Hants, England
[4] Bournemouth Univ, Sch Conservat Sci, Poole BH12 5BB, Dorset, England
Keywords
classification; training set; support vector machine (SVM); support vector data description (SVDD)
DOI
10.1016/j.rse.2006.03.004
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
The design of the training stage of a supervised classification should account for the properties of the classifier to be used. Consideration of the way the classifier operates may enable the training stage to be designed in a manner which ensures that the aim of the classification is satisfied with the use of a small, inexpensive training set. It may, therefore, be possible to reduce the training set size requirements from that generally expected with the use of standard heuristics. Substantial reductions in training set size may be possible if interest is focused on a single class. This is illustrated for mapping cotton in north-western India by support vector machine type classifiers. Four approaches to reducing training set size were used: intelligent selection of the most informative training samples, selective class exclusion, acceptance of imprecise descriptions for spectrally distinct classes, and the adoption of a one-class classifier. All four approaches were able to reduce the training set size required considerably below that suggested by conventional, widely used heuristics without significant impact on the accuracy with which the class of interest was classified. For example, reductions in training set size of approximately 90% from that suggested by a conventional heuristic are reported, with the accuracy of cotton classification remaining nearly constant at approximately 95% and 97% from the user's and producer's perspectives, respectively. © 2006 Elsevier Inc. All rights reserved.
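The abstract reports accuracy for the class of interest from the user's and the producer's perspectives. As a minimal sketch of how these two per-class measures are conventionally derived from a confusion matrix, the Python below uses a hypothetical helper `class_accuracies` and invented counts. Neither the function nor the numbers come from the paper; the counts were chosen only to land near the reported magnitudes.

```python
def class_accuracies(confusion, class_index):
    """Return (user's, producer's) accuracy for one class.

    Assumed layout: confusion[i][j] is the number of samples
    predicted as class i whose reference (ground) class is j.
    """
    correct = confusion[class_index][class_index]
    # User's accuracy: of everything mapped as the class, how much is right.
    predicted_total = sum(confusion[class_index])
    # Producer's accuracy: of the reference samples of the class, how much was found.
    reference_total = sum(row[class_index] for row in confusion)
    return correct / predicted_total, correct / reference_total


# Hypothetical two-class matrix (index 0 = cotton, index 1 = all other classes).
# These counts are illustrative only and are not data from the study.
confusion = [
    [95, 5],   # predicted cotton
    [3, 97],   # predicted other
]

users, producers = class_accuracies(confusion, 0)
print(f"user's accuracy: {users:.3f}, producer's accuracy: {producers:.3f}")
```

Because both measures depend only on the row and column of the class of interest, they can be tracked while the training set is reduced, without needing an accurate description of every other class.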
Pages: 1-14
Page count: 14