Visual Methods for Analyzing Probabilistic Classification Data

被引:80
作者
Alsallakh, Bilal [1 ]
Hanbury, Allan [1 ]
Hauser, Helwig [2 ]
Miksch, Silvia [1 ]
Rauber, Andreas [1 ]
机构
[1] Vienna Univ Technol, Vienna, Austria
[2] Univ Bergen, N-5020 Bergen, Norway
关键词
Probabilistic classification; confusion analysis; feature evaluation and selection; visual inspection; COMBINING MULTIPLE CLASSIFIERS; LAND-COVER CLASSIFICATIONS; VISUALIZATION; SEPARATION;
D O I
10.1109/TVCG.2014.2346660
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-class classifiers often compute scores for the classification samples describing probabilities to belong to different classes. In order to improve the performance of such classifiers, machine learning experts need to analyze classification results for a large number of labeled samples to find possible reasons for incorrect classification. Confusion matrices are widely used for this purpose. However, they provide no information about classification scores and features computed for the samples. We propose a set of integrated visual methods for analyzing the performance of probabilistic classifiers. Our methods provide insight into different aspects of the classification results for a large number of samples. One visualization emphasizes at which probabilities these samples were classified and how these probabilities correlate with classification error in terms of false positives and false negatives. Another view emphasizes the features of these samples and ranks them by their separation power between selected true and false classifications. We demonstrate the insight gained using our technique in a benchmarking classification dataset, and show how it enables improving classification performance by interactively defining and evaluating post-classification rules.
引用
收藏
页码:1703 / 1712
页数:10
相关论文
共 43 条
  • [1] Radial Sets: Interactive Visual Analysis of Large Overlapping Sets
    Alsallakh, Bilal
    Aigner, Wolfgang
    Miksch, Silvia
    Hauser, Helwig
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) : 2496 - 2505
  • [2] Reinventing the Contingency Wheel: Scalable Visual Analytics of Large Categorical Data
    Alsallakh, Bilal
    Aigner, Wolfgang
    Miksch, Silvia
    Groeller, M. Eduard
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (12) : 2849 - 2858
  • [3] An L∞ Norm Visual Classifier
    Anand, Anushka
    Wilkinson, Leland
    Dang Nhon Tuan
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 687 - +
  • [4] Ankerst M., 1999, P 5 ACM SIGKDD INT C, P392, DOI DOI 10.1145/312129.312298
  • [5] [Anonymous], DATA MINING KNOWLEDG
  • [6] Bache K., UCI machine learning repository
  • [7] Assisted Descriptor Selection Based on Visual Comparative Data Analysis
    Bremm, Sebastian
    von Landesberger, Tatiana
    Bernard, Juergen
    Schreck, Tobias
    [J]. COMPUTER GRAPHICS FORUM, 2011, 30 (03) : 891 - 900
  • [8] Brown ET, 2012, IEEE CONF VIS ANAL, P83, DOI 10.1109/VAST.2012.6400486
  • [9] Caragea D, 2008, LECT NOTES COMPUT SC, V4404, P136, DOI 10.1007/978-3-540-71080-6_10
  • [10] Incorporating domain knowledge and spatial relationships into land cover classifications: a rule-based approach
    Daniels, Amy E.
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2006, 27 (14) : 2949 - 2975