Unmasking Clever Hans predictors and assessing what machines really learn

被引:605
作者
Lapuschkin, Sebastian [1 ]
Waeldchen, Stephan [2 ]
Binder, Alexander [3 ]
Montavon, Gregoire [2 ]
Samek, Wojciech [1 ]
Mueller, Klaus-Robert [2 ,4 ,5 ]
机构
[1] Fraunhofer Heinrich Hertz Inst, Dept Video Coding & Analyt, Einsteinufer 37, D-10587 Berlin, Germany
[2] Tech Univ Berlin, Dept Elect Engn & Comp Sci, Marchstr 23, D-10587 Berlin, Germany
[3] Singapore Univ Technol & Design, ISTD Pillar, 8 Somapah Rd, Singapore 487372, Singapore
[4] Korea Univ, Dept Brain & Cognit Engn, Seoul 136713, South Korea
[5] Max Planck Inst Informat, Campus E1 4, D-66123 Saarbrucken, Germany
关键词
DEEP NEURAL-NETWORKS; ARTIFICIAL-INTELLIGENCE; CLASSIFICATION; GO; GAME;
D O I
10.1038/s41467-019-08987-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.
引用
收藏
页数:8
相关论文
共 52 条
  • [1] Comparing Statistical Methods for Constructing Large Scale Gene Networks
    Allen, Jeffrey D.
    Xie, Yang
    Chen, Min
    Girard, Luc
    Xiao, Guanghua
    [J]. PLOS ONE, 2012, 7 (01):
  • [2] [Anonymous], 2016, P ADV NEURAL INFORM
  • [3] [Anonymous], 2016, P ICML VIS DEEP LEAR
  • [4] [Anonymous], 2018, IEEE INT CONF HEALT, DOI DOI 10.1109/ICHI.2018.00025
  • [5] Unsupervised learning of invariant representations
    Anselmi, Fabio
    Leibo, Joel Z.
    Rosasco, Lorenzo
    Mutch, Jim
    Tacchetti, Andrea
    Poggio, Tomaso
    [J]. THEORETICAL COMPUTER SCIENCE, 2016, 633 : 112 - 121
  • [6] "What is relevant in a text document?": An interpretable machine learning approach
    Arras, Leila
    Horn, Franziska
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2017, 12 (08):
  • [7] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    Bach, Sebastian
    Binder, Alexander
    Montavon, Gregoire
    Klauschen, Frederick
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2015, 10 (07):
  • [8] Baehrens D, 2010, J MACH LEARN RES, V11, P1803
  • [9] Chen YF, 2017, IEEE INT C INT ROBOT, P1343, DOI 10.1109/IROS.2017.8202312
  • [10] Towards exact molecular dynamics simulations with machine-learned force fields
    Chmiela, Stefan
    Sauceda, Huziel E.
    Mueller, Klaus-Robert
    Tkatchenko, Alexandre
    [J]. NATURE COMMUNICATIONS, 2018, 9