Unmasking Clever Hans predictors and assessing what machines really learn

被引：605

作者：

Lapuschkin, Sebastian ^{[1
]}

Waeldchen, Stephan ^{[2
]}

Binder, Alexander ^{[3
]}

Montavon, Gregoire ^{[2
]}

Samek, Wojciech ^{[1
]}

Mueller, Klaus-Robert ^{[2
,4
,5
]}

机构：

[1] Fraunhofer Heinrich Hertz Inst, Dept Video Coding & Analyt, Einsteinufer 37, D-10587 Berlin, Germany

[2] Tech Univ Berlin, Dept Elect Engn & Comp Sci, Marchstr 23, D-10587 Berlin, Germany

[3] Singapore Univ Technol & Design, ISTD Pillar, 8 Somapah Rd, Singapore 487372, Singapore

[4] Korea Univ, Dept Brain & Cognit Engn, Seoul 136713, South Korea

[5] Max Planck Inst Informat, Campus E1 4, D-66123 Saarbrucken, Germany

来源：

NATURE COMMUNICATIONS | 2019年 / 10卷 / 1期

关键词：

DEEP NEURAL-NETWORKS; ARTIFICIAL-INTELLIGENCE; CLASSIFICATION; GO; GAME;

D O I：

10.1038/s41467-019-08987-4

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.

引用

页数：8

共 52 条

[1] Comparing Statistical Methods for Constructing Large Scale Gene Networks
Allen, Jeffrey D.
Xie, Yang
Chen, Min
Girard, Luc
Xiao, Guanghua
[J]. PLOS ONE, 2012, 7 (01):
[2] [Anonymous], 2016, P ADV NEURAL INFORM
[3] [Anonymous], 2016, P ICML VIS DEEP LEAR
[4] [Anonymous], 2018, IEEE INT CONF HEALT, DOI DOI 10.1109/ICHI.2018.00025
[5] Unsupervised learning of invariant representations
Anselmi, Fabio
Leibo, Joel Z.
Rosasco, Lorenzo
Mutch, Jim
Tacchetti, Andrea
Poggio, Tomaso
[J]. THEORETICAL COMPUTER SCIENCE, 2016, 633 : 112 - 121
[6] "What is relevant in a text document?": An interpretable machine learning approach
Arras, Leila
Horn, Franziska
Montavon, Gregoire
Mueller, Klaus-Robert
Samek, Wojciech
[J]. PLOS ONE, 2017, 12 (08):
[7] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
Bach, Sebastian
Binder, Alexander
Montavon, Gregoire
Klauschen, Frederick
Mueller, Klaus-Robert
Samek, Wojciech
[J]. PLOS ONE, 2015, 10 (07):
[8] Baehrens D, 2010, J MACH LEARN RES, V11, P1803
[9] Chen YF, 2017, IEEE INT C INT ROBOT, P1343, DOI 10.1109/IROS.2017.8202312
[10] Towards exact molecular dynamics simulations with machine-learned force fields
Chmiela, Stefan
Sauceda, Huziel E.
Mueller, Klaus-Robert
Tkatchenko, Alexandre
[J]. NATURE COMMUNICATIONS, 2018, 9

← 1 2 3 4 5 6 →