One Pixel Attack for Fooling Deep Neural Networks

被引:1257
作者
Su, Jiawei [1 ]
Vargas, Danilo Vasconcellos [1 ]
Sakurai, Kouichi [1 ,2 ]
机构
[1] Kyushu Univ, Fac Informat Sci & Elect Engn, Grad Sch, Fukuoka, Fukuoka 8190395, Japan
[2] Adv Telecommun Res Inst Int, Kyoto, Japan
基金
日本科学技术振兴机构;
关键词
Perturbation methods; Neural networks; Robustness; Image color analysis; Image recognition; Additives; Convolutional neural network; differential evolution (DE); image recognition; information security; DIFFERENTIAL EVOLUTION; ADAPTATION; STRATEGY;
D O I
10.1109/TEVC.2019.2890858
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research has revealed that the output of deep neural networks (DNNs) can be easily altered by adding relatively small perturbations to the input vector. In this paper, we analyze an attack in an extremely limited scenario where only one pixel can be modified. For that we propose a novel method for generating one-pixel adversarial perturbations based on differential evolution (DE). It requires less adversarial information (a black-box attack) and can fool more types of networks due to the inherent features of DE. The results show that 67.97% of the natural images in Kaggle CIFAR-10 test dataset and 16.04% of the ImageNet (ILSVRC 2012) test images can be perturbed to at least one target class by modifying just one pixel with 74.03% and 22.91% confidence on average. We also show the same vulnerability on the original CIFAR-10 dataset. Thus, the proposed attack explores a different take on adversarial machine learning in an extreme limited scenario, showing that current DNNs are also vulnerable to such low dimension attacks. Besides, we also illustrate an important application of DE (or broadly speaking, evolutionary computation) in the domain of adversarial machine learning: creating tools that can effectively generate low-cost adversarial attacks against neural networks for evaluating robustness.
引用
收藏
页码:828 / 841
页数:14
相关论文
共 61 条
  • [1] Deriving and Improving CMA-ES with Information Geometric Trust Regions
    Abdolmaleki, Abbas
    Price, Bob
    Lau, Nuno
    Reis, Luis Paulo
    Neumann, Gerhard
    [J]. PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 657 - 664
  • [2] Compaction for Code Fragment Based Learning Classifier Systems
    Alvarez, Isidro M.
    Browne, Will N.
    Zhang, Mengjie
    [J]. ARTIFICIAL LIFE AND COMPUTATIONAL INTELLIGENCE, ACALCI 2016, 2016, 9592 : 41 - 53
  • [3] Alzantot M., 2018, NIPS 2017 Machine Deception Workshop
  • [4] Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
  • [5] [Anonymous], P 3 INT C LEARNING R
  • [6] [Anonymous], 2015, ARXIV150702379
  • [7] [Anonymous], REP
  • [8] [Anonymous], 2017, ARXIV170509552
  • [9] [Anonymous], 2017, CoRR
  • [10] [Anonymous], P ICML WORKSH DEEP L