ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.03115
16
0

LpL_pLp​-norm Distortion-Efficient Adversarial Attack

3 July 2024
Chao Zhou
Yuan-Gen Wang
Zi-Jia Wang
Xiangui Kang
ArXivPDFHTML
Abstract

Adversarial examples have shown a powerful ability to make a well-trained model misclassified. Current mainstream adversarial attack methods only consider one of the distortions among L0L_0L0​-norm, L2L_2L2​-norm, and L∞L_\inftyL∞​-norm. L0L_0L0​-norm based methods cause large modification on a single pixel, resulting in naked-eye visible detection, while L2L_2L2​-norm and L∞L_\inftyL∞​-norm based methods suffer from weak robustness against adversarial defense since they always diffuse tiny perturbations to all pixels. A more realistic adversarial perturbation should be sparse and imperceptible. In this paper, we propose a novel LpL_pLp​-norm distortion-efficient adversarial attack, which not only owns the least L2L_2L2​-norm loss but also significantly reduces the L0L_0L0​-norm distortion. To this aim, we design a new optimization scheme, which first optimizes an initial adversarial perturbation under L2L_2L2​-norm constraint, and then constructs a dimension unimportance matrix for the initial perturbation. Such a dimension unimportance matrix can indicate the adversarial unimportance of each dimension of the initial perturbation. Furthermore, we introduce a new concept of adversarial threshold for the dimension unimportance matrix. The dimensions of the initial perturbation whose unimportance is higher than the threshold will be all set to zero, greatly decreasing the L0L_0L0​-norm distortion. Experimental results on three benchmark datasets show that under the same query budget, the adversarial examples generated by our method have lower L0L_0L0​-norm and L2L_2L2​-norm distortion than the state-of-the-art. Especially for the MNIST dataset, our attack reduces 8.1%\%% L2L_2L2​-norm distortion meanwhile remaining 47%\%% pixels unattacked. This demonstrates the superiority of the proposed method over its competitors in terms of adversarial robustness and visual imperceptibility.

View on arXiv
Comments on this paper