arXiv:1905.02175
Adversarial Examples Are Not Bugs, They Are Features
Neural Information Processing Systems (NeurIPS), 2019
6 May 2019
Andrew Ilyas
Shibani Santurkar
Dimitris Tsipras
Logan Engstrom
Brandon Tran
Aleksander Madry
SILM
Papers citing
"Adversarial Examples Are Not Bugs, They Are Features"
50 / 1,093 papers shown
Cross-Modal Conceptualization in Bottleneck Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Danis Alukaev
S. Kiselev
Ilya Pershin
Bulat Ibragimov
Vladimir Ivanov
Alexey Kornaev
Ivan Titov
268
9
0
23 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
476
230
0
16 Oct 2023
Regularization properties of adversarially-trained linear regression
Neural Information Processing Systems (NeurIPS), 2023
Antônio H. Ribeiro
Dave Zachariah
Francis Bach
Thomas B. Schön
AAML
268
19
0
16 Oct 2023
Black-box Targeted Adversarial Attack on Segment Anything (SAM)
Sheng Zheng
Chaoning Zhang
Xinhong Hao
AAML
406
12
0
16 Oct 2023
Is Certifying $\ell_p$ Robustness Still Worthwhile?
Ravi Mangal
Klas Leino
Zifan Wang
Kai Hu
Weicheng Yu
Corina S. Pasareanu
Anupam Datta
Matt Fredrikson
AAML
OOD
250
1
0
13 Oct 2023
Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning
Neural Information Processing Systems (NeurIPS), 2023
Yihua Zhang
Yimeng Zhang
Chenyi Zi
Jinghan Jia
Jiancheng Liu
Gaowen Liu
Min-Fong Hong
Shiyu Chang
Sijia Liu
AAML
383
14
0
13 Oct 2023
Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration
Ziqi Wen
Tianqin Li
Zhi Jing
Tai Sing Lee
OOD
343
1
0
11 Oct 2023
GraphCloak: Safeguarding Task-specific Knowledge within Graph-structured Data from Unauthorized Exploitation
Yixin Liu
Chenrui Fan
Xun Chen
Pan Zhou
Lichao Sun
217
4
0
11 Oct 2023
Investigating the Adversarial Robustness of Density Estimation Using the Probability Flow ODE
Marius Arvinte
Cory Cornelius
Jason Martin
N. Himayat
DiffM
294
5
0
10 Oct 2023
AttributionLab: Faithfulness of Feature Attribution Under Controllable Environments
Yang Zhang
Yawei Li
Hannah Brown
Mina Rezaei
B. Bischl
Juil Sock
Ashkan Khakzar
Kenji Kawaguchi
OOD
261
3
0
10 Oct 2023
Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift
International Conference on Learning Representations (ICLR), 2023
Yihao Xue
Siddharth Joshi
Dang Nguyen
Baharan Mirzasoleiman
VLM
249
5
0
08 Oct 2023
Robustness-enhanced Uplift Modeling with Adversarial Feature Desensitization
Industrial Conference on Data Mining (IDM), 2023
Zexu Sun
Bowei He
Ming Ma
Jiakai Tang
Yuchen Wang
Chen Ma
Dugang Liu
340
6
0
07 Oct 2023
Generating Less Certain Adversarial Examples Improves Robust Generalization
Minxing Zhang
Michael Backes
Xiao Zhang
AAML
557
1
0
06 Oct 2023
LLM Lies: Hallucinations are not Bugs, but Features as Adversarial Examples
Jia-Yu Yao
Hai-Jian Ke
Zhen-Hui Liu
Munan Ning
Li Yuan
HILM
LRM
AAML
398
271
0
02 Oct 2023
A Survey of Robustness and Safety of 2D and 3D Deep Learning Models Against Adversarial Attacks
ACM Computing Surveys (ACM Comput. Surv.), 2023
Yanjie Li
Bin Xie
Songtao Guo
Yuanyuan Yang
Bin Xiao
AAML
273
36
0
01 Oct 2023
On Continuity of Robust and Accurate Classifiers
Ramin Barati
Reza Safabakhsh
Mohammad Rahmati
AAML
368
1
0
29 Sep 2023
Investigating Human-Identifiable Features Hidden in Adversarial Perturbations
Dennis Y. Menn
Tzu-hsun Feng
Sriram Vishwanath
Hung-yi Lee
AAML
171
0
0
28 Sep 2023
On the Computational Entanglement of Distant Features in Adversarial Machine Learning
Yen-Lung Lai
Xingbo Dong
Zhe Jin
AAML
468
0
0
27 Sep 2023
Improving Robustness of Deep Convolutional Neural Networks via Multiresolution Learning
Hongyan Zhou
Yao Liang
OOD
272
0
0
24 Sep 2023
Toward a Deeper Understanding: RetNet Viewed through Convolution
Pattern Recognition (Pattern Recogn.), 2023
Chenghao Li
Chaoning Zhang
ViT
256
16
0
11 Sep 2023
Good-looking but Lacking Faithfulness: Understanding Local Explanation Methods through Trend-based Testing
Conference on Computer and Communications Security (CCS), 2023
Jinwen He
Kai Chen
Guozhu Meng
Jiangshan Zhang
Congyi Li
FAtt
AAML
263
4
0
09 Sep 2023
Exploring Robust Features for Improving Adversarial Robustness
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2023
Hong Wang
Yuefan Deng
Shinjae Yoo
Lu Ma
AAML
337
5
0
09 Sep 2023
Robust Adversarial Defense by Tensor Factorization
International Conference on Machine Learning and Applications (ICMLA), 2023
Manish Bhattarai
M. C. Kaymak
Ryan Barron
Ben Nebgen
Kim Ø. Rasmussen
Boian Alexandrov
AAML
184
2
0
03 Sep 2023
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash
Anna Bialas
Weiwei Pan
Finale Doshi-Velez
AAML
215
16
0
01 Sep 2023
Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff
IEEE International Conference on Computer Vision (ICCV), 2023
Satoshi Suzuki
Shin'ya Yamaguchi
Shoichiro Takeda
Sekitoshi Kanai
Naoki Makishima
Atsushi Ando
Ryo Masumura
AAML
274
7
0
31 Aug 2023
Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models
Computer Vision and Pattern Recognition (CVPR), 2023
Takami Sato
Justin Yue
Nanze Chen
Ningfei Wang
Qi Alfred Chen
DiffM
226
7
0
30 Aug 2023
Advancing Adversarial Robustness Through Adversarial Logit Update
Hao Xuan
Peican Zhu
Xingyu Li
AAML
275
0
0
29 Aug 2023
On-Manifold Projected Gradient Descent
Aaron Mahler
Tyrus Berry
Thomas Stephens
Harbir Antil
Michael Merritt
Jeanie Schreiber
Ioannis G. Kevrekidis
AAML
218
0
0
23 Aug 2023
RemovalNet: DNN Fingerprint Removal Attacks
IEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2023
Hongwei Yao
Zhengguang Li
Kunzhe Huang
Jian Lou
Zhan Qin
Kui Ren
MLAU
AAML
260
4
0
23 Aug 2023
Adversarial Illusions in Multi-Modal Embeddings
USENIX Security Symposium (USENIX Security), 2023
Tingwei Zhang
Rishi Jha
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
805
28
0
22 Aug 2023
Spurious Correlations and Where to Find Them
Gautam Sreekumar
Vishnu Boddeti
CML
222
7
0
21 Aug 2023
Measuring the Effect of Causal Disentanglement on the Adversarial Robustness of Neural Network Models
International Conference on Information and Knowledge Management (CIKM), 2023
Preben Ness
D. Marijan
Sunanda Bose
CML
171
0
0
21 Aug 2023
HoSNN: Adversarially-Robust Homeostatic Spiking Neural Networks with Adaptive Firing Thresholds
Hejia Geng
Peng Li
AAML
413
4
0
20 Aug 2023
Backdoor Mitigation by Correcting the Distribution of Neural Activations
Xi Li
Zhen Xiang
David J. Miller
G. Kesidis
AAML
162
0
0
18 Aug 2023
Balancing Transparency and Risk: The Security and Privacy Risks of Open-Source Machine Learning Models
Dominik Hintersdorf
Lukas Struppek
Kristian Kersting
SILM
162
6
0
18 Aug 2023
Not So Robust After All: Evaluating the Robustness of Deep Neural Networks to Unseen Adversarial Attacks
R. Garaev
Bader Rasheed
Adil Mehmood Khan
AAML
OOD
79
3
0
12 Aug 2023
Fixed Inter-Neuron Covariability Induces Adversarial Robustness
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muhammad Ahmed Shah
Bhiksha Raj
AAML
99
0
0
07 Aug 2023
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models
IEEE International Conference on Computer Vision (ICCV), 2023
Indranil Sur
Karan Sikka
Matthew Walmer
K. Koneripalli
Anirban Roy
Xiaoyu Lin
Ajay Divakaran
Susmit Jha
156
12
0
07 Aug 2023
A reading survey on adversarial machine learning: Adversarial attacks and their understanding
Shashank Kotyan
AAML
169
11
0
07 Aug 2023
Unsupervised Adversarial Detection without Extra Model: Training Loss Should Change
Chien Cheng Chyou
Hung-Ting Su
Winston H. Hsu
AAML
94
3
0
07 Aug 2023
FROD: Robust Object Detection for Free
Muhammad Awais
Weiming Zhuang
Lingjuan Lyu
Sung-Ho Bae
ObjD
185
2
0
03 Aug 2023
Training on Foveated Images Improves Robustness to Adversarial Attacks
Neural Information Processing Systems (NeurIPS), 2023
Muhammad Ahmed Shah
Bhiksha Raj
AAML
208
6
0
01 Aug 2023
An Exact Kernel Equivalence for Finite Classification Models
Brian Bell
Michaela Geyer
David Glickenstein
Amanda Fernandez
Juston Moore
280
4
0
01 Aug 2023
Transferable Attack for Semantic Segmentation
Mengqi He
Jing Zhang
Zhaoyuan Yang
Mingyi He
Nick Barnes
Yuchao Dai
226
2
0
31 Jul 2023
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
647
2,367
0
27 Jul 2023
NSA: Naturalistic Support Artifact to Boost Network Confidence
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Abhijith Sharma
Phil Munz
Apurva Narayan
AAML
212
1
0
27 Jul 2023
Towards Generic and Controllable Attacks Against Object Detection
Guopeng Li
Yue Xu
Jian Ding
Guisong Xia
AAML
273
8
0
23 Jul 2023
Fast Adaptive Test-Time Defense with Robust Features
Anurag Singh
Mahalakshmi Sabanayagam
Krikamol Muandet
Debarghya Ghoshdastidar
AAML
TTA
OOD
185
0
0
21 Jul 2023
A Holistic Assessment of the Reliability of Machine Learning Systems
Anthony Corso
David Karamadian
Romeo Valentin
Mary Cooper
Mykel J. Kochenderfer
316
10
0
20 Jul 2023
On the Robustness of Split Learning against Adversarial Attacks
European Conference on Artificial Intelligence (ECAI), 2023
Mingyuan Fan
Cen Chen
Chengyu Wang
Wenmeng Zhou
Yanjie Liang
AAML
171
12
0
16 Jul 2023
Page 6 of 22