v1v2v3v4 (latest)

Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

1 February 2018

Papers citing "Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples"

50 / 1,982 papers shown

Rethinking Machine Unlearning for Large Language Models

...

Mohit Bansal

Yang Liu

428

200

13 Feb 2024

Accuracy of TextFooler black box adversarial attacks on 01 loss sign activation neural network ensemble

Yunzhe Xue

Usman Roshan

AAML

147

12 Feb 2024

Accelerated Smoothing: A Scalable Approach to Randomized Smoothing

336

12 Feb 2024

A Random Ensemble of Encrypted Vision Transformers for Adversarially Robust DefenseIEEE Access (IEEE Access), 2024

Ryota Iijima

Sayaka Shiota

Hitoshi Kiya

298

11 Feb 2024

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

...

359

732

06 Feb 2024

FINEST: Stabilizing Recommendations by Rank-Preserving Fine-TuningACM Transactions on Knowledge Discovery from Data (TKDD), 2024

Sejoon Oh

Berk Ustun

Julian McAuley

Srijan Kumar

182

05 Feb 2024

Unraveling the Key of Machine Learning Solutions for Android Malware Detection

188

05 Feb 2024

Your Diffusion Model is Secretly a Certifiably Robust Classifier

Yinpeng Dong

385

04 Feb 2024

MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers

Somayeh Sojoudi

402

03 Feb 2024

Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance

Wenqi Wei

Ling Liu

361

02 Feb 2024

Tropical Decision Boundaries for Neural Networks Are Robust Against Adversarial Attacks

322

01 Feb 2024

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization

367

29 Jan 2024

Securing Recommender System via Cooperative TrainingWorld wide web (Bussum) (WWW), 2023

Qingyang Wang

Chenwang Wu

Defu Lian

Enhong Chen

AAML

218

23 Jan 2024

Robustness to distribution shifts of compressed networks for edge devices

175

22 Jan 2024

How Robust Are Energy-Based Models Trained With Equilibrium Propagation?

263

21 Jan 2024

Adversarial Examples are Misaligned in Diffusion Model ManifoldsIEEE International Joint Conference on Neural Network (IJCNN), 2024

463

12 Jan 2024

Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial RobustnessComputer Vision and Pattern Recognition (CVPR), 2024

345

09 Jan 2024

Calibration Attacks: A Comprehensive Study of Adversarial Attacks on Model Confidence

265

05 Jan 2024

Adversarial Attacks on Image Classification Models: Analysis and Defense

161

28 Dec 2023

BlackboxBench: A Comprehensive Benchmark of Black-box Adversarial Attacks

415

28 Dec 2023

ARBiBench: Benchmarking Adversarial Robustness of Binarized Neural Networks

336

21 Dec 2023

The Ultimate Combo: Boosting Adversarial Example Transferability by Composing Data Augmentations

214

18 Dec 2023

Exploring Transferability for Randomized Smoothing

152

14 Dec 2023

May the Noise be with you: Adversarial Training without Adversarial Examples

102

12 Dec 2023

MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness

609

08 Dec 2023

On the Robustness of Large Multimodal Models Against Image Adversarial Attacks

Ser-Nam Lim

274

06 Dec 2023

A Simple Framework to Enhance the Adversarial Robustness of Deep Learning-based Intrusion Detection SystemComputers & security (CS), 2023

188

06 Dec 2023

Generating Visually Realistic Adversarial Patch

Xiaosen Wang

Kunyu Wang

AAML

207

05 Dec 2023

Singular Regularization with Information Bottleneck Improves Model's Adversarial Robustness

Man Zhou

139

04 Dec 2023

Adversarial Medical Image with Hierarchical Feature HidingIEEE Transactions on Medical Imaging (TMI), 2023

274

04 Dec 2023

Topology-Preserving Adversarial Training

Peng Li

Yang Liu

281

29 Nov 2023

Improving the Robustness of Transformer-based Large Language Models with Dynamic AttentionNetwork and Distributed System Security Symposium (NDSS), 2023

Yuwen Pu

Xuhong Zhang

187

29 Nov 2023

RADAP: A Robust and Adaptive Defense Against Diverse Adversarial Patches on Face Recognition

Jian Zhao

172

29 Nov 2023

Efficient Key-Based Adversarial Defense for ImageNet by Using Pre-trained ModelIEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023

AprilPyone Maungmaung

Isao Echizen

Hitoshi Kiya

VLM AAML

181

28 Nov 2023

On robust overfitting: adversarial training induced distribution matters

Runzhi Tian

Yongyi Mao

OOD

292

28 Nov 2023

Instruct2Attack: Language-Guided Semantic Adversarial Attacks

Yuxiang Guo

223

27 Nov 2023

Adversarial Purification of Information Masking

200

26 Nov 2023

Mixing Classifiers to Alleviate the Accuracy-Robustness Trade-OffConference on Learning for Dynamics & Control (L4DC), 2023

Yatong Bai

Brendon G. Anderson

Somayeh Sojoudi

AAML

288

26 Nov 2023

Adversarial defense based on distribution transfer

Jiahao Chen

Diqun Yan

Li Dong

187

23 Nov 2023

Explaining high-dimensional text classifiers

Odelia Melamed

Rich Caruana

186

22 Nov 2023

Fast Certification of Vision-Language Models Using Incremental Randomized Smoothing

S Sarkar

343

15 Nov 2023

Adversarially Robust Spiking Neural Networks Through Conversion

Ozan Özdenizci

Robert Legenstein

AAML

363

15 Nov 2023

On The Relationship Between Universal Adversarial Attacks And Sparse RepresentationsIEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023

Dana Weitzner

Raja Giryes

AAML

278

14 Nov 2023

Towards Improving Robustness Against Common Corruptions in Object Detectors Using Adversarial Contrastive Learning

Shashank Kotyan

Danilo Vasconcellos Vargas

AAML

207

14 Nov 2023

Upper and lower bounds for the Lipschitz constant of random neural networks

491

02 Nov 2023

Intriguing Properties of Data Attribution on Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023

393

01 Nov 2023

Exploring Geometry of Blind Spots in Vision ModelsNeural Information Processing Systems (NeurIPS), 2023

S. Balasubramanian

Gaurang Sriramanan

Vinu Sankar Sadasivan

Soheil Feizi

AAML

222

30 Oct 2023

Adversarial Attacks and Defenses in Large Language Models: Old and New Threats

Stephan Günnemann

236

30 Oct 2023

Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game PerspectiveNeural Information Processing Systems (NeurIPS), 2023

282

30 Oct 2023

Adversarial Examples Are Not Real FeaturesNeural Information Processing Systems (NeurIPS), 2023

629

29 Oct 2023