2002.08347
Cited By
On Adaptive Attacks to Adversarial Example Defenses
19 February 2020
Florian Tramèr
Nicholas Carlini
Wieland Brendel
A. Madry
AAML
Papers citing
"On Adaptive Attacks to Adversarial Example Defenses"
50 / 540 papers shown
Title
AI Risk Management Should Incorporate Both Safety and Security
Xiangyu Qi
Yangsibo Huang
Yi Zeng
Edoardo Debenedetti
Jonas Geiping
...
Chaowei Xiao
Bo-wen Li
Dawn Song
Peter Henderson
Prateek Mittal
AAML
43
10
0
29 May 2024
Certifying Adapters: Enabling and Enhancing the Certification of Classifier Adversarial Robustness
Jieren Deng
Hanbin Hong
A. Palmer
Xin Zhou
Jinbo Bi
Kaleel Mahmood
Yuan Hong
Derek Aguiar
AAML
33
0
0
25 May 2024
Robust width: A lightweight and certifiable adversarial defense
Jonathan Peck
Bart Goossens
AAML
35
1
0
24 May 2024
Certifiably Robust RAG against Retrieval Corruption
Chong Xiang
Tong Wu
Zexuan Zhong
David Wagner
Danqi Chen
Prateek Mittal
SILM
25
41
0
24 May 2024
How Does Bayes Error Limit Probabilistic Robust Accuracy
Ruihan Zhang
Jun Sun
AAML
16
1
0
23 May 2024
Adversarial Training via Adaptive Knowledge Amalgamation of an Ensemble of Teachers
Shayan Mohajer Hamidi
Linfeng Ye
AAML
14
0
0
22 May 2024
Certified Robust Accuracy of Neural Networks Are Bounded due to Bayes Errors
Ruihan Zhang
Jun Sun
AAML
19
3
0
19 May 2024
Cross-Input Certified Training for Universal Perturbations
Changming Xu
Gagandeep Singh
AAML
19
2
0
15 May 2024
Evaluating Adversarial Robustness in the Spatial Frequency Domain
Keng-Hsin Liao
Chin-Yuan Yeh
Hsi-Wen Chen
Ming-Syan Chen
19
0
0
10 May 2024
Cutting through buggy adversarial example defenses: fixing 1 line of code breaks Sabre
Nicholas Carlini
AAML
24
1
0
06 May 2024
Uniformly Stable Algorithms for Adversarial Training and Beyond
Jiancong Xiao
Jiawei Zhang
Zhimin Luo
Asuman Ozdaglar
AAML
29
0
0
03 May 2024
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples
Antonio Emanuele Cinà
Jérôme Rony
Maura Pintor
Luca Demetrio
Ambra Demontis
Battista Biggio
Ismail Ben Ayed
Fabio Roli
ELM
AAML
SILM
44
6
0
30 Apr 2024
Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks
Yunzhen Feng
Tim G. J. Rudner
Nikolaos Tsilivis
Julia Kempe
AAML
BDL
35
1
0
27 Apr 2024
Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing
Song Xia
Yu Yi
Xudong Jiang
Henghui Ding
29
9
0
15 Apr 2024
PASA: Attack Agnostic Unsupervised Adversarial Detection using Prediction & Attribution Sensitivity Analysis
Dipkamal Bhusal
Md Tanvirul Alam
M. K. Veerabhadran
Michael Clifford
Sara Rampazzi
Nidhi Rastogi
AAML
31
1
0
12 Apr 2024
Persistent Classification: A New Approach to Stability of Data and Adversarial Examples
Brian Bell
Michael Geyer
David Glickenstein
Keaton Hamm
C. Scheidegger
Amanda S. Fernandez
Juston Moore
AAML
23
0
0
11 Apr 2024
Towards Robust Domain Generation Algorithm Classification
Arthur Drichel
Marc Meyer
Ulrike Meyer
AAML
26
3
0
09 Apr 2024
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
Maksym Andriushchenko
Francesco Croce
Nicolas Flammarion
AAML
81
157
0
02 Apr 2024
Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches
Lingxuan Wu
Xiao Yang
Yinpeng Dong
Liuwei Xie
Hang Su
Jun Zhu
AAML
35
2
0
31 Mar 2024
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Patrick Chao
Edoardo Debenedetti
Alexander Robey
Maksym Andriushchenko
Francesco Croce
...
Nicolas Flammarion
George J. Pappas
F. Tramèr
Hamed Hassani
Eric Wong
ALM
ELM
AAML
52
94
0
28 Mar 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento
Wenjie Feng
Chuan-Sheng Foo
Anh Tuan Luu
See-Kiong Ng
AAML
21
6
0
27 Mar 2024
Testing the Limits of Jailbreaking Defenses with the Purple Problem
Taeyoun Kim
Suhas Kotha
Aditi Raghunathan
AAML
36
6
0
20 Mar 2024
Counter-Samples: A Stateless Strategy to Neutralize Black Box Adversarial Attacks
Roey Bokobza
Yisroel Mirsky
AAML
16
0
0
14 Mar 2024
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
Lin Li
Haoyan Guan
Jianing Qiu
Michael W. Spratling
AAML
VLM
VPVLM
31
21
0
04 Mar 2024
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Xiaomeng Hu
Pin-Yu Chen
Tsung-Yi Ho
AAML
24
26
0
01 Mar 2024
How to Train your Antivirus: RL-based Hardening through the Problem-Space
Jacopo Cortellazzi
Ilias Tsingenopoulos
B. Bosanský
Simone Aonzo
Davy Preuveneers
Wouter Joosen
Fabio Pierazzi
Lorenzo Cavallaro
16
1
0
29 Feb 2024
Enhancing Tracking Robustness with Auxiliary Adversarial Defense Networks
Zhewei Wu
Ruilong Yu
Qihe Liu
Shuying Cheng
Shilin Qiu
Shijie Zhou
AAML
25
0
0
28 Feb 2024
A Curious Case of Remarkable Resilience to Gradient Attacks via Fully Convolutional and Differentiable Front End with a Skip Connection
Leonid Boytsov
Ameya Joshi
Filipe Condessa
AAML
16
0
0
26 Feb 2024
Optimal Zero-Shot Detector for Multi-Armed Attacks
Federica Granese
Marco Romanelli
Pablo Piantanida
AAML
29
0
0
24 Feb 2024
Distilling Adversarial Robustness Using Heterogeneous Teachers
Jieren Deng
A. Palmer
Rigel Mahmood
Ethan Rathbun
Jinbo Bi
Kaleel Mahmood
Derek Aguiar
AAML
25
0
0
23 Feb 2024
Corrective Machine Unlearning
Shashwat Goel
Ameya Prabhu
Philip H. S. Torr
Ponnurangam Kumaraguru
Amartya Sanyal
OnRL
33
14
0
21 Feb 2024
Generative AI Security: Challenges and Countermeasures
Banghua Zhu
Norman Mu
Jiantao Jiao
David A. Wagner
AAML
SILM
56
7
0
20 Feb 2024
Attacking Large Language Models with Projected Gradient Descent
Simon Geisler
Tom Wollschlager
M. H. I. Abdalla
Johannes Gasteiger
Stephan Günnemann
AAML
SILM
42
49
0
14 Feb 2024
Accelerated Smoothing: A Scalable Approach to Randomized Smoothing
Devansh Bhardwaj
Kshitiz Kaushik
Sarthak Gupta
AAML
16
0
0
12 Feb 2024
Your Diffusion Model is Secretly a Certifiably Robust Classifier
Huanran Chen
Yinpeng Dong
Shitong Shao
Zhongkai Hao
Xiao Yang
Hang Su
Jun Zhu
DiffM
26
12
0
04 Feb 2024
MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers
Yatong Bai
Mo Zhou
Vishal M. Patel
Somayeh Sojoudi
AAML
19
6
0
03 Feb 2024
Building Guardrails for Large Language Models
Yizhen Dong
Ronghui Mu
Gao Jin
Yi Qi
Jinwei Hu
Xingyu Zhao
Jie Meng
Wenjie Ruan
Xiaowei Huang
OffRL
57
27
0
02 Feb 2024
Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization
Guang Lin
Chao Li
Jianhai Zhang
Toshihisa Tanaka
Qibin Zhao
26
13
0
29 Jan 2024
CARE: Ensemble Adversarial Robustness Evaluation Against Adaptive Attackers for Security Applications
Hangsheng Zhang
Jiqiang Liu
Jinsong Dong
AAML
15
1
0
20 Jan 2024
Crafter: Facial Feature Crafting against Inversion-based Identity Theft on Deep Models
Shiming Wang
Zhe Ji
Liyao Xiang
Hao Zhang
Xinbing Wang
Cheng Zhou
Bo-wen Li
12
3
0
14 Jan 2024
Adversarial Examples are Misaligned in Diffusion Model Manifolds
P. Lorenz
Ricard Durall
Jansi Keuper
DiffM
33
1
0
12 Jan 2024
The Adaptive Arms Race: Redefining Robustness in AI Security
Ilias Tsingenopoulos
Vera Rimmer
Davy Preuveneers
Fabio Pierazzi
Lorenzo Cavallaro
Wouter Joosen
AAML
70
0
0
20 Dec 2023
May the Noise be with you: Adversarial Training without Adversarial Examples
Ayoub Arous
A. F. López-Lopera
Nael B. Abu-Ghazaleh
Ihsen Alouani
AAML
OOD
17
0
0
12 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
33
0
0
08 Dec 2023
Adversarial Medical Image with Hierarchical Feature Hiding
Qingsong Yao
Zecheng He
Yuexiang Li
Yi Lin
Kai Ma
Yefeng Zheng
S. Kevin Zhou
MedIm
AAML
18
4
0
04 Dec 2023
Topology-Preserving Adversarial Training
Xiaoyue Mi
Fan Tang
Yepeng Weng
Danding Wang
Juan Cao
Sheng Tang
Peng Li
Yang Liu
37
1
0
29 Nov 2023
Efficient Key-Based Adversarial Defense for ImageNet by Using Pre-trained Model
AprilPyone Maungmaung
Isao Echizen
Hitoshi Kiya
VLM
AAML
21
0
0
28 Nov 2023
Instruct2Attack: Language-Guided Semantic Adversarial Attacks
Jiang-Long Liu
Chen Wei
Yuxiang Guo
Heng Yu
Alan L. Yuille
S. Feizi
Chun Pong Lau
Rama Chellappa
DiffM
AAML
22
5
0
27 Nov 2023
Mixing Classifiers to Alleviate the Accuracy-Robustness Trade-Off
Yatong Bai
Brendon G. Anderson
Somayeh Sojoudi
AAML
16
2
0
26 Nov 2023
Adversarial Prompt Tuning for Vision-Language Models
Jiaming Zhang
Xingjun Ma
Xin Wang
Lingyu Qiu
Jiaqi Wang
Yu-Gang Jiang
Jitao Sang
AAML
VPVLM
VLM
9
18
0
19 Nov 2023