1706.06083
Towards Deep Learning Models Resistant to Adversarial Attacks
19 June 2017
Aleksander Madry
Aleksandar Makelov
Ludwig Schmidt
Dimitris Tsipras
Adrian Vladu
SILM
OOD
Github (752★)
Papers citing "Towards Deep Learning Models Resistant to Adversarial Attacks"
50 / 7,067 papers shown
SPEAR++: Scaling Gradient Inversion via Sparsely-Used Dictionary Learning
Alexander Bakarsky
Dimitar I. Dimitrov
Maximilian Baader
Martin Vechev
FedML
101
0
0
28 Oct 2025
Self-Calibrated Consistency can Fight Back for Adversarial Robustness in Vision-Language Models
Jiaxiang Liu
Jiawei Du
Xiao Liu
Prayag Tiwari
Mingkun Xu
AAML
VLM
124
1
0
26 Oct 2025
Stable neural networks and connections to continuous dynamical systems
Matthias Joachim Ehrhardt
Davide Murari
Ferdia Sherry
AAML
96
0
0
25 Oct 2025
FrameShield: Adversarially Robust Video Anomaly Detection
Mojtaba Nafez
Mobina Poulaei
Nikan Vasei
Bardia Soltani Moakhar
Mohammad Sabokrou
M. Rohban
AAML
173
0
0
24 Oct 2025
Toward Understanding the Transferability of Adversarial Suffixes in Large Language Models
Sarah Ball
Niki Hasrati
Alexander Robey
Avi Schwarzschild
Frauke Kreuter
Zico Kolter
Andrej Risteski
AAML
297
0
0
24 Oct 2025
Transferable Black-Box One-Shot Forging of Watermarks via Image Preference Models
Tomáš Souček
Sylvestre-Alvise Rebuffi
Pierre Fernandez
Nikola Jovanović
Hady ElSahar
Valeriu Lacatusu
Tuan Tran
Alexandre Mourachko
WIGM
AAML
298
0
0
23 Oct 2025
Kernel Learning with Adversarial Features: Numerical Efficiency and Adaptive Regularization
Antônio H. Ribeiro
David Vävinggren
Dave Zachariah
Thomas B. Schon
Francis Bach
AAML
137
0
0
23 Oct 2025
Adversarially-Aware Architecture Design for Robust Medical AI Systems
Alyssa Gerhart
Balaji Iyangar
AAML
190
1
0
23 Oct 2025
H-SPLID: HSIC-based Saliency Preserving Latent Information Decomposition
Lukas Miklautz
Chengzhi Shi
Andrii Shkabrii
Theodoros Thirimachos Davarakis
Prudence Lam
Claudia Plant
Jennifer Dy
Stratis Ioannidis
102
0
0
23 Oct 2025
FPT-Noise: Dynamic Scene-Aware Counterattack for Test-Time Adversarial Defense in Vision-Language Models
Jia Deng
Jin Li
Zhenhua Zhao
Shaowei Wang
AAML
VLM
156
1
0
22 Oct 2025
Revisiting the Relation Between Robustness and Universality
M. Klabunde
L. Caspari
F. Lemmerich
AAML
109
0
0
22 Oct 2025
Towards Strong Certified Defense with Universal Asymmetric Randomization
Hanbin Hong
Ashish Kundu
Ali Payani
Binghui Wang
Yuan Hong
AAML
157
0
0
22 Oct 2025
AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields
Woo Jae Kim
Kyu Beom Han
Y. Cho
Youngju Na
Junsik Jung
Sooel Son
Sung-eui Yoon
AAML
161
0
0
22 Oct 2025
The Black Tuesday Attack: how to crash the stock market with adversarial examples to financial forecasting models
Thomas Hofweber
Jefrey Bergl
Ian Reyes
Amir Sadovnik
AAML
AIFin
150
0
0
21 Oct 2025
PP3D: An In-Browser Vision-Based Defense Against Web Behavior Manipulation Attacks
Spencer King
Irfan Ozen
Karthika Subramani
Saranyan Senthivel
Phani Vadrevu
R. Perdisci
AAML
93
0
0
21 Oct 2025
S2AP: Score-space Sharpness Minimization for Adversarial Pruning
Giorgio Piras
Qi Zhao
Fabio Brau
Maura Pintor
Christian Wressnegger
Battista Biggio
AAML
135
0
0
21 Oct 2025
Black-Box Evasion Attacks on Data-Driven Open RAN Apps: Tailored Design and Experimental Evaluation
Pranshav Gajjar
Molham Khoja
Abiodun Ganiyu
Marc Juarez
Mahesh K. Marina
Andrew Lehane
Vijay K. Shah
136
0
0
20 Oct 2025
A Single Set of Adversarial Clothes Breaks Multiple Defense Methods in the Physical World
Wei Emma Zhang
Zhanhao Hu
Xiao-Li Li
Xiaopei Zhu
Xiaolin Hu
AAML
83
0
0
20 Oct 2025
Data Unlearning Beyond Uniform Forgetting via Diffusion Time and Frequency Selection
Jinseong Park
Mijung Park
DiffM
MU
250
0
0
20 Oct 2025
Variance-Reduction Guidance: Sampling Trajectory Optimization for Diffusion Models
Shifeng Xu
Yanzhu Liu
A. Kong
103
1
0
20 Oct 2025
A Versatile Framework for Designing Group-Sparse Adversarial Attacks
Alireza Heshmati
Saman Soleimani Roudi
Sajjad Amini
Shahrokh Ghaemmaghami
Farokh Marvasti
AAML
147
0
0
18 Oct 2025
Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness
Longwei Wang
Ifrat Ikhtear Uddin
KC Santosh
Chaowei Zhang
Xiao Qin
Yang Zhou
AAML
261
2
0
17 Oct 2025
Constrained Adversarial Perturbation
Virendra Nishad
B. Mukhoty
Hilal AlQuabeh
S. Shukla
Sayak Ray Chowdhury
AAML
150
0
0
17 Oct 2025
When Flatness Does (Not) Guarantee Adversarial Robustness
Nils Philipp Walter
Linara Adilova
Jilles Vreeken
Michael Kamp
141
1
0
16 Oct 2025
SAJA: A State-Action Joint Attack Framework on Multi-Agent Deep Reinforcement Learning
Weiqi Guo
Guanjun Liu
Ziyuan Zhou
AAML
94
0
0
15 Oct 2025
NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations
Junjie Nan
Jianing Li
Wei Chen
Mingkun Zhang
Xueqi Cheng
PICV
233
0
0
15 Oct 2025
Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models
Haochuan Xu
Yun Sing Koh
Shuhuai Huang
Z. Zhou
D. Wang
Jun Sakuma
Jingfeng Zhang
AAML
181
3
0
15 Oct 2025
Generalist++: A Meta-learning Framework for Mitigating Trade-off in Adversarial Training
Yisen Wang
Yichuan Mo
Hongjun Wang
Junyi Li
Zhouchen Lin
AAML
131
1
0
15 Oct 2025
Towards Adversarial Robustness and Uncertainty Quantification in DINOv2-based Few-Shot Anomaly Detection
Akib Mohammed Khan
Bartosz Krawczyk
AAML
137
0
0
15 Oct 2025
Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
James Pedley
Benjamin Etheridge
Stephen J. Roberts
Francesco Quinzan
OffRL
AAML
116
0
0
14 Oct 2025
KoALA: KL-L0 Adversarial Detector via Label Agreement
Siqi Li
Yasser Shoukry
AAML
VLM
124
0
0
14 Oct 2025
Joint Discriminative-Generative Modeling via Dual Adversarial Training
Xuwang Yin
Claire Zhang
Julie Steele
Nir Shavit
T. T. Wang
GAN
435
0
0
13 Oct 2025
Adversarial Robustness in One-Stage Learning-to-Defer
Yannis Montreuil
Letian Yu
Axel Carlier
Lai Xing Ng
Wei Tsang Ooi
AAML
112
1
0
13 Oct 2025
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
Simin Li
Zihao Mao
Hanxiao Li
Zonglei Jing
Zhuohang Bian
...
Yuqing Ma
Bo An
Yaodong Yang
Weifeng Lv
Xianglong Liu
154
0
0
13 Oct 2025
Adversarial Attacks Leverage Interference Between Features in Superposition
Edward Stevinson
Lucas Prieto
Melih Barsbey
Tolga Birdal
AAML
113
0
0
13 Oct 2025
The Easy Path to Robustness: Coreset Selection using Sample Hardness
Pranav Ramesh
Arjun Roy
Deepak Ravikumar
Kaushik Roy
Gopalakrishnan Srinivasan
141
0
0
13 Oct 2025
CoDefend: Cross-Modal Collaborative Defense via Diffusion Purification and Prompt Optimization
Fengling Zhu
Boshi Liu
Jingyu Hua
Sheng Zhong
DiffM
AAML
114
0
0
13 Oct 2025
Anchor-based Maximum Discrepancy for Relative Similarity Testing
Zhijian Zhou
Liuhua Peng
Xunye Tian
Yifan Zhang
127
0
0
12 Oct 2025
Adversarial Attacks on Downstream Weather Forecasting Models: Application to Tropical Cyclone Trajectory Prediction
Yue Deng
Francisco Santos
Pang-Ning Tan
Lifeng Luo
AAML
103
0
0
11 Oct 2025
Explainable Human-in-the-Loop Segmentation via Critic Feedback Signals
Pouya Shaeri
Ryan T. Woo
Yasaman Mohammadpour
Ariane Middel
130
0
0
11 Oct 2025
Tight Robustness Certificates and Wasserstein Distributional Attacks for Deep Neural Networks
Bach C. Le
Tung V. Dao
Binh T. Nguyen
Hong T.M. Chu
OOD
190
0
0
11 Oct 2025
SegTrans: Transferable Adversarial Examples for Segmentation Models
Yufei Song
Ziqi Zhou
Qi Lu
Hangtao Zhang
Yifan Hu
Lulu Xue
Shengshan Hu
Minghui Li
Leo Yu Zhang
144
5
0
10 Oct 2025
A geometrical approach to solve the proximity of a point to an axisymmetric quadric in space
Bibekananda Patra
Aditya Mahesh Kolte
Sandipan Bandyopadhyay
122
11
0
10 Oct 2025
Uncolorable Examples: Preventing Unauthorized AI Colorization via Perception-Aware Chroma-Restrictive Perturbation
Yuki Nii
Futa Waseda
Ching-Chun Chang
Isao Echizen
AAML
127
0
0
10 Oct 2025
A unified Bayesian framework for adversarial robustness
Pablo G. Arce
Roi Naveiro
David Ríos Insua
AAML
113
0
0
10 Oct 2025
Text Prompt Injection of Vision Language Models
Ruizhe Zhu
SILM
VLM
342
1
0
10 Oct 2025
VisuoAlign: Safety Alignment of LVLMs with Multimodal Tree Search
MingSheng Li
Guangze Zhao
Sichen Liu
129
0
0
10 Oct 2025
SynthID-Image: Image watermarking at internet scale
Sven Gowal
Rudy Bunel
Florian Stimberg
David Stutz
Guillermo Ortiz-Jimenez
...
Simon Rosen
Christopher Savčak
Armin Senoner
Nidhi Vyas
Pushmeet Kohli
WIGM
257
4
0
10 Oct 2025
MemLoss: Enhancing Adversarial Training with Recycling Adversarial Examples
Soroush Mahdi
M. Amirmazlaghani
Saeed Saravani
Zahra Dehghanian
AAML
83
0
0
10 Oct 2025
The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against LLM Jailbreaks and Prompt Injections
Milad Nasr
Nicholas Carlini
Chawin Sitawarin
Sander Schulhoff
Jamie Hayes
...
Ilia Shumailov
Abhradeep Thakurta
Kai Yuanqing Xiao
Seth Neel
F. Tramèr
AAML
ELM
183
15
0
10 Oct 2025