Backdoor Learning: A Survey

IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020

17 July 2020

Papers citing "Backdoor Learning: A Survey"

50 / 368 papers shown

Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models

422

29 Nov 2025

AutoBackdoor: Automating Backdoor Attacks via LLM Agents

470

20 Nov 2025

AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework

Samuel Nathanson

Alexander Lee

Catherine Chen Kieffer

126

16 Nov 2025

CatBack: Universal Backdoor Attacks on Tabular Data via Categorical Encoding

147

08 Nov 2025

Power to the Clients: Federated Learning in a Dictatorship Setting

Mohammadsajad Alipour

Mohammad Mohammadi Amiri

FedML

232

25 Oct 2025

Forgetting to Forget: Attention Sink as A Gateway for Backdooring LLM Unlearning

279

19 Oct 2025

TED++: Submanifold-Aware Backdoor Detection via Layerwise Tubular-Neighbourhood Screening

169

16 Oct 2025

Backdoor Unlearning by Linear Task Decomposition

273

16 Oct 2025

DropVLA: An Action-Level Backdoor Attack on Vision-Language-Action Models

Xingjun Ma

Yu-Gang Jiang

161

13 Oct 2025

Geometry-Aware Backdoor Attacks: Leveraging Curvature in Hyperbolic Embeddings

Ali Baheri

AAML LLMSV

248

07 Oct 2025

Responsible Diffusion: A Comprehensive Survey on Safety, Ethics, and Trust in Diffusion Models

288

25 Sep 2025

NeuroStrike: Neuron-Level Attacks on Aligned LLMs

342

15 Sep 2025

NeuroDeX: Unlocking Diverse Support in Decompiling Deep Neural Network Executables

181

08 Sep 2025

DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models

227

04 Sep 2025

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models

244

04 Sep 2025

Backdoor Poisoning Attack Against Face Spoofing Attack Detection Methods

383

03 Sep 2025

BadFU: Backdoor Federated Learning through Adversarial Machine Unlearning

205

21 Aug 2025

NT-ML: Backdoor Defense via Non-target Label Training and Mutual Learning

Wenjie Huo

Katinka Wolter

AAML

181

07 Aug 2025

BadBlocks: Lightweight and Stealthy Backdoor Threat in Text-to-Image Diffusion Models

352

05 Aug 2025

Towards Stealthy and Effective Backdoor Attacks on Lane Detection: A Naturalistic Data Poisoning Approach

215

04 Aug 2025

Coward: Collision-based Watermark for Proactive Federated Backdoor Detection

217

04 Aug 2025

BadReasoner: Planting Tunable Overthinking Backdoors into Large Reasoning Models for Fun or Profit

286

24 Jul 2025

ShrinkBox: Backdoor Attack on Object Detection to Disrupt Collision Avoidance in Machine Learning-based Advanced Driver Assistance Systems

M. Shahzad

Muhammad Abdullah Hanif

B. Ouni

Muhammad Shafique

AAML

134

22 Jul 2025

VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation

512

09 Jul 2025

Rethinking Data Protection in the (Generative) Artificial Intelligence Era

...

500

03 Jul 2025

SoK: On the Survivability of Backdoor Attacks on Unconstrained Face Recognition Systems

Quentin Le Roux

Yannick Teglia

Teddy Furon

Philippe Loubet-Moundi

Eric Bourbao

CVBM AAML SILM

331

02 Jul 2025

Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models

International Conference on Learning Representations (ICLR), 2025

373

19 Jun 2025

CertDW: Towards Certified Dataset Ownership Verification via Conformal Prediction

271

16 Jun 2025

SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models

179

10 Jun 2025

TwinBreak: Jailbreaking LLM Security Alignments based on Twin Prompts

T. Krauß

Hamid Dashtbani

Alexandra Dmitrienko

230

09 Jun 2025

Trojan Horse Hunt in Time Series Forecasting for Space Operations

Evridiki Vasileia Ntagiou

236

02 Jun 2025

BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization

337

22 May 2025

The Ripple Effect: On Unforeseen Complications of Backdoor Attacks

258

16 May 2025

ROSA: Finding Backdoors with Fuzzing

International Conference on Software Engineering (ICSE), 2025

275

13 May 2025

Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

IEEE Symposium on Security and Privacy (S&P), 2025

365

12 May 2025

MergeGuard: Efficient Thwarting of Trojan Attacks in Machine Learning Models

Soheil Zibakhsh Shabgahi

Yaman Jandali

F. Koushanfar

MoMe AAML

289

06 May 2025

BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models

271

06 May 2025

Cert-SSBD: Certified Backdoor Defense with Sample-Specific Smoothing Noises

567

30 Apr 2025

Robo-Troj: Attacking LLM-based Task Planners

470

23 Apr 2025

Exploring Backdoor Attack and Defense for LLM-empowered Recommendations

409

15 Apr 2025

Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models

409

08 Apr 2025

A Survey on Unlearnable Data

422

30 Mar 2025

DeBackdoor: A Deductive Framework for Detecting Backdoor Attacks on Deep Models with Limited Data

387

27 Mar 2025

A Semantic and Clean-label Backdoor Attack against Graph Convolutional Networks

Jiazhu Dai

Haoyu Sun

AAML

359

19 Mar 2025

Breaking Free from MMI: A New Frontier in Rationalization by Probing Input Utilization

International Conference on Learning Representations (ICLR), 2025

444

08 Mar 2025

CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking

963

02 Mar 2025

Re-Imagining Multimodal Instruction Tuning: A Representation View

International Conference on Learning Representations (ICLR), 2025

...

1.2K

02 Mar 2025

A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models

Vu Tuan Truong

Long Bao Le

DiffM AAML

1.2K

26 Feb 2025

Multi-Target Federated Backdoor Attack Based on Feature Aggregation

Pattern Recognition (Pattern Recogn.), 2025

399

23 Feb 2025

REFINE: Inversion-Free Backdoor Defense via Model Reprogramming

International Conference on Learning Representations (ICLR), 2025

311

22 Feb 2025