
NeuronInspect: Detecting Backdoors in Neural Networks via Output Explanations
Xijie Huang, M. Alzantot, Mani B. Srivastava | AAML | 18 November 2019

Papers citing "NeuronInspect: Detecting Backdoors in Neural Networks via Output Explanations"

50 / 58 papers shown
Enhancing the Effectiveness and Durability of Backdoor Attacks in Federated Learning through Maximizing Task Distinction
Zhaoxin Wang, Handing Wang, Cong Tian, Yaochu Jin | FedML, AAML | 23 Sep 2025

BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation
Knowledge Discovery and Data Mining (KDD), 2024
Haiyang Yu, Tian Xie, Jiaping Gui, Pengyang Wang, P. Yi, Yue Wu | 17 Nov 2024

A Practical Trigger-Free Backdoor Attack on Neural Networks
Jiahao Wang, Xianglong Zhang, Xiuzhen Cheng, Pengfei Hu, Guoming Zhang | AAML | 21 Aug 2024

A Survey of Trojan Attacks and Defenses to Deep Neural Networks
Lingxin Jin, Xianyu Wen, Wei Jiang, Jinyu Zhan | AAML | 15 Aug 2024

Clean-Label Physical Backdoor Attacks with Data Distillation
Thinh Dao, Cuong Chi Le, Khoa D. Doan | AAML | 27 Jul 2024

Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models
Changjiang Li, Ren Pang, Bochuan Cao, Jinghui Chen, Fenglong Ma, Shouling Ji, Ting Wang | DiffM | 14 Jun 2024

Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Lijia Yu, Shuang Liu, Yibo Miao, Xiao-Shan Gao, Lijun Zhang | AAML | 02 Jun 2024

BackdoorIndicator: Leveraging OOD Data for Proactive Backdoor Detection in Federated Learning
Songze Li, Yanbo Dai | AAML, FedML | 31 May 2024

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
Shengyuan Yang, Jiawang Bai, Kuofeng Gao, Yong-Liang Yang, Yiming Li, Shu-Tao Xia | AAML, SILM | 17 May 2024

A Backdoor-based Explainable AI Benchmark for High Fidelity Evaluation of Attributions
Peiyu Yang, Naveed Akhtar, Jiantong Jiang, Lin Wang | XAI | 02 May 2024

LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Shuyang Cheng, Guanhong Tao, Yingqi Liu, Guangyu Shen, Shengwei An, Shiwei Feng, Xiangzhe Xu, Kaiyuan Zhang, Shiqing Ma, Xiangyu Zhang | AAML | 25 Mar 2024

Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors
D. Sahabandu, Xiaojun Xu, Arezoo Rajabi, Luyao Niu, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran | AAML | 12 Feb 2024

Preference Poisoning Attacks on Reward Model Learning
Junlin Wu, Zhenghao Hu, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik | AAML | 02 Feb 2024

UltraClean: A Simple Framework to Train Robust Neural Networks against Backdoor Attacks
Bingyin Zhao, Yingjie Lao | AAML | 17 Dec 2023

On the Difficulty of Defending Contrastive Learning against Backdoor Attacks
USENIX Security Symposium (USENIX Security), 2023
Changjiang Li, Ren Pang, Bochuan Cao, Zhaohan Xi, Jinghui Chen, R. Beyah, Ting Wang | AAML | 14 Dec 2023

Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger
IEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2023
Yiming Li, Mingyan Zhu, Junfeng Guo, Tao Wei, Shu-Tao Xia, Zhan Qin | AAML | 03 Dec 2023

SecurityNet: Assessing Machine Learning Vulnerabilities on Public Models
Boyang Zhang, Zheng Li, Ziqing Yang, Xinlei He, Michael Backes, Mario Fritz, Yang Zhang | 19 Oct 2023

XGBD: Explanation-Guided Graph Backdoor Detection
European Conference on Artificial Intelligence (ECAI), 2023
Zihan Guan, Mengnan Du, Ninghao Liu | AAML | 08 Aug 2023

TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models
IEEE International Conference on Computer Vision (ICCV), 2023
Indranil Sur, Karan Sikka, Matthew Walmer, K. Koneripalli, Anirban Roy, Xiaoyu Lin, Ajay Divakaran, Susmit Jha | 07 Aug 2023

A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Artificial Intelligence Review (AIR), 2023
Xiaowei Huang, Wenjie Ruan, Wei Huang, Gao Jin, Yizhen Dong, ..., Sihao Wu, Peipei Xu, Dengyu Wu, André Freitas, Mustafa A. Mustafa | ALM | 19 May 2023

Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Ajinkya Tejankar, Maziar Sanjabi, Qifan Wang, Sinong Wang, Hamed Firooz, Hamed Pirsiavash, L Tan | AAML | 04 Apr 2023

Poisoning Web-Scale Training Datasets is Practical
IEEE Symposium on Security and Privacy (IEEE S&P), 2023
Nicholas Carlini, Matthew Jagielski, Christopher A. Choquette-Choo, Daniel Paleka, Will Pearce, Hyrum S. Anderson, Seth Neel, Kurt Thomas, Florian Tramèr | SILM | 20 Feb 2023

Mithridates: Auditing and Boosting Backdoor Resistance of Machine Learning Pipelines
Conference on Computer and Communications Security (CCS), 2023
Eugene Bagdasaryan, Vitaly Shmatikov | AAML | 09 Feb 2023

BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defense
Network and Distributed System Security Symposium (NDSS), 2023
Shuyang Cheng, Guanhong Tao, Yingqi Liu, Shengwei An, Xiangzhe Xu, ..., Guangyu Shen, Kaiyuan Zhang, Qiuling Xu, Shiqing Ma, Xiangyu Zhang | AAML | 16 Jan 2023

XMAM: X-raying Models with A Matrix to Reveal Backdoor Attacks for Federated Learning
Jianyi Zhang, Fangjiao Zhang, Qichao Jin, Zhiqiang Wang, Xiaodong Lin, X. Hei | AAML, FedML | 28 Dec 2022

Fine-Tuning Is All You Need to Mitigate Backdoor Attacks
Zeyang Sha, Xinlei He, Pascal Berrang, Mathias Humbert, Yang Zhang | AAML | 18 Dec 2022

Be Careful with Rotation: A Uniform Backdoor Pattern for 3D Shape
Linkun Fan, Fazhi He, Qingchen Guo, Wei Tang, Xiaolin Hong, Bing Li | AAML, 3DPC | 28 Nov 2022

Dormant Neural Trojans
International Conference on Machine Learning and Applications (ICMLA), 2022
Feisi Fu, Panagiota Kiourti, Wenchao Li | AAML | 02 Nov 2022

An Embarrassingly Simple Backdoor Attack on Self-supervised Learning
IEEE International Conference on Computer Vision (ICCV), 2022
Changjiang Li, Ren Pang, Zhaohan Xi, Tianyu Du, S. Ji, Yuan Yao, Ting Wang | AAML | 13 Oct 2022

Understanding Impacts of Task Similarity on Backdoor Attack and Detection
Di Tang, Rui Zhu, Luyi Xing, Haixu Tang, Yi Chen | AAML | 12 Oct 2022

Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protection
Neural Information Processing Systems (NeurIPS), 2022
Yiming Li, Yang Bai, Yong Jiang, Yong-Liang Yang, Shutao Xia, Bo Li | AAML | 27 Sep 2022

MOVE: Effective and Harmless Ownership Verification via Embedded External Features
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yiming Li, Linghui Zhu, Yang Liu, Yang Bai, Yong Jiang, Shutao Xia, Xiaochun Cao, Kui Ren | AAML | 04 Aug 2022

Game of Trojans: A Submodular Byzantine Approach
D. Sahabandu, Arezoo Rajabi, Luyao Niu, Yangqiu Song, Bhaskar Ramasubramanian, Radha Poovendran | AAML | 13 Jul 2022

Towards a Defense Against Federated Backdoor Attacks Under Continuous Training
Shuai Wang, J. Hayase, Giulia Fanti, Sewoong Oh | FedML | 24 May 2022

Wild Patterns Reloaded: A Survey of Machine Learning Security against Training Data Poisoning
ACM Computing Surveys (ACM CSUR), 2022
Antonio Emanuele Cinà, Kathrin Grosse, Ambra Demontis, Sebastiano Vascon, Werner Zellinger, Bernhard A. Moser, Alina Oprea, Battista Biggio, Marcello Pelillo, Fabio Roli | AAML | 04 May 2022

Backdooring Explainable Machine Learning
Maximilian Noppel, Lukas Peter, Christian Wressnegger | AAML | 20 Apr 2022

Trojan Horse Training for Breaking Defenses against Backdoor Attacks in Deep Learning
Arezoo Rajabi, Bhaskar Ramasubramanian, Radha Poovendran | AAML | 25 Mar 2022

A Survey of Neural Trojan Attacks and Defenses in Deep Learning
Jie Wang, Ghulam Mubashar Hassan, Naveed Akhtar | AAML | 15 Feb 2022

Jigsaw Puzzle: Selective Backdoor Attack to Subvert Malware Classifiers
IEEE Symposium on Security and Privacy (IEEE S&P), 2022
Limin Yang, Zhi Chen, Jacopo Cortellazzi, Feargus Pendlebury, Kevin Tu, Fabio Pierazzi, Lorenzo Cavallaro, Gang Wang | AAML | 11 Feb 2022

Robust and Privacy-Preserving Collaborative Learning: A Comprehensive Survey
Shangwei Guo, Xu Zhang, Feiyu Yang, Tianwei Zhang, Yan Gan, Tao Xiang, Yang Liu | FedML | 19 Dec 2021

Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
Xiangyu Qi, Tinghao Xie, Ruizhe Pan, Jifeng Zhu, Yong-Liang Yang, Kai Bu | AAML | 25 Nov 2021

Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers
C. Ashcraft, Kiran Karra | 14 Jun 2021

Stealthy Backdoors as Compression Artifacts
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2021
Yulong Tian, Fnu Suya, Fengyuan Xu, David Evans | 30 Apr 2021

MISA: Online Defense of Trojaned Models using Misattributions
Asia-Pacific Computer Systems Architecture Conference (ACSA), 2021
Panagiota Kiourti, Wenchao Li, Anirban Roy, Karan Sikka, Susmit Jha | 29 Mar 2021

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Wenkai Yang, Lei Li, Zhiyuan Zhang, Xuancheng Ren, Xu Sun, Bin He | SILM | 29 Mar 2021

Black-box Detection of Backdoor Attacks with Limited Information and Data
IEEE International Conference on Computer Vision (ICCV), 2021
Yinpeng Dong, Xiao Yang, Zhijie Deng, Tianyu Pang, Zihao Xiao, Hang Su, Jun Zhu | AAML | 24 Mar 2021

EX-RAY: Distinguishing Injected Backdoor from Natural Features in Neural Networks by Examining Differential Feature Symmetry
Yingqi Liu, Guangyu Shen, Guanhong Tao, Zhenting Wang, Shiqing Ma, Xinming Zhang | AAML | 16 Mar 2021

TrojanZoo: Towards Unified, Holistic, and Practical Evaluation of Neural Backdoors
European Symposium on Security and Privacy (EuroS&P), 2020
Ren Pang, Zheng Zhang, Xiangshan Gao, Zhaohan Xi, S. Ji, Peng Cheng, Xiapu Luo, Ting Wang | AAML | 16 Dec 2020

Invisible Backdoor Attack with Sample-Specific Triggers
Yuezun Li, Yiming Li, Baoyuan Wu, Longkang Li, Ran He, Siwei Lyu | AAML, DiffM | 07 Dec 2020

Cassandra: Detecting Trojaned Networks from Adversarial Perturbations
IEEE Access, 2020
Xiaoyu Zhang, Lin Wang, Rohit Gupta, Nazanin Rahnavard, M. Shah | AAML | 28 Jul 2020
