2312.12585
Cited By
BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning
19 December 2023
Jing Cui
Yufei Han
Yuzhe Ma
Jianbin Jiao
Junge Zhang
AAML
Papers citing "BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning"
5 / 5 papers shown
Title
BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts
Qingyue Wang
Qi Pang
Xixun Lin
Shuai Wang
Daoyuan Wu
MoE
54
0
0
24 Apr 2025
UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning
Oubo Ma
L. Du
Yang Dai
Chunyi Zhou
Qingming Li
Yuwen Pu
Shouling Ji
41
0
0
28 Jan 2025
Recent Advances in Attack and Defense Approaches of Large Language Models
Jing Cui
Yishi Xu
Zhewei Huang
Shuchang Zhou
Jianbin Jiao
Junge Zhang
PILM
AAML
52
1
0
05 Sep 2024
BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning
Lun Wang
Zaynah Javed
Xian Wu
Wenbo Guo
Xinyu Xing
D. Song
AAML
89
64
0
02 May 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAML
OffRL
29
37
0
16 Feb 2021