ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.05821
  4. Cited By
Policy Poisoning in Batch Reinforcement Learning and Control
v1v2 (latest)

Policy Poisoning in Batch Reinforcement Learning and Control

Neural Information Processing Systems (NeurIPS), 2019
13 October 2019
Yuzhe Ma
Xuezhou Zhang
Wen Sun
Xiaojin Zhu
    AAMLOffRL
ArXiv (abs)PDFHTML

Papers citing "Policy Poisoning in Batch Reinforcement Learning and Control"

50 / 80 papers shown
Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning
Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning
Bokang Zhang
Chaojun Lu
Jianhui Li
Junfeng Wu
AAML
188
0
0
27 Nov 2025
Provably Invincible Adversarial Attacks on Reinforcement Learning Systems: A Rate-Distortion Information-Theoretic Approach
Provably Invincible Adversarial Attacks on Reinforcement Learning Systems: A Rate-Distortion Information-Theoretic Approach
Ziqing Lu
Lifeng Lai
Weiyu Xu
AAML
138
0
0
15 Oct 2025
Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets
Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets
Shriram Karpoora Sundara Pandian
Ali Baheri
OffRL
216
0
0
01 Oct 2025
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
Kosuke Nakanishi
Akihiro Kubo
Yuji Yasui
Shin Ishii
AAMLOffRL
297
0
0
20 Jun 2025
TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models
TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models
Yang Dai
Oubo Ma
Longfei Zhang
Xingxing Liang
Xiaochun Cao
Shouling Ji
J. Zhang
Jincai Huang
Li Shen
285
1
0
15 Jun 2025
Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning
Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement LearningConference on Uncertainty in Artificial Intelligence (UAI), 2025
Xue Zhou
Dapeng Man
Chen Xu
Fanyi Zeng
Tao Liu
Huan Wang
Shucheng He
Chaoyang Gao
Wu Yang
OffRL
236
0
0
12 Jun 2025
Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?
Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?
Paulius Sasnauskas
Yiğit Yalın
Goran Radanović
272
0
0
07 Jun 2025
Nonparametric Teaching for Graph Property Learners
Nonparametric Teaching for Graph Property Learners
Chen Zhang
Weixin Bu
Zhaochun Ren
Ziyue Liu
Yik-Chung Wu
Ngai Wong
504
4
0
20 May 2025
Optimally Installing Strict Equilibria
Optimally Installing Strict Equilibria
Jeremy McMahan
Young Wu
Yudong Chen
Xiaojin Zhu
Qiaomin Xie
360
0
0
05 Mar 2025
Online Poisoning Attack Against Reinforcement Learning under Black-box
  Environments
Online Poisoning Attack Against Reinforcement Learning under Black-box Environments
Jianhui Li
Bokang Zhang
Junfeng Wu
AAMLOffRLOnRL
364
4
0
01 Dec 2024
Provably Efficient Action-Manipulation Attack Against Continuous
  Reinforcement Learning
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning
Zhi Luo
Xiaoyu Yang
Pan Zhou
D. Wang
AAML
284
1
0
20 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning
  for Robustness under Diverse Data Corruptions
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsNeural Information Processing Systems (NeurIPS), 2024
Rui Yang
Jie Wang
Guoping Wu
Yangqiu Song
AAMLOffRL
454
9
0
01 Nov 2024
Inception: Efficiently Computable Misinformation Attacks on Markov Games
Inception: Efficiently Computable Misinformation Attacks on Markov Games
Jeremy McMahan
Young Wu
Yudong Chen
Xiaojin Zhu
Qiaomin Xie
182
1
0
24 Jun 2024
Nonparametric Teaching of Implicit Neural Representations
Nonparametric Teaching of Implicit Neural RepresentationsInternational Conference on Machine Learning (ICML), 2024
Chen Zhang
Steven Tin Sui Luo
Jason Chun Lok Li
Yik-Chung Wu
Ngai Wong
383
12
0
17 May 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement
  Learning Agents
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
238
2
0
18 Apr 2024
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Elita Lobo
Harvineet Singh
Marek Petrik
Cynthia Rudin
Himabindu Lakkaraju
278
3
0
06 Apr 2024
Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Andi Nika
Debmalya Mandal
Adish Singla
Goran Radanović
OffRL
242
3
0
04 Mar 2024
Reward Design for Justifiable Sequential Decision-Making
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
242
0
0
24 Feb 2024
Testing autonomous vehicles and AI: perspectives and challenges from
  cybersecurity, transparency, robustness and fairness
Testing autonomous vehicles and AI: perspectives and challenges from cybersecurity, transparency, robustness and fairness
David Fernández Llorca
Ronan Hamon
Henrik Junklewitz
Kathrin Grosse
Lars Kunze
...
Nick Reed
Alexandre Alahi
Emilia Gómez
Ignacio E. Sánchez
Á. Kriston
330
14
0
21 Feb 2024
Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits
Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits
Zhiwei Wang
Huazheng Wang
Hongning Wang
AAML
335
1
0
21 Feb 2024
Informativeness of Reward Functions in Reinforcement Learning
Informativeness of Reward Functions in Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2024
R. Devidze
Parameswaran Kamalaruban
Adish Singla
264
3
0
10 Feb 2024
SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent
  Reinforcement Learning Systems
SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning SystemsConference on Computer and Communications Security (CCS), 2024
Oubo Ma
Yuwen Pu
L. Du
Yang Dai
Ruo Wang
Xiaolei Liu
Yingcai Wu
Shouling Ji
AAML
327
14
0
06 Feb 2024
Assessing the Impact of Distribution Shift on Reinforcement Learning
  Performance
Assessing the Impact of Distribution Shift on Reinforcement Learning Performance
Ted Fujimoto
Joshua Suetterlein
Samrat Chatterjee
A. Ganguly
OffRL
272
9
0
05 Feb 2024
Camouflage Adversarial Attacks on Multiple Agent Systems
Camouflage Adversarial Attacks on Multiple Agent Systems
Ziqing Lu
Guanlin Liu
Lifeng Lai
Weiyu Xu
AAML
240
4
0
30 Jan 2024
Nonparametric Teaching for Multiple Learners
Nonparametric Teaching for Multiple Learners
Chen Zhang
Xiaofeng Cao
Weiyang Liu
Ivor Tsang
James T. Kwok
303
8
0
17 Nov 2023
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with
  Human Feedback in Large Language Models
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models
Zhenghao Hu
Junlin Wu
Muhao Chen
Yevgeniy Vorobeychik
Chaowei Xiao
AAML
282
33
0
16 Nov 2023
Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems
Optimal Cost Constrained Adversarial Attacks For Multiple Agent SystemsAnnual Conference on Information Sciences and Systems (CISS), 2023
Ziqing Lu
Guanlin Liu
Lifeng Lai
Weiyu Xu
AAML
240
3
0
01 Nov 2023
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and
  Value
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and ValueInternational Conference on Machine Learning (ICML), 2023
Young Wu
Jeremy McMahan
Yiding Chen
Yudong Chen
Xiaojin Zhu
Qiaomin Xie
575
4
0
01 Nov 2023
Corruption-Robust Offline Reinforcement Learning with General Function
  Approximation
Corruption-Robust Offline Reinforcement Learning with General Function ApproximationNeural Information Processing Systems (NeurIPS), 2023
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
478
30
0
23 Oct 2023
Efficient Adversarial Attacks on Online Multi-agent Reinforcement
  Learning
Efficient Adversarial Attacks on Online Multi-agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Guanlin Liu
Lifeng Lai
AAML
227
18
0
15 Jul 2023
Data Poisoning to Fake a Nash Equilibrium in Markov Games
Data Poisoning to Fake a Nash Equilibrium in Markov Games
Young Wu
Jeremy McMahan
Xiaojin Zhu
Qiaomin Xie
OffRL
351
2
0
13 Jun 2023
Nonparametric Iterative Machine Teaching
Nonparametric Iterative Machine TeachingInternational Conference on Machine Learning (ICML), 2023
Chen Zhang
Xiaofeng Cao
Weiyang Liu
Ivor Tsang
James T. Kwok
424
12
0
05 Jun 2023
Attacks on Online Learners: a Teacher-Student Analysis
Attacks on Online Learners: a Teacher-Student AnalysisNeural Information Processing Systems (NeurIPS), 2023
R. Margiotta
Sebastian Goldt
G. Sanguinetti
AAML
298
1
0
18 May 2023
Policy Resilience to Environment Poisoning Attacks on Reinforcement
  Learning
Policy Resilience to Environment Poisoning Attacks on Reinforcement Learning
Hang Xu
Xinghua Qu
Zinovi Rabinovich
274
3
0
24 Apr 2023
Implicit Poisoning Attacks in Two-Agent Reinforcement Learning:
  Adversarial Policies for Training-Time Attacks
Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time AttacksAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Mohammad Mohammadi
Jonathan Nöther
Debmalya Mandal
Adish Singla
Goran Radanović
AAMLOffRL
233
12
0
27 Feb 2023
Adversarial Attacks on Adversarial Bandits
Adversarial Attacks on Adversarial BanditsInternational Conference on Learning Representations (ICLR), 2023
Yuzhe Ma
Zhijin Zhou
AAML
221
10
0
30 Jan 2023
Learned-Database Systems Security
Learned-Database Systems Security
R. Schuster
Jinyi Zhou
Thorsten Eisenhofer
Paul Grubbs
Nicolas Papernot
AAML
445
2
0
20 Dec 2022
Security of Deep Reinforcement Learning for Autonomous Driving: A Survey
Security of Deep Reinforcement Learning for Autonomous Driving: A Survey
Ambra Demontis
Srishti Gupta
Christian Scano
Luca Demetrio
Kathrin Grosse
Hsiao-Ying Lin
Chengfang Fang
Battista Biggio
Fabio Roli
AAML
402
4
0
12 Dec 2022
Iterative Teaching by Data Hallucination
Iterative Teaching by Data HallucinationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Zeju Qiu
Weiyang Liu
Tim Z. Xiao
Zhen Liu
Umang Bhatt
Yucen Luo
Adrian Weller
Bernhard Schölkopf
368
12
0
31 Oct 2022
Imitating Opponent to Win: Adversarial Policy Imitation Learning in
  Two-player Competitive Games
Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive GamesAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Viet The Bui
Tien Mai
T. Nguyen
AAML
327
6
0
30 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Yue Liu
Ding Zhao
450
53
0
16 Sep 2022
Understanding the Limits of Poisoning Attacks in Episodic Reinforcement
  Learning
Understanding the Limits of Poisoning Attacks in Episodic Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
A. Rangi
Haifeng Xu
Long Tran-Thanh
M. Franceschetti
AAMLOffRL
233
26
0
29 Aug 2022
Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation
  and Complexity Analysis
Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis
Tao Li
Haozhe Lei
Quanyan Zhu
AAML
458
10
0
29 Jul 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
RORL: Robust Offline Reinforcement Learning via Conservative SmoothingNeural Information Processing Systems (NeurIPS), 2022
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
589
106
0
06 Jun 2022
Reward Poisoning Attacks on Offline Multi-Agent Reinforcement Learning
Reward Poisoning Attacks on Offline Multi-Agent Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Young Wu
Jermey McMahan
Xiaojin Zhu
Qiaomin Xie
AAMLOffRL
535
24
0
04 Jun 2022
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
Byzantine-Robust Online and Offline Distributed Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yiding Chen
Xuezhou Zhang
Jianchao Tan
Mengdi Wang
Xiaojin Zhu
OffRL
398
22
0
01 Jun 2022
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu
Qi Zeng
Gagandeep Singh
AAML
332
8
0
30 May 2022
COPA: Certifying Robust Policies for Offline Reinforcement Learning
  against Poisoning Attacks
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning AttacksInternational Conference on Learning Representations (ICLR), 2022
Fan Wu
Linyi Li
Chejian Xu
Huan Zhang
B. Kailkhura
K. Kenthapadi
Ding Zhao
Yue Liu
AAMLOffRL
224
38
0
16 Mar 2022
Reinforcement Learning for Linear Quadratic Control is Vulnerable Under
  Cost Manipulation
Reinforcement Learning for Linear Quadratic Control is Vulnerable Under Cost Manipulation
Yunhan Huang
Quanyan Zhu
OffRLAAML
342
4
0
11 Mar 2022
Trusted AI in Multi-agent Systems: An Overview of Privacy and Security
  for Distributed Learning
Trusted AI in Multi-agent Systems: An Overview of Privacy and Security for Distributed LearningProceedings of the IEEE (Proc. IEEE), 2022
Chuan Ma
Jun Li
Kang Wei
Bo Liu
Ming Ding
Long Yuan
Zhu Han
H. Vincent Poor
434
77
0
18 Feb 2022
12
Next
Page 1 of 2