ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.10513
  4. Cited By
Learning Robust Policy against Disturbance in Transition Dynamics via
  State-Conservative Policy Optimization

Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization

AAAI Conference on Artificial Intelligence (AAAI), 2021
20 December 2021
Yufei Kuang
Miao Lu
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
ArXiv (abs)PDFHTML

Papers citing "Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization"

17 / 17 papers shown
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
Zhongjian Qiao
Rui Yang
Jiafei Lyu
Xiu Li
Zhongxiang Dai
Zhuoran Yang
Siyang Gao
Shuang Qiu
OffRL
155
0
0
02 Dec 2025
Robust Reinforcement Learning in Finance: Modeling Market Impact with Elliptic Uncertainty Sets
Robust Reinforcement Learning in Finance: Modeling Market Impact with Elliptic Uncertainty Sets
Shaocong Ma
Heng Huang
OODAIFinOffRL
278
0
0
22 Oct 2025
Keep on Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training
Keep on Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training
Yang Zhang
Zhanxiang Cao
Buqing Nie
Haoyang Li
Zhong Jiangwei
Qiao Sun
Xiaoyi Hu
Xiaokang Yang
Yue Gao
AAML
223
1
0
11 Jul 2025
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
David Bossens
Atsushi Nitanda
378
0
0
29 Jun 2025
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Montaser Mohammedalamen
Michael Bowling
290
0
0
13 May 2025
Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning
Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning
Yang Xu
Washim Uddin Mondal
Vaneet Aggarwal
OffRL
485
7
0
24 Feb 2025
Robust Deep Reinforcement Learning with Adaptive Adversarial
  Perturbations in Action Space
Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space
Qian Liu
Yufei Kuang
Jie Wang
AAML
114
10
0
20 May 2024
Lipschitz-Regularized Critics Lead to Policy Robustness Against Transition Dynamics Uncertainty
Lipschitz-Regularized Critics Lead to Policy Robustness Against Transition Dynamics Uncertainty
Xulin Chen
Ruipeng Liu
Garret E. Katz
Garrett E. Katz
350
0
0
22 Apr 2024
Distributionally Robust Reinforcement Learning with Interactive Data
  Collection: Fundamental Hardness and Near-Optimal Algorithm
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal AlgorithmNeural Information Processing Systems (NeurIPS), 2024
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRLOOD
251
18
0
04 Apr 2024
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear
  Programming
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling
Zhihai Wang
Jie Wang
295
10
0
31 Jan 2024
MICRO: Model-Based Offline Reinforcement Learning with a Conservative
  Bellman Operator
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
317
10
0
07 Dec 2023
Adjustable Robust Reinforcement Learning for Online 3D Bin Packing
Adjustable Robust Reinforcement Learning for Online 3D Bin PackingNeural Information Processing Systems (NeurIPS), 2023
Yuxin Pan
Yize Chen
Fangzhen Lin
OffRL
239
19
0
06 Oct 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function
  Approximation
Natural Actor-Critic for Robust Reinforcement Learning with Function ApproximationNeural Information Processing Systems (NeurIPS), 2023
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
379
39
0
17 Jul 2023
Double Pessimism is Provably Efficient for Distributionally Robust
  Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial CoverageNeural Information Processing Systems (NeurIPS), 2023
Jose H. Blanchet
Miao Lu
Tong Zhang
Han Zhong
OffRL
285
48
0
16 May 2023
Adversarial Policy Optimization in Deep Reinforcement Learning
Adversarial Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
AAML
112
0
0
27 Apr 2023
Optimal Transport Perturbations for Safe Reinforcement Learning with
  Robustness Guarantees
Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees
James Queeney
E. C. Ozcan
I. Paschalidis
Christos G. Cassandras
OODOffRL
293
8
0
31 Jan 2023
Provable Sim-to-real Transfer in Continuous Domain with Partial
  Observations
Provable Sim-to-real Transfer in Continuous Domain with Partial ObservationsInternational Conference on Learning Representations (ICLR), 2022
Jiachen Hu
Han Zhong
Chi Jin
Liwei Wang
270
10
0
27 Oct 2022
1
Page 1 of 1