Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

23 February 2021
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
    OffRL
arXiv: 2102.11513

Papers citing "Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model"

8 / 8 papers shown
Aligning Language Models with Human Preferences via a Bayesian Approach
Neural Information Processing Systems (NeurIPS), 2023
Jiashuo Wang
Haozhao Wang
Shichao Sun
Wenjie Li
ALM
418
36
0
09 Oct 2023
Smoothing Policy Iteration for Zero-sum Markov Games
Yangang Ren
Yao Lyu
Wenxuan Wang
Sheng Li
Zeyang Li
Jingliang Duan
180
1
0
03 Dec 2022
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
145
2
0
19 Oct 2022
Reachability Constrained Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Dongjie Yu
Haitong Ma
Sheng Li
Jianyu Chen
321
83
0
16 May 2022
Improve Generalization of Driving Policy at Signalized Intersections with Adversarial Learning
Transportation Research Part C: Emerging Technologies (TRC), 2022
Yangang Ren
Tianze Zhu
Liye Tang
Shengbo Eben Li
Jianhua Jiang
Jingliang Duan
AAML
245
19
0
09 Apr 2022
Learn Zero-Constraint-Violation Policy in Model-Free Constrained Reinforcement Learning
Haitong Ma
Changliu Liu
Shengbo Eben Li
Sifa Zheng
Wen Sun
Jianyu Chen
179
11
0
25 Nov 2021
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety
Haitong Ma
Yang Guan
Shengbo Eben Li
Xiangteng Zhang
Sifa Zheng
Jianyu Chen
247
47
0
22 May 2021
Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021
Yang Guan
Yangang Ren
Qi Sun
Shengbo Eben Li
Haitong Ma
Jingliang Duan
Yifan Dai
B. Cheng
212
88
0
18 Mar 2021