ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.11513
  4. Cited By
Mixed Policy Gradient: off-policy reinforcement learning driven jointly
  by data and model

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

23 February 2021
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
    OffRL
ArXivPDFHTML

Papers citing "Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model"

8 / 8 papers shown
Title
Aligning Language Models with Human Preferences via a Bayesian Approach
Aligning Language Models with Human Preferences via a Bayesian Approach
Jiashuo Wang
Haozhao Wang
Shichao Sun
Wenjie Li
ALM
32
22
0
09 Oct 2023
Smoothing Policy Iteration for Zero-sum Markov Games
Smoothing Policy Iteration for Zero-sum Markov Games
Yangang Ren
Yao Lyu
Wenxuan Wang
Sheng Li
Zeyang Li
Jingliang Duan
31
1
0
03 Dec 2022
Integrated Decision and Control for High-Level Automated Vehicles by
  Mixed Policy Gradient and Its Experiment Verification
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
18
0
0
19 Oct 2022
Reachability Constrained Reinforcement Learning
Reachability Constrained Reinforcement Learning
Dongjie Yu
Haitong Ma
Sheng Li
Jianyu Chen
63
54
0
16 May 2022
Improve Generalization of Driving Policy at Signalized Intersections
  with Adversarial Learning
Improve Generalization of Driving Policy at Signalized Intersections with Adversarial Learning
Yangang Ren
Guojian Zhan
Liye Tang
Shengbo Eben Li
Jianhua Jiang
Jingliang Duan
AAML
11
10
0
09 Apr 2022
Learn Zero-Constraint-Violation Policy in Model-Free Constrained
  Reinforcement Learning
Learn Zero-Constraint-Violation Policy in Model-Free Constrained Reinforcement Learning
Haitong Ma
Changliu Liu
Shengbo Eben Li
Sifa Zheng
Wen Sun
Jianyu Chen
27
11
0
25 Nov 2021
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring
  Statewise Safety
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety
Haitong Ma
Yang Guan
Shegnbo Eben Li
Xiangteng Zhang
Sifa Zheng
Jianyu Chen
40
37
0
22 May 2021
Integrated Decision and Control: Towards Interpretable and
  Computationally Efficient Driving Intelligence
Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence
Yang Guan
Yangang Ren
Qi Sun
Shengbo Eben Li
Haitong Ma
Jingliang Duan
Yifan Dai
B. Cheng
8
65
0
18 Mar 2021
1