Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2102.11513
Cited By

Mixed Policy Gradient: off-policy reinforcement learning driven jointly
by data and model

v1v2 (latest)

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

23 February 2021

Jingliang Duan

Shengbo Eben Li

Jie Li

ArXiv (abs)PDF HTML

Papers citing "Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model"

8 / 8 papers shown

Aligning Language Models with Human Preferences via a Bayesian Approach

Aligning Language Models with Human Preferences via a Bayesian ApproachNeural Information Processing Systems (NeurIPS), 2023

418

36

0

09 Oct 2023

Smoothing Policy Iteration for Zero-sum Markov Games

Smoothing Policy Iteration for Zero-sum Markov Games

Jingliang Duan

180

1

0

03 Dec 2022

Integrated Decision and Control for High-Level Automated Vehicles by
Mixed Policy Gradient and Its Experiment Verification

Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification

Shengbo Eben Li

145

2

0

19 Oct 2022

Reachability Constrained Reinforcement Learning

Reachability Constrained Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022

321

83

0

16 May 2022

Improve Generalization of Driving Policy at Signalized Intersections
with Adversarial Learning

Improve Generalization of Driving Policy at Signalized Intersections with Adversarial LearningTransportation Research Part C: Emerging Technologies (TRC), 2022

Shengbo Eben Li

Jianhua Jiang

Jingliang Duan

245

19

0

09 Apr 2022

Learn Zero-Constraint-Violation Policy in Model-Free Constrained
Reinforcement Learning

Learn Zero-Constraint-Violation Policy in Model-Free Constrained Reinforcement Learning

Changliu Liu

Shengbo Eben Li

Sifa Zheng

179

11

0

25 Nov 2021

Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring
Statewise Safety

Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety

Shegnbo Eben Li

Xiangteng Zhang

Sifa Zheng

247

47

0

22 May 2021

Integrated Decision and Control: Towards Interpretable and
Computationally Efficient Driving Intelligence

Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving IntelligenceIEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021

Shengbo Eben Li

Jingliang Duan

212

88

0

18 Mar 2021

Page 1 of 1