Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning

International Conference on Advanced Robotics and Mechatronics (ICARM), 2020
13 November 2020
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
    OffRL

Papers citing "Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning"

5 / 5 papers shown
Incentivizing Consistent, Effective and Scalable Reasoning Capability in Audio LLMs via Reasoning Process Rewards
Jiajun Fan
Roger Ren
Jingyuan Li
R. Pandey
Prashanth Gurunath Shivakumar
I. Bulyko
Ankur Gandhe
Ge Liu
Yile Gu
LRM
235
2
0
23 Oct 2025
Fine-tuning Flow Matching Generative Models with Intermediate Feedback
Jiajun Fan
Chaoran Cheng
Shuaike Shen
Xiangxin Zhou
Ge Liu
EGVM
239
2
0
20 Oct 2025
Recent Advances in Path Integral Control for Trajectory Optimization: An Overview in Theoretical and Algorithmic Perspectives
Annual Reviews in Control (ARC), 2023
Muhammad Kazim
JunGee Hong
Min-Gyeom Kim
Kwang-Ki K. Kim
332
36
0
22 Sep 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
International Conference on Learning Representations (ICLR), 2023
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Sijin Yu
Jiangcheng Zhu
Hao Wang
Shutao Xia
270
24
0
09 May 2023
A Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges, and Solutions
Jiajun Fan
OffRL
378
24
0
08 Dec 2021