ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.12141
  4. Cited By
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy

Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy

25 July 2022
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
ArXivPDFHTML

Papers citing "Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy"

8 / 8 papers shown
Title
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge
  with Curriculum Preference Learning
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
49
9
0
09 Oct 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with
  Uncertainty-Aware Rollout Adaption
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
44
5
0
29 May 2024
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
24
5
0
22 May 2023
Relative Policy-Transition Optimization for Fast Policy Transfer
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
16
0
0
13 Jun 2022
On-Policy Model Errors in Reinforcement Learning
On-Policy Model Errors in Reinforcement Learning
Lukas P. Frohlich
Maksym Lefarov
M. Zeilinger
Felix Berkenkamp
OnRL
49
6
0
15 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
203
43
0
06 Oct 2021
Large Batch Experience Replay
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
48
13
0
04 Oct 2021
Model-based Policy Optimization with Unsupervised Model Adaptation
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
30
27
0
19 Oct 2020
1