Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.12141
Cited By
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
25 July 2022
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy"
8 / 8 papers shown
Title
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
49
9
0
09 Oct 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
44
5
0
29 May 2024
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
24
5
0
22 May 2023
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
16
0
0
13 Jun 2022
On-Policy Model Errors in Reinforcement Learning
Lukas P. Frohlich
Maksym Lefarov
M. Zeilinger
Felix Berkenkamp
OnRL
49
6
0
15 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
203
43
0
06 Oct 2021
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
48
13
0
04 Oct 2021
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
30
27
0
19 Oct 2020
1