Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy

International Conference on Machine Learning (ICML), 2022

25 July 2022

Xiyao Wang

Wichayaporn Wongkamjan

Furong Huang

Papers citing "Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy"

17 / 17 papers shown

MrCoM: A Meta-Regularized World-Model Generalizing Across Multi-Scenarios

...

124

09 Nov 2025

Fixing That Free Lunch: When, Where, and Why Synthetic Data Fails in Model-Based Policy Optimization

Brett Barkley

David Fridovich-Keil

OffRL

152

01 Oct 2025

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

586

10 Apr 2025

Learning World Models for Unconstrained Goal Navigation

Neural Information Processing Systems (NeurIPS), 2024

Yuanlin Duan

Wensen Mao

He Zhu

231

03 Nov 2024

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Xiyao Wang

Furong Huang

294

09 Oct 2024

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Xiyao Wang

Furong Huang

279

19 Jun 2024

Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models

Zeyu Fang

Tian Lan

OffRL

363

30 May 2024

Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption

380

29 May 2024

Mind the Model, Not the Agent: The Primacy Bias in Model-based RL

European Conference on Artificial Intelligence (ECAI), 2023

Zhongjian Qiao

Jiafei Lyu

Xiu Li

237

23 Oct 2023

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

International Conference on Learning Representations (ICLR), 2023

Wichayaporn Wongkamjan

Huazhe Xu

Furong Huang

OffRL

292

11 Oct 2023

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

303

10 Oct 2023

How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization

Neural Information Processing Systems (NeurIPS), 2023

295

22 Sep 2023

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

International Conference on Machine Learning (ICML), 2023

398

05 Jun 2023

Query-Policy Misalignment in Preference-Based Reinforcement Learning

International Conference on Learning Representations (ICLR), 2023

345

27 May 2023

TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching

Conference on Learning for Dynamics & Control (L4DC), 2023

200

22 May 2023

Relative Policy-Transition Optimization for Fast Policy Transfer

AAAI Conference on Artificial Intelligence (AAAI), 2022

136

13 Jun 2022

ED2: Environment Dynamics Decomposition World Models for Continuous Control

Jianye Hao

214

06 Dec 2021