Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.07804
Cited By
v1
v2 (latest)
A Game Theoretic Framework for Model Based Reinforcement Learning
International Conference on Machine Learning (ICML), 2020
16 April 2020
Aravind Rajeswaran
Igor Mordatch
Vikash Kumar
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Game Theoretic Framework for Model Based Reinforcement Learning"
50 / 80 papers shown
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
Jiayu Chen
Aravind Venugopal
Shiyu Huang
Jeff Schneider
OffRL
535
0
0
19 May 2025
Imitation Learning of Correlated Policies in Stackelberg Games
Kunag-Da Wang
Ping-Chun Hsieh
Chao-Han Huck Yang
529
0
0
11 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Hyungkyu Kang
Min-hwan Oh
OffRL
447
3
0
07 Mar 2025
Understanding World or Predicting Future? A Comprehensive Survey of World Models
ACM Computing Surveys (ACM CSUR), 2024
Jingtao Ding
Yunke Zhang
Yu Shang
Yuheng Zhang
Zefang Zong
...
Fengli Xu
Yong Li
Chen Gao
Fengli Xu
Yong Li
VGen
SyDa
641
17
0
21 Nov 2024
Scalable Reinforcement Post-Training Beyond Static Human Prompts: Evolving Alignment via Asymmetric Self-Play
Ziyu Ye
Rishabh Agarwal
Tianqi Liu
Rishabh Joshi
Sarmishta Velury
Quoc Le
Qijun Tan
Yating Liu
378
5
0
31 Oct 2024
Towards Differentiable Multilevel Optimization: A Gradient-Based Approach
Yuntian Gu
Xuzheng Chen
239
0
0
15 Oct 2024
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Changwen Zheng
Wenwen Qiang
Jianqi Zhang
Changwen Zheng
Jingyao Wang
SSL
427
0
0
19 Jul 2024
Short-Long Policy Evaluation with Novel Actions
Hyunji Alex Nam
Yash Chandak
Emma Brunskill
OffRL
382
0
0
04 Jul 2024
BiLoRA: A Bi-level Optimization Framework for Overfitting-Resilient Low-Rank Adaptation of Large Pre-trained Models
Rushi Qiang
Ruiyi Zhang
Pengtao Xie
AI4CE
196
13
0
19 Mar 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
232
3
0
29 Feb 2024
Performative Reinforcement Learning in Gradually Shifting Environments
Ben Rank
Stelios Triantafyllou
Debmalya Mandal
Goran Radanović
OffRL
458
10
0
15 Feb 2024
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments
Alexander W. Goodall
Francesco Belardinelli
OffRL
316
6
0
01 Feb 2024
Data protection psychology using game theory
Mike Nkongolo
Jahrad Sewnath
136
3
0
03 Jan 2024
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Neural Information Processing Systems (NeurIPS), 2023
Kyowoon Lee
Seongun Kim
Jaesik Choi
DiffM
331
23
0
30 Oct 2023
Behavior Alignment via Reward Function Optimization
Neural Information Processing Systems (NeurIPS), 2023
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
435
23
0
29 Oct 2023
Memory-based Controllers for Efficient Data-driven Control of Soft Robots
Yuzhe Wu
Ehsan Nekouei
122
3
0
19 Sep 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
European Conference on Artificial Intelligence (ECAI), 2023
Alexander W. Goodall
Francesco Belardinelli
301
6
0
27 Jul 2023
Learning non-Markovian Decision-Making from State-only Sequences
Neural Information Processing Systems (NeurIPS), 2023
Aoyang Qin
Feng Gao
Qing Li
Song-Chun Zhu
Sirui Xie
408
13
0
27 Jun 2023
Stackelberg Games for Learning Emergent Behaviors During Competitive Autocurricula
IEEE International Conference on Robotics and Automation (ICRA), 2023
Boling Yang
Liyuan Zheng
Lillian J. Ratliff
Byron Boots
Joshua R. Smith
226
8
0
04 May 2023
Masked Trajectory Models for Prediction, Representation, and Control
International Conference on Machine Learning (ICML), 2023
Philipp Wu
Arjun Majumdar
Kevin Stone
Yixin Lin
Igor Mordatch
Pieter Abbeel
Aravind Rajeswaran
OffRL
332
57
0
04 May 2023
Causal Semantic Communication for Digital Twins: A Generalizable Imitation Learning Approach
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2023
Christo Kurisummoottil Thomas
Walid Saad
Yong Xiao
256
38
0
25 Apr 2023
Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Mohammad Mohammadi
Jonathan Nöther
Debmalya Mandal
Adish Singla
Goran Radanović
AAML
OffRL
234
13
0
27 Feb 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
James Queeney
M. Benosman
OOD
OffRL
350
17
0
30 Jan 2023
Beyond Inverted Pendulums: Task-optimal Simple Models of Legged Locomotion
IEEE Transactions on robotics (TRO), 2023
Yu-Ming Chen
Jian-bo Hu
Michael Posa
490
11
0
05 Jan 2023
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
540
14
0
30 Nov 2022
Task-Driven Hybrid Model Reduction for Dexterous Manipulation
IEEE Transactions on robotics (TRO), 2022
Wanxin Jin
Michael Posa
468
21
0
30 Nov 2022
Learning Modular Robot Locomotion from Demonstrations
Julian Whitman
Howie Choset
238
1
0
31 Oct 2022
Learning Modular Robot Visual-motor Locomotion Policies
IEEE International Conference on Robotics and Automation (ICRA), 2022
Julian Whitman
Howie Choset
276
2
0
31 Oct 2022
Real World Offline Reinforcement Learning with Realistic Data Source
IEEE International Conference on Robotics and Automation (ICRA), 2022
G. Zhou
Liyiming Ke
S. Srinivasa
Abhi Gupta
Aravind Rajeswaran
Vikash Kumar
OffRL
325
27
0
12 Oct 2022
A Unified Framework for Alternating Offline Model Training and Policy Learning
Neural Information Processing Systems (NeurIPS), 2022
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
323
17
0
12 Oct 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
International Conference on Learning Representations (ICLR), 2022
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
449
31
0
18 Sep 2022
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
International Conference on Machine Learning (ICML), 2022
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
458
20
0
25 Jul 2022
A Survey of Decision Making in Adversarial Games
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
355
26
0
16 Jul 2022
Betty: An Automatic Differentiation Library for Multilevel Optimization
International Conference on Learning Representations (ICLR), 2022
Sang Keun Choe
Willie Neiswanger
P. Xie
Eric P. Xing
AI4CE
359
40
0
05 Jul 2022
Performative Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Debmalya Mandal
Stelios Triantafyllou
Goran Radanović
466
25
0
30 Jun 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
IEEE Transactions on Automatic Control (TAC), 2022
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
402
4
0
28 Jun 2022
A Survey on Model-based Reinforcement Learning
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
487
167
0
19 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
294
14
0
14 Jun 2022
Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization
Neural Information Processing Systems (NeurIPS), 2022
Aniket Das
Bernhard Schölkopf
Michael Muehlebach
339
10
0
07 Jun 2022
Beyond backpropagation: bilevel optimization through implicit differentiation and equilibrium propagation
Neural Computation (Neural Comput.), 2022
Nicolas Zucchet
João Sacramento
423
36
0
06 May 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Che Wang
Xufang Luo
George Andriopoulos
Dongsheng Li
OffRL
520
66
0
17 Feb 2022
A Ranking Game for Imitation Learning
Harshit S. Sikchi
Akanksha Saran
Wonjoon Goo
S. Niekum
OffRL
372
24
0
07 Feb 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
394
153
0
05 Feb 2022
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
303
6
0
07 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Sai Li
367
34
0
27 Dec 2021
Lyapunov Exponents for Diversity in Differentiable Games
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Jonathan Lorraine
Paul Vicol
Jack Parker-Holder
Tal Kachman
Luke Metz
Jakob N. Foerster
219
9
0
24 Dec 2021
On Effective Scheduling of Model-based Reinforcement Learning
Hang Lai
Jian Shen
Weinan Zhang
Yimin Huang
Xingzhi Zhang
Ruiming Tang
Yong Yu
Zhenguo Li
325
22
0
16 Nov 2021
Robot Learning from Randomized Simulations: A Review
Frontiers in Robotics and AI (Front. Robot. AI), 2021
Fabio Muratore
Fabio Ramos
Greg Turk
Wenhao Yu
Michael Gienger
Jan Peters
AI4CE
448
130
0
01 Nov 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
556
52
0
06 Oct 2021
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
291
31
0
29 Sep 2021
1
2
Next
Page 1 of 2