A Game Theoretic Framework for Model Based Reinforcement Learning

International Conference on Machine Learning (ICML), 2020
16 April 2020
Aravind Rajeswaran
Igor Mordatch
Vikash Kumar
    OffRL
arXiv: 2004.07804

Papers citing "A Game Theoretic Framework for Model Based Reinforcement Learning"

50 / 80 papers shown
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
Jiayu Chen
Aravind Venugopal
Shiyu Huang
Jeff Schneider
OffRL
535
0
0
19 May 2025
Imitation Learning of Correlated Policies in Stackelberg Games
Kuang-Da Wang
Ping-Chun Hsieh
Chao-Han Huck Yang
529
0
0
11 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Hyungkyu Kang
Min-hwan Oh
OffRL
447
3
0
07 Mar 2025
Understanding World or Predicting Future? A Comprehensive Survey of World Models
ACM Computing Surveys (ACM CSUR), 2024
Jingtao Ding
Yunke Zhang
Yu Shang
Yuheng Zhang
Zefang Zong
...
Fengli Xu
Yong Li
Chen Gao
VGen
SyDa
641
17
0
21 Nov 2024
Scalable Reinforcement Post-Training Beyond Static Human Prompts: Evolving Alignment via Asymmetric Self-Play
Ziyu Ye
Rishabh Agarwal
Tianqi Liu
Rishabh Joshi
Sarmishta Velury
Quoc Le
Qijun Tan
Yating Liu
378
5
0
31 Oct 2024
Towards Differentiable Multilevel Optimization: A Gradient-Based Approach
Yuntian Gu
Xuzheng Chen
239
0
0
15 Oct 2024
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Changwen Zheng
Wenwen Qiang
Jianqi Zhang
Jingyao Wang
SSL
427
0
0
19 Jul 2024
Short-Long Policy Evaluation with Novel Actions
Hyunji Alex Nam
Yash Chandak
Emma Brunskill
OffRL
382
0
0
04 Jul 2024
BiLoRA: A Bi-level Optimization Framework for Overfitting-Resilient Low-Rank Adaptation of Large Pre-trained Models
Rushi Qiang
Ruiyi Zhang
Pengtao Xie
AI4CE
196
13
0
19 Mar 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
232
3
0
29 Feb 2024
Performative Reinforcement Learning in Gradually Shifting Environments
Ben Rank
Stelios Triantafyllou
Debmalya Mandal
Goran Radanović
OffRL
458
10
0
15 Feb 2024
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments
Alexander W. Goodall
Francesco Belardinelli
OffRL
316
6
0
01 Feb 2024
Data protection psychology using game theory
Mike Nkongolo
Jahrad Sewnath
136
3
0
03 Jan 2024
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Neural Information Processing Systems (NeurIPS), 2023
Kyowoon Lee
Seongun Kim
Jaesik Choi
DiffM
331
23
0
30 Oct 2023
Behavior Alignment via Reward Function Optimization
Neural Information Processing Systems (NeurIPS), 2023
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
435
23
0
29 Oct 2023
Memory-based Controllers for Efficient Data-driven Control of Soft Robots
Yuzhe Wu
Ehsan Nekouei
122
3
0
19 Sep 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
European Conference on Artificial Intelligence (ECAI), 2023
Alexander W. Goodall
Francesco Belardinelli
301
6
0
27 Jul 2023
Learning non-Markovian Decision-Making from State-only Sequences
Neural Information Processing Systems (NeurIPS), 2023
Aoyang Qin
Feng Gao
Qing Li
Song-Chun Zhu
Sirui Xie
408
13
0
27 Jun 2023
Stackelberg Games for Learning Emergent Behaviors During Competitive Autocurricula
IEEE International Conference on Robotics and Automation (ICRA), 2023
Boling Yang
Liyuan Zheng
Lillian J. Ratliff
Byron Boots
Joshua R. Smith
226
8
0
04 May 2023
Masked Trajectory Models for Prediction, Representation, and Control
International Conference on Machine Learning (ICML), 2023
Philipp Wu
Arjun Majumdar
Kevin Stone
Yixin Lin
Igor Mordatch
Pieter Abbeel
Aravind Rajeswaran
OffRL
332
57
0
04 May 2023
Causal Semantic Communication for Digital Twins: A Generalizable Imitation Learning Approach
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2023
Christo Kurisummoottil Thomas
Walid Saad
Yong Xiao
256
38
0
25 Apr 2023
Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Mohammad Mohammadi
Jonathan Nöther
Debmalya Mandal
Adish Singla
Goran Radanović
AAML
OffRL
234
13
0
27 Feb 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
James Queeney
M. Benosman
OOD
OffRL
350
17
0
30 Jan 2023
Beyond Inverted Pendulums: Task-optimal Simple Models of Legged Locomotion
IEEE Transactions on Robotics (TRO), 2023
Yu-Ming Chen
Jian-bo Hu
Michael Posa
490
11
0
05 Jan 2023
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
540
14
0
30 Nov 2022
Task-Driven Hybrid Model Reduction for Dexterous Manipulation
IEEE Transactions on Robotics (TRO), 2022
Wanxin Jin
Michael Posa
468
21
0
30 Nov 2022
Learning Modular Robot Locomotion from Demonstrations
Julian Whitman
Howie Choset
238
1
0
31 Oct 2022
Learning Modular Robot Visual-motor Locomotion Policies
IEEE International Conference on Robotics and Automation (ICRA), 2022
Julian Whitman
Howie Choset
276
2
0
31 Oct 2022
Real World Offline Reinforcement Learning with Realistic Data Source
IEEE International Conference on Robotics and Automation (ICRA), 2022
G. Zhou
Liyiming Ke
S. Srinivasa
Abhi Gupta
Aravind Rajeswaran
Vikash Kumar
OffRL
325
27
0
12 Oct 2022
A Unified Framework for Alternating Offline Model Training and Policy Learning
Neural Information Processing Systems (NeurIPS), 2022
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
323
17
0
12 Oct 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
International Conference on Learning Representations (ICLR), 2022
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
449
31
0
18 Sep 2022
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
International Conference on Machine Learning (ICML), 2022
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
458
20
0
25 Jul 2022
A Survey of Decision Making in Adversarial Games
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
355
26
0
16 Jul 2022
Betty: An Automatic Differentiation Library for Multilevel Optimization
International Conference on Learning Representations (ICLR), 2022
Sang Keun Choe
Willie Neiswanger
P. Xie
Eric P. Xing
AI4CE
359
40
0
05 Jul 2022
Performative Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Debmalya Mandal
Stelios Triantafyllou
Goran Radanović
466
25
0
30 Jun 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
IEEE Transactions on Automatic Control (TAC), 2022
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
402
4
0
28 Jun 2022
A Survey on Model-based Reinforcement Learning
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
487
167
0
19 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
294
14
0
14 Jun 2022
Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization
Neural Information Processing Systems (NeurIPS), 2022
Aniket Das
Bernhard Schölkopf
Michael Muehlebach
339
10
0
07 Jun 2022
Beyond backpropagation: bilevel optimization through implicit differentiation and equilibrium propagation
Neural Computation (Neural Comput.), 2022
Nicolas Zucchet
João Sacramento
423
36
0
06 May 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Che Wang
Xufang Luo
George Andriopoulos
Dongsheng Li
OffRL
520
66
0
17 Feb 2022
A Ranking Game for Imitation Learning
Harshit S. Sikchi
Akanksha Saran
Wonjoon Goo
S. Niekum
OffRL
372
24
0
07 Feb 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
394
153
0
05 Feb 2022
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
303
6
0
07 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Sai Li
367
34
0
27 Dec 2021
Lyapunov Exponents for Diversity in Differentiable Games
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Jonathan Lorraine
Paul Vicol
Jack Parker-Holder
Tal Kachman
Luke Metz
Jakob N. Foerster
219
9
0
24 Dec 2021
On Effective Scheduling of Model-based Reinforcement Learning
Hang Lai
Jian Shen
Weinan Zhang
Yimin Huang
Xingzhi Zhang
Ruiming Tang
Yong Yu
Zhenguo Li
325
22
0
16 Nov 2021
Robot Learning from Randomized Simulations: A Review
Frontiers in Robotics and AI (Front. Robot. AI), 2021
Fabio Muratore
Fabio Ramos
Greg Turk
Wenhao Yu
Michael Gienger
Jan Peters
AI4CE
448
130
0
01 Nov 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
556
52
0
06 Oct 2021
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
291
31
0
29 Sep 2021