ResearchTrend.AI
Learning to Steer Markovian Agents under Model Uncertainty


14 July 2024
Jiawei Huang
Vinzenz Thoma
Zebang Shen
H. Nax
Niao He

Papers citing "Learning to Steer Markovian Agents under Model Uncertainty"

8 / 8 papers shown
Steering No-Regret Agents in MFGs under Model Uncertainty
Leo Widmer
Jiawei Huang
Niao He
LLMSV
59
0
0
12 Mar 2025
Stochastic Bilevel Optimization with Lower-Level Contextual Markov Decision Processes
Vinzenz Thoma
Barna Pásztor
Andreas Krause
Giorgia Ramponi
Yifan Hu
26
1
0
03 Jun 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
32
14
0
10 Feb 2024
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL
Jiawei Huang
Niao He
Andreas Krause
24
6
0
08 Feb 2024
Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games
Batuhan Yardim
Semih Cayci
M. Geist
Niao He
48
27
0
29 Dec 2022
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence
Boyi Liu
Jiayang Li
Zhuoran Yang
Hoi-To Wai
Mingyi Hong
Y. Nie
Zhaoran Wang
41
18
0
04 Oct 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
51
158
0
11 Jan 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
98
99
0
15 Oct 2019