ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.11746
  4. Cited By
Configurable Mirror Descent: Towards a Unification of Decision Making

Configurable Mirror Descent: Towards a Unification of Decision Making

20 May 2024
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Shuyue Hu
Xiao Huang
Hau Chan
Bo An
ArXivPDFHTML

Papers citing "Configurable Mirror Descent: Towards a Unification of Decision Making"

7 / 7 papers shown
Title
Improving Thompson Sampling via Information Relaxation for Budgeted
  Multi-armed Bandits
Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits
Woojin Jeong
Seungki Min
38
0
0
28 Aug 2024
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum
  Markov Games
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
41
35
0
03 Oct 2022
Student of Games: A unified learning algorithm for both perfect and
  imperfect information games
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
19
20
0
06 Dec 2021
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter
  Optimization
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization
Marius Lindauer
Katharina Eggensperger
Matthias Feurer
André Biedenkapp
Difan Deng
C. Benjamins
Tim Ruhopf
René Sass
Frank Hutter
83
323
0
20 Sep 2021
EMVLight: A Decentralized Reinforcement Learning Framework for Efficient
  Passage of Emergency Vehicles
EMVLight: A Decentralized Reinforcement Learning Framework for Efficient Passage of Emergency Vehicles
Haoran Su
Yaofeng Desmond Zhong
Biswadip Dey
Amit Chakraborty
46
18
0
12 Sep 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan DÓrazio
Neil Burch
Martin Schmid
Michael H. Bowling
Marc Lanctot
35
22
0
11 Jan 2021
Input Convex Neural Networks
Input Convex Neural Networks
Brandon Amos
Lei Xu
J. Zico Kolter
169
596
0
22 Sep 2016
1