ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.03216
  4. Cited By
Balancing Two-Player Stochastic Games with Soft Q-Learning
v1v2 (latest)

Balancing Two-Player Stochastic Games with Soft Q-Learning

9 February 2018
Jordi Grau-Moya
Felix Leibfried
Haitham Bou-Ammar
ArXiv (abs)PDFHTML

Papers citing "Balancing Two-Player Stochastic Games with Soft Q-Learning"

23 / 23 papers shown
Variational Inference for Model-Free and Model-Based Reinforcement
  Learning
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
289
1
0
04 Sep 2022
Maximum-Entropy Multi-Agent Dynamic Games: Forward and Inverse Solutions
Maximum-Entropy Multi-Agent Dynamic Games: Forward and Inverse Solutions
Negar Mehr
Mingyu Wang
Mac Schwager
280
83
0
03 Oct 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via
  Convex Relaxation
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Chuangchuang Sun
Dong-Ki Kim
Jonathan P. How
AAML
267
24
0
14 Sep 2021
Towards General Function Approximation in Zero-Sum Markov Games
Towards General Function Approximation in Zero-Sum Markov GamesInternational Conference on Learning Representations (ICLR), 2021
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
297
49
0
30 Jul 2021
Identity Concealment Games: How I Learned to Stop Revealing and Love the
  Coincidences
Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences
Mustafa O. Karabag
Melkior Ornik
Ufuk Topcu
293
4
0
12 May 2021
Learning in Nonzero-Sum Stochastic Games with Potentials
Learning in Nonzero-Sum Stochastic Games with PotentialsInternational Conference on Machine Learning (ICML), 2021
D. Mguni
Yutong Wu
Yali Du
Yaodong Yang
Ziyi Wang
Minne Li
Ying Wen
Joel Jennings
Jun Wang
480
51
0
16 Mar 2021
Modeling the Interaction between Agents in Cooperative Multi-Agent
  Reinforcement Learning
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2021
Xiaoteng Ma
Yiqin Yang
Chenghao Li
Yiwen Lu
Qianchuan Zhao
Yang Jun
228
17
0
10 Feb 2021
A Tutorial on Sparse Gaussian Processes and Variational Inference
A Tutorial on Sparse Gaussian Processes and Variational Inference
Felix Leibfried
Vincent Dutordoir
S. T. John
N. Durrande
GP
1.5K
66
0
27 Dec 2020
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for
  Language Model Adaptation
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Minki Kang
Moonsu Han
Sung Ju Hwang
OOD
295
18
0
06 Oct 2020
Energy-based Surprise Minimization for Multi-Agent Value Factorization
Energy-based Surprise Minimization for Multi-Agent Value Factorization
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
230
1
0
16 Sep 2020
Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets
  in Chess
Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets in Chess
Nenad Tomašev
Ulrich Paquet
Demis Hassabis
Vladimir Kramnik
207
33
0
09 Sep 2020
Learning Nash Equilibria in Zero-Sum Stochastic Games via
  Entropy-Regularized Policy Approximation
Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy ApproximationInternational Joint Conference on Artificial Intelligence (IJCAI), 2020
Yue Guan
Qifan Zhang
Panagiotis Tsiotras
135
8
0
01 Sep 2020
Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch
Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch
Luca Viano
Yu-ting Huang
Parameswaran Kamalaruban
Adrian Weller
Volkan Cevher
374
36
0
02 Jul 2020
Competitive Policy Optimization
Competitive Policy Optimization
Manish Prajapat
Kamyar Azizzadenesheli
Alexander Liniger
Yisong Yue
Anima Anandkumar
262
16
0
18 Jun 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR
  Control in Active Distribution Networks
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
193
122
0
20 May 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function
  Approximation and Correlated Equilibrium
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated EquilibriumAnnual Conference Computational Learning Theory (COLT), 2020
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
522
137
0
17 Feb 2020
Improving Fictitious Play Reinforcement Learning with Expanding Models
Improving Fictitious Play Reinforcement Learning with Expanding Models
Rongjun Qin
Jing-Cheng Pang
Yang Yu
252
2
0
27 Nov 2019
$α^α$-Rank: Practically Scaling $α$-Rank through
  Stochastic Optimisation
ααα^ααα-Rank: Practically Scaling ααα-Rank through Stochastic OptimisationAdaptive Agents and Multi-Agent Systems (AAMAS), 2019
Yaodong Yang
Rasul Tutunov
Phu Sakulwongtana
Haitham Bou-Ammar
479
21
0
25 Sep 2019
Disentangled Skill Embeddings for Reinforcement Learning
Disentangled Skill Embeddings for Reinforcement Learning
Janith C. Petangoda
Sergio Pascual-Diaz
Vincent Adam
Peter Vrancx
Jordi Grau-Moya
DRLOffRL
197
17
0
21 Jun 2019
A Regularized Opponent Model with Maximum Entropy Objective
A Regularized Opponent Model with Maximum Entropy ObjectiveInternational Joint Conference on Artificial Intelligence (IJCAI), 2019
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
260
34
0
17 May 2019
Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative
  Systems
Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative SystemsAdaptive Agents and Multi-Agent Systems (AAMAS), 2019
D. Mguni
Joel Jennings
Sergio Valcarcel Macua
Emilio Sison
S. Ceppi
Enrique Munoz de Cote
212
43
0
30 Jan 2019
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized
  Recursive Reasoning
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
Ying Wen
Yaodong Yang
Rui Luo
Jun Wang
LRM
288
60
0
26 Jan 2019
An Information-Theoretic Optimality Principle for Deep Reinforcement
  Learning
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
534
24
0
06 Aug 2017
1
Page 1 of 1