Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.03216
Cited By
v1
v2 (latest)
Balancing Two-Player Stochastic Games with Soft Q-Learning
9 February 2018
Jordi Grau-Moya
Felix Leibfried
Haitham Bou-Ammar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Balancing Two-Player Stochastic Games with Soft Q-Learning"
23 / 23 papers shown
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
289
1
0
04 Sep 2022
Maximum-Entropy Multi-Agent Dynamic Games: Forward and Inverse Solutions
Negar Mehr
Mingyu Wang
Mac Schwager
280
83
0
03 Oct 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Chuangchuang Sun
Dong-Ki Kim
Jonathan P. How
AAML
267
24
0
14 Sep 2021
Towards General Function Approximation in Zero-Sum Markov Games
International Conference on Learning Representations (ICLR), 2021
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
297
49
0
30 Jul 2021
Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences
Mustafa O. Karabag
Melkior Ornik
Ufuk Topcu
293
4
0
12 May 2021
Learning in Nonzero-Sum Stochastic Games with Potentials
International Conference on Machine Learning (ICML), 2021
D. Mguni
Yutong Wu
Yali Du
Yaodong Yang
Ziyi Wang
Minne Li
Ying Wen
Joel Jennings
Jun Wang
480
51
0
16 Mar 2021
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Xiaoteng Ma
Yiqin Yang
Chenghao Li
Yiwen Lu
Qianchuan Zhao
Yang Jun
228
17
0
10 Feb 2021
A Tutorial on Sparse Gaussian Processes and Variational Inference
Felix Leibfried
Vincent Dutordoir
S. T. John
N. Durrande
GP
1.5K
66
0
27 Dec 2020
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Minki Kang
Moonsu Han
Sung Ju Hwang
OOD
295
18
0
06 Oct 2020
Energy-based Surprise Minimization for Multi-Agent Value Factorization
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
230
1
0
16 Sep 2020
Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets in Chess
Nenad Tomašev
Ulrich Paquet
Demis Hassabis
Vladimir Kramnik
207
33
0
09 Sep 2020
Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Yue Guan
Qifan Zhang
Panagiotis Tsiotras
135
8
0
01 Sep 2020
Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch
Luca Viano
Yu-ting Huang
Parameswaran Kamalaruban
Adrian Weller
Volkan Cevher
374
36
0
02 Jul 2020
Competitive Policy Optimization
Manish Prajapat
Kamyar Azizzadenesheli
Alexander Liniger
Yisong Yue
Anima Anandkumar
262
16
0
18 Jun 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
193
122
0
20 May 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Annual Conference Computational Learning Theory (COLT), 2020
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
522
137
0
17 Feb 2020
Improving Fictitious Play Reinforcement Learning with Expanding Models
Rongjun Qin
Jing-Cheng Pang
Yang Yu
252
2
0
27 Nov 2019
α
α
α^α
α
α
-Rank: Practically Scaling
α
α
α
-Rank through Stochastic Optimisation
Adaptive Agents and Multi-Agent Systems (AAMAS), 2019
Yaodong Yang
Rasul Tutunov
Phu Sakulwongtana
Haitham Bou-Ammar
479
21
0
25 Sep 2019
Disentangled Skill Embeddings for Reinforcement Learning
Janith C. Petangoda
Sergio Pascual-Diaz
Vincent Adam
Peter Vrancx
Jordi Grau-Moya
DRL
OffRL
197
17
0
21 Jun 2019
A Regularized Opponent Model with Maximum Entropy Objective
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
260
34
0
17 May 2019
Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems
Adaptive Agents and Multi-Agent Systems (AAMAS), 2019
D. Mguni
Joel Jennings
Sergio Valcarcel Macua
Emilio Sison
S. Ceppi
Enrique Munoz de Cote
212
43
0
30 Jan 2019
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
Ying Wen
Yaodong Yang
Rui Luo
Jun Wang
LRM
288
60
0
26 Jan 2019
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
534
24
0
06 Aug 2017
1
Page 1 of 1