Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.13590
Cited By
Suphx: Mastering Mahjong with Deep Reinforcement Learning
30 March 2020
Junjie Li
Sotetsu Koyamada
Qiwei Ye
Guoqing Liu
Chao Wang
Ruihan Yang
Li Zhao
Tao Qin
Tie-Yan Liu
H. Hon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Suphx: Mastering Mahjong with Deep Reinforcement Learning"
12 / 12 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
Nash Equilibrium and Learning Dynamics in Three-Player Matching
m
m
m
-Action Games
Yuma Fujimoto
Kaito Ariu
Kenshi Abe
24
1
0
16 Feb 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
24
2
0
09 Aug 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
42
8
0
07 Mar 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
20
13
0
01 Dec 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
11
6
0
31 Oct 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
21
0
0
24 Oct 2022
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
X. Wang
Jing Xiao
11
0
0
11 May 2022
A Fast Algorithm for Computing the Deficiency Number of a Mahjong Hand
Xueqing Yan
Yongming Li
Sanjiang Li
12
0
0
15 Aug 2021
Universal Trading for Order Execution with Oracle Policy Distillation
Yuchen Fang
Kan Ren
Weiqing Liu
Dong Zhou
Weinan Zhang
Jiang Bian
Yong Yu
Tie-Yan Liu
OffRL
8
45
0
28 Jan 2021
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
31
55
0
15 Oct 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
1