ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.13590
  4. Cited By
Suphx: Mastering Mahjong with Deep Reinforcement Learning

Suphx: Mastering Mahjong with Deep Reinforcement Learning

30 March 2020
Junjie Li
Sotetsu Koyamada
Qiwei Ye
Guoqing Liu
Chao Wang
Ruihan Yang
Li Zhao
Tao Qin
Tie-Yan Liu
H. Hon
ArXivPDFHTML

Papers citing "Suphx: Mastering Mahjong with Deep Reinforcement Learning"

12 / 12 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
Nash Equilibrium and Learning Dynamics in Three-Player Matching $m$-Action Games
Nash Equilibrium and Learning Dynamics in Three-Player Matching mmm-Action Games
Yuma Fujimoto
Kaito Ariu
Kenshi Abe
24
1
0
16 Feb 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
24
2
0
09 Aug 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End
  Policy and Optimistic Smooth Fictitious Play
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
42
8
0
07 Mar 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
20
13
0
01 Dec 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
11
6
0
31 Oct 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with
  Multi-Agent Reinforcement Learning
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
21
0
0
24 Oct 2022
Efficient Distributed Framework for Collaborative Multi-Agent
  Reinforcement Learning
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
X. Wang
Jing Xiao
11
0
0
11 May 2022
A Fast Algorithm for Computing the Deficiency Number of a Mahjong Hand
A Fast Algorithm for Computing the Deficiency Number of a Mahjong Hand
Xueqing Yan
Yongming Li
Sanjiang Li
12
0
0
15 Aug 2021
Universal Trading for Order Execution with Oracle Policy Distillation
Universal Trading for Order Execution with Oracle Policy Distillation
Yuchen Fang
Kan Ren
Weiqing Liu
Dong Zhou
Weinan Zhang
Jiang Bian
Yong Yu
Tie-Yan Liu
OffRL
8
45
0
28 Jan 2021
Masked Contrastive Representation Learning for Reinforcement Learning
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
31
55
0
15 Oct 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect
  Information
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
1