Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.04376
Cited By
RLCard: A Toolkit for Reinforcement Learning in Card Games
10 October 2019
Daochen Zha
Kwei-Herng Lai
Yuanpu Cao
Songyi Huang
Ruzhe Wei
Junyu Guo
Xia Hu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RLCard: A Toolkit for Reinforcement Learning in Card Games"
31 / 31 papers shown
Title
PolicyEvol-Agent: Evolving Policy via Environment Perception and Self-Awareness with Theory of Mind
Yajie Yu
Yue Feng
LLMAG
26
0
0
20 Apr 2025
A Survey on the Optimization of Large Language Model-based Agents
Shangheng Du
Jiabao Zhao
Jinxin Shi
Zhentao Xie
Xin Jiang
Yanhong Bai
Liang He
LLMAG
LM&Ro
LM&MA
185
0
0
16 Mar 2025
Imitation Learning of Correlated Policies in Stackelberg Games
Kunag-Da Wang
Ping-Chun Hsieh
Wen-Chih Peng
43
0
0
11 Mar 2025
AI-driven control of bioelectric signalling for real-time topological reorganization of cells
Gonçalo Hora de Carvalho
AI4CE
43
0
0
10 Mar 2025
Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search
Jiamian Li
20
0
0
15 Oct 2024
EgoSocialArena: Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective
Guiyang Hou
Wenqi Zhang
Yongliang Shen
Zeqi Tan
Sihao Shen
Weiming Lu
31
0
0
08 Oct 2024
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information
Yauwai Yim
Chunkit Chan
Tianyu Shi
Zheye Deng
Wei Fan
Tianshi Zheng
Yangqiu Song
LLMAG
26
9
0
05 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding
Chang Lei
Huan Lei
20
0
0
14 Jul 2024
Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning
Yadong Zhang
Shaoguang Mao
Wenshan Wu
Yan Xia
Tao Ge
Man Lan
Furu Wei
48
2
0
08 Jul 2024
UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models
Zhanyue Qin
Haochuan Wang
Deyuan Liu
Ziyang Song
Cunhang Fan
...
Zhen Lei
Zhiying Tu
Dianhui Chu
Xiaoyan Yu
Dianbo Sui
ELM
LRM
54
1
0
24 Jun 2024
PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
Martin Balla
G. E. Long
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
OffRL
GP
13
1
0
28 May 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Wenqi Zhang
Ke Tang
Hai Wu
Mengna Wang
Yongliang Shen
Guiyang Hou
Zeqi Tan
Peng Li
Y. Zhuang
Weiming Lu
LLMAG
31
36
0
27 Feb 2024
Two-Step Reinforcement Learning for Multistage Strategy Card Game
Konrad Godlewski
B. Sawicki
OffRL
22
0
0
29 Nov 2023
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
Jiaxian Guo
Bo Yang
Paul D. Yoo
Bill Yuchen Lin
Yusuke Iwasawa
Yutaka Matsuo
LLMAG
13
40
0
29 Sep 2023
PyTAG: Challenges and Opportunities for Reinforcement Learning in Tabletop Games
Martin Balla
G. E. Long
Dominik Jeurissen
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
LMTD
OffRL
OnRL
17
1
0
19 Jul 2023
Towards Personalized Preprocessing Pipeline Search
Diego Martinez
Daochen Zha
Qiaoyu Tan
Xia Hu
AI4TS
21
2
0
28 Feb 2023
Hearts Gym: Learning Reinforcement Learning as a Team Event
Jana Ebert
Danimir T. Doncevic
R. Kloss
Stefan Kesselheim
OffRL
19
0
0
07 Sep 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
17
26
0
30 Mar 2022
Automatic Meta-Path Discovery for Effective Graph-Based Recommendation
Wentao Ning
Reynold Cheng
Jiajun Shen
Nur Al Hasan Haldar
B. Kao
Xiao Yan
Nan Huo
Wai Lam
Tian Li
Bo Tang
24
18
0
23 Dec 2021
Playing 2048 With Reinforcement Learning
Shilun Li
Veronica Peng
17
0
0
20 Oct 2021
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
55
20
0
05 Jul 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
14
116
0
11 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
20
15
0
10 Jun 2021
Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments
Daochen Zha
Wenye Ma
Lei Yuan
Xia Hu
Ji Liu
14
43
0
20 Jan 2021
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research
Kai Li
Hang Xu
Enmin Zhao
Zhe Wu
Junliang Xing
VLM
8
0
0
11 Dec 2020
PettingZoo: Gym for Multi-Agent Reinforcement Learning
J. K. Terry
Benjamin Black
Nathaniel Grammel
Mario Jayakumar
Ananth Hari
...
Caroline Horsch
Clemens Dieffendahl
Niall L. Williams
Yashas Lokesh
Praveen Ravi
OffRL
14
270
0
30 Sep 2020
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Mingyang Wan
X. Hu
11
53
0
16 Sep 2020
Policy-GNN: Aggregation Optimization for Graph Neural Networks
Kwei-Herng Lai
Daochen Zha
Kaixiong Zhou
Xia Hu
14
90
0
26 Jun 2020
Dual Policy Distillation
Kwei-Herng Lai
Daochen Zha
Yuening Li
Xia Hu
OffRL
11
10
0
07 Jun 2020
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
264
5,326
0
05 Nov 2016
1