Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.05077
Cited By
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
18 July 2016
Ionel-Alexandru Hosu
Traian Rebedea
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay"
12 / 12 papers shown
Title
Imitation Learning of Correlated Policies in Stackelberg Games
Kunag-Da Wang
Ping-Chun Hsieh
Wen-Chih Peng
43
0
0
11 Mar 2025
Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning
Chen Zhang
Huan Hu
Yuan Zhou
Qiyang Cao
Ruochen Liu
Wenya Wei
Elvis S. Liu
AI4CE
21
0
0
07 Oct 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Zhiheng Xi
Wenxiang Chen
Boyang Hong
Senjie Jin
Rui Zheng
...
Xinbo Zhang
Peng Sun
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
32
20
0
08 Feb 2024
Deep Q-Network Based Decision Making for Autonomous Driving
M. Ronecker
Yuan-xian Zhu
19
32
0
21 Mar 2023
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
33
107
0
05 Apr 2022
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
D. Mguni
Taher Jafferjee
Jianhong Wang
Oliver Slumbers
Nicolas Perez Nieves
Feifei Tong
Yang Li
Jiangcheng Zhu
Yaodong Yang
Jun Wang
29
18
0
05 Dec 2021
Reinforcement Learning For Constraint Satisfaction Game Agents (15-Puzzle, Minesweeper, 2048, and Sudoku)
Anav Mehta
AI4CE
16
4
0
09 Feb 2021
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
13
4
0
17 Jun 2020
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey
Sanmit Narvekar
Bei Peng
Matteo Leonetti
Jivko Sinapov
Matthew E. Taylor
Peter Stone
ODL
137
457
0
10 Mar 2020
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration
Brahma S. Pavse
F. Torabi
Josiah P. Hanna
Garrett A. Warnell
Peter Stone
16
33
0
18 Jun 2019
Active Deep Q-learning with Demonstration
Si-An Chen
Voot Tangkaratt
Hsuan-Tien Lin
Masashi Sugiyama
11
32
0
06 Dec 2018
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
E. Liu
Kelvin Guu
Panupong Pasupat
Tianlin Shi
Percy Liang
OnRL
16
205
0
24 Feb 2018
1