The Arcade Learning Environment: An Evaluation Platform for General Agents

19 July 2012
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling

Papers citing "The Arcade Learning Environment: An Evaluation Platform for General Agents"

43 / 43 papers shown
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
107
1
0
26 Mar 2025
Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs
Nicolas Le Roux
Marc G. Bellemare
Jonathan Lebensold
Arnaud Bergeron
Joshua Greaves
Alex Fréchette
Carolyne Pelletier
Eric Thibodeau-Laufer
Sándor Toth
Sam Work
OffRL
112
5
0
18 Mar 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
128
1
0
20 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
336
0
0
17 Feb 2025
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Bryan L. M. de Oliveira
Murilo L. da Luz
Bruno Brandão
Luana G. B. Martins
Telma W. de L. Soares
Luckeciano C. Melo
OffRL
96
1
0
17 Feb 2025
Beyond Interpolation: Extrapolative Reasoning with Reinforcement Learning and Graph Neural Networks
Niccolò Grillo
Andrea Toccaceli
Joël Mathys
Benjamin Estermann
Stefania Fresca
Roger Wattenhofer
AI4CE
LRM
172
0
0
06 Feb 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
120
16
0
28 Jan 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
68
0
0
24 Jan 2025
Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
58
1
0
23 Jan 2025
CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic
Huaiyuan Yao
Longchao Da
Vishnu Nandam
Justin Turnau
Zhiwei Liu
Linsey Pang
Hua Wei
LLMAG
77
6
0
10 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
101
0
0
03 Jan 2025
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wołczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktäschel
LLMAG
LRM
152
16
0
20 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
82
1
0
06 Nov 2024
Soft Condorcet Optimization for Ranking of General Agents
Marc Lanctot
Kate Larson
Michael Kaisers
Quentin Berthet
I. Gemp
Manfred Diaz
Roberto-Rafael Maura-Rivero
Yoram Bachrach
Anna Koop
Doina Precup
135
0
0
31 Oct 2024
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Pöppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
161
5
0
29 Oct 2024
Fourier Head: Helping Large Language Models Learn Complex Probability Distributions
Nate Gillman
Daksh Aggarwal
Michael Freeman
Saurabh Singh
Chen Sun
AI4TS
60
3
0
29 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
98
8
0
23 Oct 2024
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
Devendra Singh Dhami
Kristian Kersting
57
3
0
15 Oct 2024
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
95
2
0
10 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
237
5
0
02 Oct 2024
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Alihan Hüyük
A. R. Koblitz
Atefeh Mohajeri
M. Andrews
OffRL
52
0
0
19 Sep 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
70
6
0
06 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
101
3
0
18 Jul 2024
Gradient Boosting Reinforcement Learning
Benjamin Fuhrer
Chen Tessler
Gal Dalal
OffRL
AI4CE
93
3
0
11 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
78
21
0
05 Jul 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
105
2
0
15 Jun 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
94
2
0
30 May 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
90
0
0
29 May 2024
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Cong Lu
Shengran Hu
Jeff Clune
LLMAG
57
10
0
24 May 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Fatih Ilhan
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
88
55
0
02 Apr 2024
Return-Aligned Decision Transformer
Tsunehiko Tanaka
Kenshi Abe
Kaito Ariu
Tetsuro Morimura
Edgar Simo-Serra
OffRL
96
1
0
06 Feb 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
102
5
0
13 Dec 2023
Life-inspired Interoceptive Artificial Intelligence for Autonomous and Adaptive Agents
Sungwoo Lee
Younghyun Oh
Hyunhoe An
Hyebhin Yoon
K. Friston
Seok Jun Hong
Choong-Wan Woo
AI4CE
86
1
0
12 Sep 2023
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
63
13
0
13 Feb 2023
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
54
17
0
11 Aug 2020
Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems
Yue Wang
Esha Sarkar
Wenqing Li
Michail Maniatakos
Saif Eddin Jabari
AAML
96
62
0
17 Mar 2020
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
44
123
0
13 Feb 2018
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
33
36
0
20 May 2017
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
Nat Dilokthanakul
Christos Kaplanis
Nick Pawlowski
Murray Shanahan
41
92
0
18 May 2017
Deep Semi-Random Features for Nonlinear Function Approximation
Kenji Kawaguchi
Bo Xie
Vikas Verma
Le Song
104
15
0
28 Feb 2017
Collaborative Deep Reinforcement Learning
Kaixiang Lin
Shu Wang
Jiayu Zhou
40
21
0
19 Feb 2017
An Approximation of the Universal Intelligence Measure
Shane Legg
J. Veness
45
72
0
27 Sep 2011
Measuring Intelligence through Games
Tom Schaul
Julian Togelius
Jürgen Schmidhuber
ELM
AI4CE
64
54
0
06 Sep 2011