Empirical Design in Reinforcement Learning

3 April 2023
Andrew Patterson
Samuel Neumann
Martha White
Adam White

Papers citing "Empirical Design in Reinforcement Learning"

22 / 22 papers shown
Learning to Reason Efficiently with Discounted Reinforcement Learning
Alex Ayoub
Kavosh Asadi
Dale Schuurmans
Csaba Szepesvári
Karim Bouyarmane
OffRL, LRM
136
0
0
27 Oct 2025
The Formalism-Implementation Gap in Reinforcement Learning Research
Pablo Samuel Castro
OffRL
166
0
0
17 Oct 2025
Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards
Aaron Tu
Weihao Xuan
Heli Qi
X. Y. Huang
Qingcheng Zeng
...
Amin Saberi
Naoto Yokoya
Jure Leskovec
Yejin Choi
Fang Wu
OffRL
157
3
0
26 Sep 2025
Learning Robust Penetration-Testing Policies under Partial Observability: A systematic evaluation
Raphael Simon
Pieter Libin
Wim Mees
OffRL, AAML
124
0
0
24 Sep 2025
Deep Reinforcement Learning with Gradient Eligibility Traces
Esraa Elelimy
Brett Daley
Andrew Patterson
Marlos C. Machado
Adam White
Martha White
OffRL
131
2
0
12 Jul 2025
Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces
Jiamin He
A. Rupam Mahmood
Martha White
102
0
0
19 Jun 2025
Monotone and Conservative Policy Iteration Beyond the Tabular Case
Eshwar S. R.
Gugan Thoppe
Ananyabrata Barua
Aditya Gopalan
Gal Dalal
165
1
0
08 Jun 2025
Calibrated Value-Aware Model Learning with Probabilistic Environment Models
C. Voelcker
Anastasiia Pedan
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
193
0
0
28 May 2025
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Christian Schroeder de Witt
Matthias Bethge
ReLM, ALM, LRM
602
67
0
09 Apr 2025
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL, LRM
466
0
0
08 Apr 2025
A Research Agenda for Usability and Generalisation in Reinforcement Learning
Dennis J. N. J. Soemers
Spyridon Samothrakis
Kurt Driessens
M. Winands
OffRL
398
1
0
22 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
International Conference on Learning Representations (ICLR), 2024
Eliot Xing
Vernon Luk
Jean Oh
392
10
0
16 Dec 2024
Diversity Progress for Goal Selection in Discriminability-Motivated RL
Erik M. Lintunen
Nadia M. Ady
Christian Guckelsberger
206
2
0
03 Nov 2024
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
328
6
0
11 Oct 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
186
2
0
26 Jul 2024
Position: Benchmarking is Limited in Reinforcement Learning Research
Scott M. Jordan
Adam White
Bruno Castro da Silva
Martha White
Philip S. Thomas
OffRL
114
13
0
23 Jun 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
249
21
0
05 Feb 2024
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Neural Information Processing Systems (NeurIPS), 2023
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
Rohin Shah
302
7
0
05 Dec 2023
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
Antonin Raffin
Olivier Sigaud
Jens Kober
Alin Albu-Schäffer
João Silvério
F. Stulp
177
4
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL, OnRL
259
7
0
09 Oct 2023
Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?
Basavasagar Patil
Daniel S. Brown
440
0
0
22 Jun 2023
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
Neurons, Behavior, Data analysis, and Theory (NBDT), 2022
Burcu Küçükoglu
Walraaf Borkent
Bodo Rueckauer
Nasir Ahmad
Umut Güçlü
Marcel van Gerven
256
2
0
11 Nov 2022