What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study

10 June 2020

Sertan Girgin

Olivier Pietquin

Olivier Bachem

Papers citing "What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study"

50 / 136 papers shown

An Invitation to Deep Reinforcement Learning

Bernhard Jaeger

Andreas Geiger

OffRL OOD

497

13 Dec 2023

Guaranteed Trust Region Optimization via Two-Phase KL Penalization

170

08 Dec 2023

Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods

Zhengpeng Xie

Changdong Yu

Weizheng Qiao

401

31 Oct 2023

LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

Neural Information Processing Systems (NeurIPS), 2023

384

12 Oct 2023

Reinforcement Learning for Node Selection in Branch-and-Bound

Alexander Mattick

Christopher Mutschler

294

29 Sep 2023

HyperPPO: A scalable method for finding small policies for robotic control

IEEE International Conference on Robotics and Automation (ICRA), 2023

Luming Tang

Zhehui Huang

Gaurav Sukhatme

234

28 Sep 2023

Evaluation of Constrained Reinforcement Learning Algorithms for Legged Locomotion

Marco Hutter

213

27 Sep 2023

Reward Function Design for Crowd Simulation via Reinforcement Learning

Motion in Games (MiG), 2023

153

22 Sep 2023

Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension

Miguel Abreu

Luis Paulo Reis

Nuno Lau

311

06 Sep 2023

Commodities Trading through Deep Policy Gradient Methods

Jonas Hanetho

151

10 Aug 2023

Deep Reinforcement Learning for Autonomous Spacecraft Inspection using Illumination

154

04 Aug 2023

Benchmarking Potential Based Rewards for Learning Humanoid Locomotion

IEEE International Conference on Robotics and Automation (ICRA), 2023

155

19 Jul 2023

Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules

IEEE Access, 2023

116

30 Jun 2023

The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions

Physical Review X (PRX), 2023

Nishil Patel

Sebastian Lee

Stefano Sarao Mannelli

Sebastian Goldt

Andrew Saxe

OffRL

455

17 Jun 2023

Emergent Agentic Transformer from Chain of Hindsight Experience

International Conference on Machine Learning (ICML), 2023

Hao Liu

Pieter Abbeel

OffRL

269

26 May 2023

RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning

Alexander Scarlatos

Andrew Lan

OffRL LRM

268

23 May 2023

INVICTUS: Optimizing Boolean Logic Circuit Synthesis via Synergistic Learning and Search

211

22 May 2023

Policy Gradient Algorithms Implicitly Optimize by Continuation

Adrien Bolland

Gilles Louppe

D. Ernst

287

11 May 2023

DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards

International Joint Conference on Artificial Intelligence (IJCAI), 2023

137

21 Apr 2023

Aiding reinforcement learning for set point control

IFAC-PapersOnLine, 2023

Ruoqing Zhang

Per Mattsson

T. Wigren

188

20 Apr 2023

Robust nonlinear set-point control with reinforcement learning

American Control Conference (ACC), 2023

150

20 Apr 2023

Tracker: Model-based Reinforcement Learning for Tracking Control of Human Finger Attached with Thin McKibben Muscles

IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023

117

01 Apr 2023

Autonomous Blimp Control via H-infinity Robust Deep Residual Reinforcement Learning

Yang Zuo

Y. Liu

Aamir Ahmad

24 Mar 2023

Order Matters: Agent-by-agent Policy Optimization

International Conference on Learning Representations (ICLR), 2023

323

13 Feb 2023

Planning Multiple Epidemic Interventions with Reinforcement Learning

International Joint Conference on Artificial Intelligence (IJCAI), 2023

267

30 Jan 2023

Mastering Diverse Domains through World Models

Danijar Hafner

J. Pašukonis

Jimmy Ba

Timothy Lillicrap

369

862

10 Jan 2023

Transformers as Policies for Variable Action Environments

Niklas Zwingenberger

113

09 Jan 2023

Backward Curriculum Reinforcement Learning

IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2022

Kyungmin Ko

OnRL

214

29 Dec 2022

Explainable and Safe Reinforcement Learning for Autonomous Air Mobility

131

24 Nov 2022

Reinforcement learning for traffic signal control in hybrid action space

Haoqing Luo

Sheng Jin

176

23 Nov 2022

Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization

Neurons, Behavior, Data analysis, and Theory (NBDT), 2022

265

11 Nov 2022

Understanding the Evolution of Linear Regions in Deep Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022

172

24 Oct 2022

On Many-Actions Policy Gradient

International Conference on Machine Learning (ICML), 2022

Michal Nauman

Marek Cygan

350

24 Oct 2022

Climate Change Policy Exploration using Reinforcement Learning

Theodore Wolf

114

23 Oct 2022

The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022

197

16 Oct 2022

GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022

364

07 Oct 2022

Towards a Standardised Performance Evaluation Protocol for Cooperative MARL

Neural Information Processing Systems (NeurIPS), 2022

231

21 Sep 2022

Understanding reinforcement learned crowds

Computers & Graphics (Comput. Graph.), 2022

130

19 Sep 2022

Grounding Aleatoric Uncertainty for Unsupervised Environment Design

Neural Information Processing Systems (NeurIPS), 2022

361

11 Jul 2022

Efficient entity-based reinforcement learning

128

06 Jun 2022

Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks

Pashootan Vaezipoor

140

03 Jun 2022

Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search

International Conference on Learning Representations (ICLR), 2022

568

01 Jun 2022

Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning

Computer Vision and Pattern Recognition (CVPR), 2022

190

29 May 2022

An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments

International Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2022

Alain Andres

Esther Villar-Rodriguez

Javier Del Ser

189

23 May 2022

Reinforcement Learning Policy Recommendation for Interbank Network Stability

Journal of Financial Stability (JFS), 2022

Alessio Brini

G. Tedeschi

Daniele Tantari

169

14 Apr 2022

Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability

Jie Tan

244

08 Apr 2022

Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task

Adaptive Behavior (AB), 2022

354

11 Mar 2022

Learning Torque Control for Quadrupedal Locomotion

IEEE-RAS International Conference on Humanoid Robots (Humanoids), 2022

254

10 Mar 2022

A Survey on Reinforcement Learning Methods in Character Animation

264

07 Mar 2022

You May Not Need Ratio Clipping in PPO

206

31 Jan 2022