ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.06893
  4. Cited By
A Study on Overfitting in Deep Reinforcement Learning

A Study on Overfitting in Deep Reinforcement Learning

18 April 2018
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "A Study on Overfitting in Deep Reinforcement Learning"

49 / 49 papers shown
Title
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
64
3
0
17 Feb 2025
Evolution and The Knightian Blindspot of Machine Learning
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
1
0
22 Jan 2025
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
114
2
0
23 Oct 2024
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Alihan Hüyük
A. R. Koblitz
Atefeh Mohajeri
M. Andrews
OffRL
35
0
0
19 Sep 2024
The Overcooked Generalisation Challenge
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
48
4
0
25 Jun 2024
Learning to Select Goals in Automated Planning with Deep-Q Learning
Learning to Select Goals in Automated Planning with Deep-Q Learning
Carlos Núnez-Molina
Juan Fernández-Olivares
Raúl Pérez
28
10
0
20 Jun 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic
  Queuing Network Optimization: Technical Report
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
21
1
0
05 Apr 2024
General-purpose foundation models for increased autonomy in
  robot-assisted surgery
General-purpose foundation models for increased autonomy in robot-assisted surgery
Samuel Schmidgall
Ji Woong Kim
Alan Kuntz
A. Ghazi
Axel Krieger
MedIm
46
9
0
01 Jan 2024
Adversarial Style Transfer for Robust Policy Optimization in Deep
  Reinforcement Learning
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
23
4
0
29 Aug 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced
  Transfer in Deep Reinforcement Learning
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
29
1
0
11 Jul 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
16
3
0
17 Mar 2023
Expediting Distributed DNN Training with Device Topology-Aware Graph
  Deployment
Expediting Distributed DNN Training with Device Topology-Aware Graph Deployment
Shiwei Zhang
Xiaodong Yi
Lansong Diao
Chuan Wu
Siyu Wang
W. Lin
GNN
11
5
0
13 Feb 2023
Multi-Environment Pretraining Enables Transfer to Action Limited
  Datasets
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
22
5
0
23 Nov 2022
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model Overoptimization
Leo Gao
John Schulman
Jacob Hilton
ALM
38
473
0
19 Oct 2022
Exploration via Elliptical Episodic Bonuses
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
29
39
0
11 Oct 2022
Exploration Policies for On-the-Fly Controller Synthesis: A
  Reinforcement Learning Approach
Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach
Tomás Delgado
Marco Sánchez Sorondo
V. Braberman
Sebastián Uchitel
OffRL
19
1
0
07 Oct 2022
Look where you look! Saliency-guided Q-networks for generalization in
  visual Reinforcement Learning
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
25
29
0
16 Sep 2022
Bootstrap State Representation using Style Transfer for Better
  Generalization in Deep Reinforcement Learning
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
23
4
0
15 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktaschel
OnRL
38
4
0
13 Jul 2022
Chain of Thought Imitation with Procedure Cloning
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
30
29
0
22 May 2022
The Primacy Bias in Deep Reinforcement Learning
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. DÓro
Pierre-Luc Bacon
Aaron C. Courville
OnRL
90
178
0
16 May 2022
Local Feature Swapping for Generalization in Reinforcement Learning
Local Feature Swapping for Generalization in Reinforcement Learning
David Bertoin
Emmanuel Rachelson
OOD
18
14
0
13 Apr 2022
Evolving Curricula with Regret-Based Environment Design
Evolving Curricula with Regret-Based Environment Design
Jack Parker-Holder
Minqi Jiang
Michael Dennis
Mikayel Samvelyan
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
31
116
0
02 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
40
11
0
01 Mar 2022
Machine Learning Empowered Intelligent Data Center Networking: A Survey
Machine Learning Empowered Intelligent Data Center Networking: A Survey
Bo-wen Li
Ting Wang
Peng Yang
Mingsong Chen
Shui Yu
Mounir Hamdi
AI4CE
14
4
0
28 Feb 2022
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for
  Visual Reinforcement Learning
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning
Zhecheng Yuan
Guozheng Ma
Yao Mu
Bo Xia
Bo Yuan
Xueqian Wang
Ping Luo
Huazhe Xu
25
28
0
21 Feb 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
30
100
0
11 Jan 2022
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit
  Partial Observability
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
272
109
0
13 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under
  Data Augmentation
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
20
133
0
01 Jul 2021
Generalization of Reinforcement Learning with Policy-Aware Adversarial
  Data Augmentation
Generalization of Reinforcement Learning with Policy-Aware Adversarial Data Augmentation
Hanping Zhang
Yuhong Guo
22
23
0
29 Jun 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
24
174
0
10 Mar 2021
Sparse Attention Guided Dynamic Value Estimation for Single-Task
  Multi-Scene Reinforcement Learning
Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning
Jaskirat Singh
Liang Zheng
OffRL
16
3
0
14 Feb 2021
A Deep Generative Model for Molecule Optimization via One Fragment
  Modification
A Deep Generative Model for Molecule Optimization via One Fragment Modification
Ziqi Chen
Martin Renqiang Min
S. Parthasarathy
Xia Ning
21
61
0
08 Dec 2020
Online Safety Assurance for Deep Reinforcement Learning
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
36
5
0
07 Oct 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
10
70
0
04 Jul 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement
  Learning
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
21
83
0
10 Jun 2020
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
22
541
0
03 Dec 2019
An Empirical Study on Hyperparameters and their Interdependence for RL
  Generalization
An Empirical Study on Hyperparameters and their Interdependence for RL Generalization
Xingyou Song
Yilun Du
Jacob Jackson
AI4CE
19
8
0
02 Jun 2019
Meta reinforcement learning as task inference
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
18
127
0
15 May 2019
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Yilun Du
Karthik Narasimhan
19
33
0
13 May 2019
Neural Logic Reinforcement Learning
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
21
71
0
24 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning
  Without a Supercomputer
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
24
25
0
03 Apr 2019
Quantifying Generalization in Reinforcement Learning
Quantifying Generalization in Reinforcement Learning
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
OffRL
10
658
0
06 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
28
392
0
19 Nov 2018
Generalization and Regularization in DQN
Generalization and Regularization in DQN
Jesse Farebrother
Marlos C. Machado
Michael H. Bowling
25
203
0
29 Sep 2018
A Dissection of Overfitting and Generalization in Continuous
  Reinforcement Learning
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
19
176
0
20 Jun 2018
Relational Deep Reinforcement Learning
Relational Deep Reinforcement Learning
V. Zambaldi
David Raposo
Adam Santoro
V. Bapst
Yujia Li
...
Victoria Langston
Razvan Pascanu
M. Botvinick
Oriol Vinyals
Peter W. Battaglia
OffRL
18
218
0
05 Jun 2018
Transfer Learning for Related Reinforcement Learning Tasks via
  Image-to-Image Translation
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Shani Gamrian
Yoav Goldberg
24
104
0
31 May 2018
1