ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.06350
  4. Cited By
Interference and Generalization in Temporal Difference Learning

Interference and Generalization in Temporal Difference Learning

International Conference on Machine Learning (ICML), 2020
13 March 2020
Emmanuel Bengio
Joelle Pineau
Doina Precup
ArXiv (abs)PDFHTML

Papers citing "Interference and Generalization in Temporal Difference Learning"

42 / 42 papers shown
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
J. Obando-Ceron
Walter Mayor
Samuel Lavoie
Scott Fujimoto
Aaron Courville
Pablo Samuel Castro
191
7
0
15 Oct 2025
Imagined Autocurricula
Imagined Autocurricula
Ahmet H. Güzel
Matthew Jackson
Jarek Liesen
Tim Rocktaschel
Jakob Foerster
Ilija Bogunovic
Jack Parker-Holder
311
2
0
11 Sep 2025
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Ahmet H. Güzel
Ilija Bogunovic
Jack Parker-Holder
OffRLOnRL
261
0
0
17 Aug 2025
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Li Shen
Pierre-Luc Bacon
Dacheng Tao
OffRL
231
8
0
20 Jun 2025
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
Glen Berseth
242
12
0
31 May 2025
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
Chenhao Li
Andreas Krause
Marco Hutter
OffRL
511
6
0
23 Apr 2025
Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being
Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being
Bin Yin
Chong-Yi Liu
Liya Fu
Jinkun Zhang
AI4TS
357
0
0
24 Feb 2025
Continual Task Learning through Adaptive Policy Self-Composition
Shengchao Hu
Yuhang Zhou
Ziqing Fan
Jifeng Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
470
1
0
18 Nov 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of
  Value and Policy Churn
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy ChurnNeural Information Processing Systems (NeurIPS), 2024
Hongyao Tang
Glen Berseth
OffRL
369
12
0
07 Sep 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and
  Alignment
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
409
19
0
28 Mar 2024
Investigating Generalization Behaviours of Generative Flow Networks
Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic
Emmanuel Bengio
AI4CE
374
8
0
07 Feb 2024
The Generalization Gap in Offline Reinforcement Learning
The Generalization Gap in Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
541
24
0
10 Dec 2023
Measuring and Mitigating Interference in Reinforcement Learning
Measuring and Mitigating Interference in Reinforcement Learning
Vincent Liu
Zheng Chen
Ruo Yu Tao
Khurram Javed
Adam White
Martha White
431
9
0
10 Jul 2023
Minibatch training of neural network ensembles via trajectory sampling
Minibatch training of neural network ensembles via trajectory sampling
Jamie F. Mair
Luke Causer
J. P. Garrahan
322
0
0
23 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCVOffRL
235
40
0
08 Jun 2023
Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Continual Task Allocation in Meta-Policy Network via Sparse PromptingInternational Conference on Machine Learning (ICML), 2023
Yijun Yang
Tianyi Zhou
Jing Jiang
Guodong Long
Yuhui Shi
CLLOffRL
399
14
0
29 May 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
369
149
0
24 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven EnvironmentsInternational Conference on Learning Representations (ICLR), 2023
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLLOffRL
595
5
0
04 Feb 2023
Learning to Optimize for Reinforcement Learning
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
503
10
0
03 Feb 2023
Adversarial Cheap Talk
Adversarial Cheap TalkInternational Conference on Machine Learning (ICML), 2022
Chris Xiaoxuan Lu
Timon Willi
Alistair Letcher
Jakob N. Foerster
AAML
370
17
0
20 Nov 2022
Rethinking Value Function Learning for Generalization in Reinforcement
  Learning
Rethinking Value Function Learning for Generalization in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OODOffRL
266
16
0
18 Oct 2022
Learning GFlowNets from partial episodes for improved convergence and
  stability
Learning GFlowNets from partial episodes for improved convergence and stabilityInternational Conference on Machine Learning (ICML), 2022
Kanika Madan
Jarrid Rector-Brooks
Maksym Korablyov
Emmanuel Bengio
Moksh Jain
A. Nica
Tom Bosc
Yoshua Bengio
Nikolay Malkin
329
134
0
26 Sep 2022
Learning Dynamics and Generalization in Reinforcement Learning
Learning Dynamics and Generalization in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
Marta Z. Kwiatkowska
Y. Gal
OODOffRL
238
17
0
05 Jun 2022
The Primacy Bias in Deep Reinforcement Learning
The Primacy Bias in Deep Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Evgenii Nikishin
Max Schwarzer
P. DÓro
Pierre-Luc Bacon
Rameswar Panda
OnRL
388
265
0
16 May 2022
Generative Flow Networks for Discrete Probabilistic Modeling
Generative Flow Networks for Discrete Probabilistic ModelingInternational Conference on Machine Learning (ICML), 2022
Dinghuai Zhang
Nikolay Malkin
Ziqiang Liu
Alexandra Volokhova
Aaron Courville
Yoshua Bengio
440
131
0
03 Feb 2022
Trajectory balance: Improved credit assignment in GFlowNets
Trajectory balance: Improved credit assignment in GFlowNetsNeural Information Processing Systems (NeurIPS), 2022
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
601
266
0
31 Jan 2022
Quantum Architecture Search via Continual Reinforcement Learning
Quantum Architecture Search via Continual Reinforcement Learning
Esther Ye
Samuel Yen-Chi Chen
354
43
0
10 Dec 2021
DR3: Value-Based Deep Reinforcement Learning Requires Explicit
  Regularization
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
247
84
0
09 Dec 2021
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk
Amy Zhang
Edward Grefenstette
Tim Rocktaschel
OffRL
874
249
0
18 Nov 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
203
70
0
26 Oct 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on
  Context Division and Knowledge Distillation
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
220
24
0
01 Sep 2021
Disparity Between Batches as a Signal for Early Stopping
Disparity Between Batches as a Signal for Early Stopping
Mahsa Forouzesh
Patrick Thiran
370
12
0
14 Jul 2021
Preferential Temporal Difference Learning
Preferential Temporal Difference LearningInternational Conference on Machine Learning (ICML), 2021
N. Anand
Doina Precup
OOD
199
9
0
11 Jun 2021
Flow Network based Generative Models for Non-Iterative Diverse Candidate
  Generation
Flow Network based Generative Models for Non-Iterative Diverse Candidate GenerationNeural Information Processing Systems (NeurIPS), 2021
Emmanuel Bengio
Moksh Jain
Maksym Korablyov
Doina Precup
Yoshua Bengio
424
475
0
08 Jun 2021
Correcting Momentum in Temporal Difference Learning
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio
Joelle Pineau
Doina Precup
250
11
0
07 Jun 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation
  Perspective
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation PerspectiveInternational Conference on Machine Learning (ICML), 2021
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
368
67
0
11 May 2021
Decoupling Value and Policy for Generalization in Reinforcement Learning
Decoupling Value and Policy for Generalization in Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Roberta Raileanu
Rob Fergus
DRLOffRL
340
115
0
20 Feb 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and PerspectivesJournal of Artificial Intelligence Research (JAIR), 2020
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLLOffRL
670
405
0
25 Dec 2020
Implicit Under-Parameterization Inhibits Data-Efficient Deep
  Reinforcement Learning
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2020
Aviral Kumar
Rishabh Agarwal
Dibya Ghosh
Sergey Levine
OffRL
414
151
0
27 Oct 2020
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
A Deeper Look at Discounting Mismatch in Actor-Critic AlgorithmsAdaptive Agents and Multi-Agent Systems (AAMAS), 2020
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
540
15
0
02 Oct 2020
Towards a practical measure of interference for reinforcement learning
Towards a practical measure of interference for reinforcement learning
Vincent Liu
Adam White
Hengshuai Yao
Martha White
242
6
0
07 Jul 2020
MOPO: Model-based Offline Policy Optimization
MOPO: Model-based Offline Policy OptimizationNeural Information Processing Systems (NeurIPS), 2020
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
869
906
0
27 May 2020
1
Page 1 of 1