Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.06721
Cited By
Optimizing Agent Behavior over Long Time Scales by Transporting Value
15 October 2018
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
M. Berk Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimizing Agent Behavior over Long Time Scales by Transporting Value"
12 / 12 papers shown
Title
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
1
0
22 Jan 2025
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
CLL
33
5
0
26 Oct 2023
PCGPT: Procedural Content Generation via Transformers
Sajad Mohaghegh
Mohammad Amin Ramezan Dehnavi
Golnoosh Abdollahinejad
Matin Hashemi
ViT
16
2
0
03 Oct 2023
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
19
2
0
20 Feb 2022
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
32
0
0
14 Jan 2022
Biological learning in key-value memory networks
Danil Tyulmankov
Ching Fang
Annapurna Vadaparty
G. R. Yang
20
27
0
26 Oct 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
27
11
0
08 Jun 2021
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
24
47
0
28 May 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
22
17
0
10 Mar 2021
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
13
509
0
30 Mar 2020
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
22
212
0
20 Jun 2018
1