Optimizing Agent Behavior over Long Time Scales by Transporting Value

15 October 2018
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
M. Berk Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne

Papers citing "Optimizing Agent Behavior over Long Time Scales by Transporting Value"

12 / 12 papers shown
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
1
0
22 Jan 2025
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
CLL
33
5
0
26 Oct 2023
PCGPT: Procedural Content Generation via Transformers
Sajad Mohaghegh
Mohammad Amin Ramezan Dehnavi
Golnoosh Abdollahinejad
Matin Hashemi
ViT
16
2
0
03 Oct 2023
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
19
2
0
20 Feb 2022
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
32
0
0
14 Jan 2022
Biological learning in key-value memory networks
Danil Tyulmankov
Ching Fang
Annapurna Vadaparty
G. R. Yang
20
27
0
26 Oct 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
27
11
0
08 Jun 2021
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
24
47
0
28 May 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
22
17
0
10 Mar 2021
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
13
509
0
30 Mar 2020
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
22
212
0
20 Jun 2018