ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.01417
  4. Cited By
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing
  Shaped Rewards

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards

4 November 2019
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
ArXivPDFHTML

Papers citing "Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards"

20 / 20 papers shown
Title
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
56
0
0
14 Mar 2025
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
38
1
0
07 Oct 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
29
5
0
06 Aug 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
46
1
0
11 Jun 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned
  Reinforcement Learning
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
26
0
0
19 Apr 2024
Informativeness of Reward Functions in Reinforcement Learning
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
26
2
0
10 Feb 2024
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
37
1
0
12 Dec 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery
  of Skills
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement
  Learning
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
32
4
0
16 Jul 2023
Fulfilling Formal Specifications ASAP by Model-free Reinforcement
  Learning
Fulfilling Formal Specifications ASAP by Model-free Reinforcement Learning
Mengyu Liu
Pengyuan Lu
Xin Chen
Fanxin Kong
O. Sokolsky
Insup Lee
26
3
0
25 Apr 2023
Mastering Asymmetrical Multiplayer Game with Multi-Agent
  Asymmetric-Evolution Reinforcement Learning
Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning
Chenglu Sun
Yi-cui Zhang
Yu Zhang
Ziling Lu
Jingbin Liu
Si-Qi Xu
Weidong Zhang
20
0
0
20 Apr 2023
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer
  across Agents
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
18
0
0
18 Dec 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward
  Long-Horizon Goal-Conditioned Reinforcement Learning
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
26
3
0
28 Oct 2022
Reinforcement Learning for Branch-and-Bound Optimisation using
  Retrospective Trajectories
Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories
Christopher W. F. Parsonson
Alexandre Laterre
Thomas D. Barrett
17
19
0
28 May 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
27
132
0
20 Jan 2022
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
31
1
0
11 Nov 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
An Information-Theoretic Perspective on Credit Assignment in
  Reinforcement Learning
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
22
17
0
10 Mar 2021
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
OffRL
26
47
0
08 Mar 2021
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
22
212
0
20 Jun 2018
1