ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.04529
  4. Cited By
Self-Supervised Online Reward Shaping in Sparse-Reward Environments

Self-Supervised Online Reward Shaping in Sparse-Reward Environments

8 March 2021
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
    OffRL
ArXivPDFHTML

Papers citing "Self-Supervised Online Reward Shaping in Sparse-Reward Environments"

7 / 7 papers shown
Title
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
51
0
0
14 Mar 2025
Offline Model-Based Optimization by Learning to Rank
Offline Model-Based Optimization by Learning to Rank
Rong-Xi Tan
Ke Xue
Shen-Huan Lyu
Haopu Shang
Yao Wang
Yaoyuan Wang
Sheng Fu
Chao Qian
OffRL
81
2
0
15 Oct 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
29
5
0
06 Aug 2024
Informativeness of Reward Functions in Reinforcement Learning
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
26
2
0
10 Feb 2024
A reinforcement learning path planning approach for range-only
  underwater target localization with autonomous vehicles
A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles
Ivan Masmitja
Mario Martin
K. Katija
S. Gomáriz
J. Navarro
19
5
0
17 Jan 2023
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Daniel Shin
Anca Dragan
Daniel S. Brown
OffRL
14
53
0
03 Jan 2023
Modular Deep Reinforcement Learning for Continuous Motion Planning with
  Temporal Logic
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
Mingyu Cai
Mohammadhosein Hasanbeig
Shaoping Xiao
Alessandro Abate
Z. Kan
80
86
0
24 Feb 2021
1