ResearchTrend.AI
arXiv: 1910.09281
Dealing with Sparse Rewards in Reinforcement Learning

21 October 2019
J. Hare

Papers citing "Dealing with Sparse Rewards in Reinforcement Learning"

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
51
0
0
14 Mar 2025
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Xiaoguang Niu
Yan Li
Leong Hou U
33
1
0
27 Oct 2024
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
31
0
0
22 Jul 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
37
3
0
20 May 2024
A Survey on Self-Evolution of Large Language Models
Zhengwei Tao
Ting-En Lin
Xiancai Chen
Hangyu Li
Yuchuan Wu
Yongbin Li
Zhi Jin
Fei Huang
Dacheng Tao
Jingren Zhou
LRM
LM&Ro
54
22
0
22 Apr 2024
Collaborative Route Planning of UAVs, Workers and Cars for Crowdsensing in Disaster Response
Lei Han
Chunyu Tu
Zhiwen Yu
Zhiyong Yu
Weihua Shan
Liang Wang
Bin Guo
14
2
0
21 Aug 2023
AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks
Mulong Luo
Wenjie Xiong
G. G. Lee
Yueying Li
Xiaomeng Yang
Amy Zhang
Yuandong Tian
Hsien-Hsin S. Lee
G. E. Suh
AAML
34
10
0
17 Aug 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
12
9
0
24 Feb 2022