ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.03359
  4. Cited By
Reward Estimation for Variance Reduction in Deep Reinforcement Learning

Reward Estimation for Variance Reduction in Deep Reinforcement Learning

9 May 2018
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
ArXivPDFHTML

Papers citing "Reward Estimation for Variance Reduction in Deep Reinforcement Learning"

9 / 9 papers shown
Title
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A
  Model-Based Reinforcement Learning Approach
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
47
9
0
03 Jun 2024
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
G. Antoniol
41
7
0
13 Jan 2023
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
43
0
0
23 Nov 2022
Distributional Reward Estimation for Effective Multi-Agent Deep
  Reinforcement Learning
Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
Jifeng Hu
Yanchao Sun
Hechang Chen
Sili Huang
Haiyin Piao
Yi-Ju Chang
Lichao Sun
28
5
0
14 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo Li
Ding Zhao
79
45
0
16 Sep 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
43
34
0
05 Jan 2022
Disturbing Reinforcement Learning Agents with Corrupted Rewards
Disturbing Reinforcement Learning Agents with Corrupted Rewards
Rubén Majadas
Javier A. García
Fernando Fernández
AAML
21
6
0
12 Feb 2021
Variance Reduction for Deep Q-Learning using Stochastic Recursive
  Gradient
Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient
Hao Jia
Xiao Zhang
Jun Xu
Wei Zeng
Hao Jiang
Xiao Yan
Ji-Rong Wen
27
3
0
25 Jul 2020
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
35
213
0
20 Jun 2018
1