Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.03359
Cited By
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
9 May 2018
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reward Estimation for Variance Reduction in Deep Reinforcement Learning"
9 / 9 papers shown
Title
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
47
9
0
03 Jun 2024
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
G. Antoniol
41
7
0
13 Jan 2023
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
43
0
0
23 Nov 2022
Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
Jifeng Hu
Yanchao Sun
Hechang Chen
Sili Huang
Haiyin Piao
Yi-Ju Chang
Lichao Sun
28
5
0
14 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo Li
Ding Zhao
79
45
0
16 Sep 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
43
34
0
05 Jan 2022
Disturbing Reinforcement Learning Agents with Corrupted Rewards
Rubén Majadas
Javier A. García
Fernando Fernández
AAML
21
6
0
12 Feb 2021
Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient
Hao Jia
Xiao Zhang
Jun Xu
Wei Zeng
Hao Jiang
Xiao Yan
Ji-Rong Wen
27
3
0
25 Jul 2020
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
35
213
0
20 Jun 2018
1