Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.07627
Cited By
Redeeming Intrinsic Rewards via Constrained Optimization
14 November 2022
Eric Chen
Zhang-Wei Hong
J. Pajarinen
Pulkit Agrawal
OnRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Redeeming Intrinsic Rewards via Constrained Optimization"
15 / 15 papers shown
Title
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
16
0
0
16 Oct 2024
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
32
3
0
23 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
28
3
0
18 Jul 2024
Constrained Intrinsic Motivation for Reinforcement Learning
Xiang Zheng
Xingjun Ma
Chao Shen
Cong Wang
21
1
0
12 Jul 2024
InsigHTable: Insight-driven Hierarchical Table Visualization with Reinforcement Learning
Guozheng Li
Peng He
Xinyu Wang
Runfei Li
Chi Harold Liu
Chuangxin Ou
Dong He
Guoren Wang
33
1
0
27 May 2024
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong
Idan Shenfeld
T. Wang
Yung-Sung Chuang
Aldo Pareja
James R. Glass
Akash Srivastava
Pulkit Agrawal
LRM
28
39
0
29 Feb 2024
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
79
46
0
18 Dec 2023
Multi-Objective Reinforcement Learning-based Approach for Pressurized Water Reactor Optimization
Paul Seurin
K. Shirvan
9
9
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
62
5
0
13 Dec 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
16
1
0
12 Oct 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
13
8
0
20 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
9
12
0
06 Jul 2023
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Lukas Schafer
Oliver Slumbers
Stephen Marcus McAleer
Yali Du
Stefano V. Albrecht
D. Mguni
61
7
0
07 Feb 2023
Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior
G. Margolis
Pulkit Agrawal
11
145
0
06 Dec 2022
Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset trading
F. Cornalba
C. Disselkamp
Davide Scassola
Christopher Helf
9
4
0
09 Mar 2022
1