ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.07627
  4. Cited By
Redeeming Intrinsic Rewards via Constrained Optimization

Redeeming Intrinsic Rewards via Constrained Optimization

14 November 2022
Eric Chen
Zhang-Wei Hong
J. Pajarinen
Pulkit Agrawal
    OnRL
ArXivPDFHTML

Papers citing "Redeeming Intrinsic Rewards via Constrained Optimization"

15 / 15 papers shown
Title
Potential-Based Intrinsic Motivation: Preserving Optimality With
  Complex, Non-Markovian Shaping Rewards
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
16
0
0
16 Oct 2024
Automatic Environment Shaping is the Next Frontier in RL
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
32
3
0
23 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Constrained Intrinsic Motivation for Reinforcement Learning
Constrained Intrinsic Motivation for Reinforcement Learning
Xiang Zheng
Xingjun Ma
Chao Shen
Cong Wang
21
1
0
12 Jul 2024
InsigHTable: Insight-driven Hierarchical Table Visualization with
  Reinforcement Learning
InsigHTable: Insight-driven Hierarchical Table Visualization with Reinforcement Learning
Guozheng Li
Peng He
Xinyu Wang
Runfei Li
Chi Harold Liu
Chuangxin Ou
Dong He
Guoren Wang
38
1
0
27 May 2024
Curiosity-driven Red-teaming for Large Language Models
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong
Idan Shenfeld
T. Wang
Yung-Sung Chuang
Aldo Pareja
James R. Glass
Akash Srivastava
Pulkit Agrawal
LRM
28
39
0
29 Feb 2024
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the
  Generative Artificial Intelligence (AI) Research Landscape
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
79
46
0
18 Dec 2023
Multi-Objective Reinforcement Learning-based Approach for Pressurized
  Water Reactor Optimization
Multi-Objective Reinforcement Learning-based Approach for Pressurized Water Reactor Optimization
Paul Seurin
K. Shirvan
11
9
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
67
5
0
13 Dec 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
19
1
0
12 Oct 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from
  Human-in-the-Loop Feedback
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
13
8
0
20 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
9
12
0
06 Jul 2023
Ensemble Value Functions for Efficient Exploration in Multi-Agent
  Reinforcement Learning
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Lukas Schafer
Oliver Slumbers
Stephen Marcus McAleer
Yali Du
Stefano V. Albrecht
D. Mguni
61
7
0
07 Feb 2023
Walk These Ways: Tuning Robot Control for Generalization with
  Multiplicity of Behavior
Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior
G. Margolis
Pulkit Agrawal
11
149
0
06 Dec 2022
Multi-Objective reward generalization: Improving performance of Deep
  Reinforcement Learning for applications in single-asset trading
Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset trading
F. Cornalba
C. Disselkamp
Davide Scassola
Christopher Helf
9
5
0
09 Mar 2022
1