Balancing Constraints and Rewards with Meta-Gradient D4PG

13 October 2020

Papers citing "Balancing Constraints and Rewards with Meta-Gradient D4PG"

6 / 6 papers shown

Title
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation Shangding Gu Bilgehan Sel Yuhao Ding Lu Wang Qingwei Lin Ming Jin Alois Knoll 57 9 0 02 May 2024
Confronting Reward Model Overoptimization with Constrained RLHF Ted Moskovitz Aaditya K. Singh DJ Strouse T. Sandholm Ruslan Salakhutdinov Anca D. Dragan Stephen Marcus McAleer 34 47 0 06 Oct 2023
IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas Bengisu Guresti Abdullah Vanlioglu N. K. Üre 13 5 0 28 Feb 2023
MuZero with Self-competition for Rate Control in VP9 Video Compression Amol Mandhane A. Zhernov Maribeth Rauh Chenjie Gu Miaosen Wang ... Jackson Broshear Julian Schrittwieser Thomas Hubert Oriol Vinyals Timothy A. Mann 29 43 0 14 Feb 2022
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification D. Mankowitz D. A. Calian Rae Jeong Cosmin Paduraru N. Heess Sumanth Dathathri Martin Riedmiller Timothy A. Mann 24 11 0 20 Oct 2020
Forward and Reverse Gradient-Based Hyperparameter Optimization Luca Franceschi Michele Donini P. Frasconi Massimiliano Pontil 127 406 0 06 Mar 2017