Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback

6 July 2023

Papers citing "Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback"

6 / 6 papers shown

Title
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model Zilong Deng Simon Khan Shaofeng Zou 42 0 0 11 Mar 2025
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation Yu Chen Xiangcheng Zhang Siwei Wang Longbo Huang 21 3 0 28 Feb 2024
Improving alignment of dialogue agents via targeted human judgements Amelia Glaese Nat McAleese Maja Trkebacz John Aslanides Vlad Firoiu ... John F. J. Mellor Demis Hassabis Koray Kavukcuoglu Lisa Anne Hendricks G. Irving ALM AAML 225 495 0 28 Sep 2022
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 301 11,730 0 04 Mar 2022
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP Zihan Zhang Jiaqi Yang Xiangyang Ji S. Du 54 36 0 29 Jan 2021
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach Yinlam Chow Aviv Tamar Shie Mannor Marco Pavone 62 310 0 06 Jun 2015