Batch Active Preference-Based Learning of Reward Functions

Batch Active Preference-Based Learning of Reward Functions

10 October 2018

Dorsa Sadigh

Papers citing "Batch Active Preference-Based Learning of Reward Functions"

14 / 14 papers shown

Title
Preference Elicitation for Offline Reinforcement Learning Alizée Pace Bernhard Schölkopf Gunnar Rätsch Giorgia Ramponi OffRL 52 1 0 26 Jun 2024
Pareto-Optimal Learning from Preferences with Hidden Context Ryan Boldi Li Ding Lee Spector S. Niekum 51 6 0 21 Jun 2024
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation JoonHo Lee Jae Oh Woo Juree Seok Parisa Hassanzadeh Wooseok Jang ... Hankyu Moon Wenjun Hu Yeong-Dae Kwon Taehee Lee Seungjai Min 40 2 0 10 May 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning Calarina Muslimani M. E. Taylor OffRL 38 2 0 30 Apr 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang Yong Lin Wei Xiong Rui Yang Shizhe Diao Shuang Qiu Han Zhao Tong Zhang 40 70 0 28 Feb 2024
A density estimation perspective on learning from pairwise human preferences Vincent Dumoulin Daniel D. Johnson Pablo Samuel Castro Hugo Larochelle Yann Dauphin 21 12 0 23 Nov 2023
Active Inverse Learning in Stackelberg Trajectory Games Yue Yu Jacob Levy Negar Mehr David Fridovich-Keil Ufuk Topcu 11 2 0 15 Aug 2023
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Xinran Liang Katherine Shu Kimin Lee Pieter Abbeel 9 57 0 24 May 2022
B-Pref: Benchmarking Preference-Based Reinforcement Learning Kimin Lee Laura M. Smith Anca Dragan Pieter Abbeel OffRL 13 91 0 04 Nov 2021
The Reasonable Crowd: Towards evidence-based and interpretable models of driving behavior Bassam Helou Aditya Dusi Anne-Sophie Collin N. Mehdipour Zhiliang Chen Cristhian G. Lizarazo C. Belta Tichakorn Wongpiromsarn R. D. Tebbens Oscar Beijbom 15 21 0 28 Jul 2021
Uncertain Decisions Facilitate Better Preference Learning Cassidy Laidlaw Stuart J. Russell 17 10 0 19 Jun 2021
Learning an Urban Air Mobility Encounter Model from Expert Preferences Sydney M. Katz Anne-Claire Le Bihan Mykel J. Kochenderfer 11 17 0 12 Jul 2019
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video Chandrayee Basu Qian Yang M. Singhal Anca Dragan 49 174 0 25 Mar 2016
Determinantal point processes for machine learning Alex Kulesza B. Taskar 152 1,123 0 25 Jul 2012