Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04303
Cited By
Batch Active Preference-Based Learning of Reward Functions
10 October 2018
Erdem Biyik
Dorsa Sadigh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Batch Active Preference-Based Learning of Reward Functions"
14 / 14 papers shown
Title
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
52
1
0
26 Jun 2024
Pareto-Optimal Learning from Preferences with Hidden Context
Ryan Boldi
Li Ding
Lee Spector
S. Niekum
51
6
0
21 Jun 2024
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
JoonHo Lee
Jae Oh Woo
Juree Seok
Parisa Hassanzadeh
Wooseok Jang
...
Hankyu Moon
Wenjun Hu
Yeong-Dae Kwon
Taehee Lee
Seungjai Min
40
2
0
10 May 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
38
2
0
30 Apr 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Haoxiang Wang
Yong Lin
Wei Xiong
Rui Yang
Shizhe Diao
Shuang Qiu
Han Zhao
Tong Zhang
40
70
0
28 Feb 2024
A density estimation perspective on learning from pairwise human preferences
Vincent Dumoulin
Daniel D. Johnson
Pablo Samuel Castro
Hugo Larochelle
Yann Dauphin
21
12
0
23 Nov 2023
Active Inverse Learning in Stackelberg Trajectory Games
Yue Yu
Jacob Levy
Negar Mehr
David Fridovich-Keil
Ufuk Topcu
11
2
0
15 Aug 2023
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
9
57
0
24 May 2022
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
13
91
0
04 Nov 2021
The Reasonable Crowd: Towards evidence-based and interpretable models of driving behavior
Bassam Helou
Aditya Dusi
Anne-Sophie Collin
N. Mehdipour
Zhiliang Chen
Cristhian G. Lizarazo
C. Belta
Tichakorn Wongpiromsarn
R. D. Tebbens
Oscar Beijbom
15
21
0
28 Jul 2021
Uncertain Decisions Facilitate Better Preference Learning
Cassidy Laidlaw
Stuart J. Russell
17
10
0
19 Jun 2021
Learning an Urban Air Mobility Encounter Model from Expert Preferences
Sydney M. Katz
Anne-Claire Le Bihan
Mykel J. Kochenderfer
11
17
0
12 Jul 2019
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video
Chandrayee Basu
Qian Yang
M. Singhal
Anca Dragan
49
174
0
25 Mar 2016
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
152
1,123
0
25 Jul 2012
1