Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.02575
Cited By
Active Preference-Based Gaussian Process Regression for Reward Learning
6 May 2020
Erdem Biyik
Nicolas Huynh
Mykel J. Kochenderfer
Dorsa Sadigh
GP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Active Preference-Based Gaussian Process Regression for Reward Learning"
18 / 18 papers shown
Title
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Jianya Lu
Yingjun Mo
21
0
0
01 May 2025
Towards Uncertainty Unification: A Case Study for Preference Learning
Shaoting Peng
Haonan Chen
Katherine Driggs-Campbell
51
0
0
25 Mar 2025
Towards Dynamic Trend Filtering through Trend Point Detection with Reinforcement Learning
Jihyeon Seong
Sekwang Oh
Jaesik Choi
AI4TS
29
0
0
06 Jun 2024
Learning Reward for Robot Skills Using Large Language Models via Self-Alignment
Yuwei Zeng
Yao Mu
Lin Shao
26
12
0
12 May 2024
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
JoonHo Lee
Jae Oh Woo
Juree Seok
Parisa Hassanzadeh
Wooseok Jang
...
Hankyu Moon
Wenjun Hu
Yeong-Dae Kwon
Taehee Lee
Seungjai Min
40
2
0
10 May 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
38
2
0
30 Apr 2024
A Generalized Acquisition Function for Preference-based Reward Learning
Evan Ellis
Gaurav R. Ghosal
Stuart J. Russell
Anca Dragan
Erdem Biyik
29
1
0
09 Mar 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
20
94
0
08 Jan 2024
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
8
0
0
23 Nov 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
9
57
0
24 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Safety-Aware Preference-Based Learning for Safety-Critical Control
Ryan K. Cosner
Maegan Tucker
Andrew J. Taylor
Kejun Li
Tamás G. Molnár
Wyatt Ubellacker
Anil Alan
G. Orosz
Yisong Yue
Aaron D. Ames
20
24
0
15 Dec 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
11
91
0
04 Nov 2021
Correct Me if I am Wrong: Interactive Learning for Robotic Manipulation
Eugenio Chisari
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
Abhinav Valada
11
36
0
07 Oct 2021
Learning Reward Functions from Scale Feedback
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
39
32
0
01 Oct 2021
Bayesian Robust Optimization for Imitation Learning
Daniel S. Brown
S. Niekum
Marek Petrik
12
32
0
24 Jul 2020
Preference-Based Learning for Exoskeleton Gait Optimization
Maegan Tucker
Ellen R. Novoseller
Claudia K. Kann
Yanan Sui
Yisong Yue
J. W. Burdick
Aaron D. Ames
63
89
0
26 Sep 2019
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video
Chandrayee Basu
Qian Yang
M. Singhal
Anca Dragan
49
174
0
25 Mar 2016
1