2401.10941
Cited By
Crowd-PrefRL: Preference-Based Reward Learning from Crowds
17 January 2024
David Chhan, Ellen R. Novoseller, Vernon J. Lawhern
Papers citing "Crowd-PrefRL: Preference-Based Reward Learning from Crowds"
7 / 7 papers shown
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
Mingkang Wu, Devin White, Vernon J. Lawhern, Nicholas R. Waytowich, Yongcan Cao
OffRL
34
0
0
13 Jan 2025
Learning From Crowdsourced Noisy Labels: A Signal Processing Perspective
Shahana Ibrahim, Panagiotis A. Traganitis, Xiao Fu, G. Giannakis
NoLa
32
0
0
09 Jul 2024
Corruption Robust Offline Reinforcement Learning with Human Feedback
Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, Goran Radanović
OffRL
15
8
0
09 Feb 2024
Scalable Interactive Machine Learning for Future Command and Control
Anna Madison, Ellen R. Novoseller, Vinicius G. Goecks, Benjamin T. Files, Nicholas R. Waytowich, Alfred Yu, Vernon J. Lawhern, Steven Thurman, Christopher Kelshaw, Kaleb McDowell
19
3
0
09 Feb 2024
Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Learning Reward Functions from Scale Feedback
Nils Wilde, Erdem Biyik, Dorsa Sadigh, Stephen L. Smith
31
27
0
01 Oct 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine, Aviral Kumar, George Tucker, Justin Fu
OffRL
GP
321
1,662
0
04 May 2020