Crowd-PrefRL: Preference-Based Reward Learning from Crowds

17 January 2024
David Chhan
Ellen R. Novoseller
Vernon J. Lawhern

Papers citing "Crowd-PrefRL: Preference-Based Reward Learning from Crowds"

7 / 7 papers shown
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
Mingkang Wu
Devin White
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
OffRL
34
0
0
13 Jan 2025
Learning From Crowdsourced Noisy Labels: A Signal Processing Perspective
Shahana Ibrahim
Panagiotis A. Traganitis
Xiao Fu
G. Giannakis
NoLa
32
0
0
09 Jul 2024
Corruption Robust Offline Reinforcement Learning with Human Feedback
Debmalya Mandal
Andi Nika
Parameswaran Kamalaruban
Adish Singla
Goran Radanović
OffRL
15
8
0
09 Feb 2024
Scalable Interactive Machine Learning for Future Command and Control
Anna Madison
Ellen R. Novoseller
Vinicius G. Goecks
Benjamin T. Files
Nicholas R. Waytowich
Alfred Yu
Vernon J. Lawhern
Steven Thurman
Christopher Kelshaw
Kaleb McDowell
19
3
0
09 Feb 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Learning Reward Functions from Scale Feedback
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
31
27
0
01 Oct 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020