ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.04332
  4. Cited By
RLHF-Blender: A Configurable Interactive Interface for Learning from
  Diverse Human Feedback

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

8 August 2023
Yannick Metz
David Lindner
Raphael Baur
Daniel A. Keim
Mennatallah El-Assady
    AI4CE
ArXivPDFHTML

Papers citing "RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback"

2 / 2 papers shown
Title
CREW: Facilitating Human-AI Teaming Research
CREW: Facilitating Human-AI Teaming Research
Lingyu Zhang
Zhengran Ji
Boyuan Chen
34
3
0
03 Jan 2025
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
1