RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

8 August 2023

Mennatallah El-Assady

Papers citing "RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback"

2 / 2 papers shown

Title
CREW: Facilitating Human-AI Teaming Research Lingyu Zhang Zhengran Ji Boyuan Chen 34 3 0 03 Jan 2025
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 301 11,730 0 04 Mar 2022