2210.15893
Cited By
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels
28 October 2022
Weiyan Shi
Emily Dinan
Kurt Shuster
Jason Weston
Jing Xu
Papers citing
"When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels"
5 / 5 papers shown
Title
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
Lifan Jiang
Zhihui Wang
Siqi Yin
Guangxiao Ma
Peng Zhang
Boxi Wu
DiffM
46
2
0
28 Aug 2024
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models
Beatriz Borges
Niket Tandon
Tanja Käser
Antoine Bosselut
17
3
0
01 Jul 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Internet-Augmented Dialogue Generation
M. Komeili
Kurt Shuster
Jason Weston
RALM
228
278
0
15 Jul 2021
Dialogue Learning With Human-In-The-Loop
Jiwei Li
Alexander H. Miller
S. Chopra
Marc'Aurelio Ranzato
Jason Weston
OffRL
207
132
0
29 Nov 2016