Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.06601
Cited By
Causal Confusion and Reward Misidentification in Preference-Based Reward Learning
13 April 2022
J. Tien
Jerry Zhi-Yang He
Zackory M. Erickson
Anca Dragan
Daniel S. Brown
CML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Causal Confusion and Reward Misidentification in Preference-Based Reward Learning"
3 / 3 papers shown
Title
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
33
2
0
17 Jan 2025
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Rui Pan
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
6
397
0
13 Apr 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
1