2310.19733
Differentially Private Reward Estimation with Preference Feedback
30 October 2023
Sayak Ray Chowdhury
Xingyu Zhou
Nagarajan Natarajan
Papers citing "Differentially Private Reward Estimation with Preference Feedback"
3 / 3 papers shown
Title
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trębacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
225
495
0
28 Sep 2022
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
Xiaoyu Chen
Han Zhong
Zhuoran Yang
Zhaoran Wang
Liwei Wang
118
59
0
23 May 2022
Differentially Private Fine-tuning of Language Models
Da Yu
Saurabh Naik
A. Backurs
Sivakanth Gopi
Huseyin A. Inan
...
Y. Lee
Andre Manoel
Lukas Wutschitz
Sergey Yekhanin
Huishuai Zhang
134
344
0
13 Oct 2021