Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.06135
Cited By
Rewarding Chatbots for Real-World Engagement with Millions of Users
10 March 2023
R. Irvine
D. Boubert
Vyas Raina
Adian Liusie
Ziyi Zhu
Vineet Mudupalli
Aliaksei Korshuk
Z. Liu
Fritz Cremer
Valentin Assassi
Christie-Carol Beauchamp
Xiaoding Lu
Thomas Rialan
W. Beauchamp
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rewarding Chatbots for Real-World Engagement with Millions of Users"
9 / 9 papers shown
Title
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Bradley Brown
Jordan Juravsky
Ryan Ehrlich
Ronald Clark
Quoc V. Le
Christopher Ré
Azalia Mirhoseini
ALM
LRM
76
211
0
03 Jan 2025
MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation
S. Gorti
Ilan Gofman
Zhaoyan Liu
Jiapeng Wu
Noël Vouitsis
Guangwei Yu
Jesse C. Cresswell
Rasa Hosseinzadeh
SyDa
41
6
0
16 Oct 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Zhaolin Gao
Wenhao Zhan
Jonathan D. Chang
Gokul Swamy
Kianté Brantley
Jason D. Lee
Wen Sun
OffRL
56
3
0
06 Oct 2024
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation
Ilya Gusev
LLMAG
55
3
0
10 Sep 2024
Unveiling the Impact of Multi-Modal Interactions on User Engagement: A Comprehensive Evaluation in AI-driven Conversations
Lichao Zhang
Jia Yu
Shuai Zhang
Long Li
Yangyang Zhong
...
Fangsheng Weng
Fayu Pan
Jing Li
Renjun Xu
Zhenzhong Lan
32
4
0
21 Jun 2024
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences
Daiwei Chen
Yi Chen
Aniket Rege
Ramya Korlakai Vinayak
35
17
0
12 Jun 2024
Social AI Improves Well-Being Among Female Young Adults
Ebony Zhang
Xiaoding Lu
AI4MH
13
2
0
12 Nov 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
241
1,444
0
18 Mar 2020
1