2503.06358
Cited By
Language Model Personalization via Reward Factorization
8 March 2025
Idan Shenfeld
Felix Faltings
Pulkit Agrawal
Aldo Pacchiano
Papers citing "Language Model Personalization via Reward Factorization"
6 / 6 papers shown
T-POP: Test-Time Personalization with Online Preference Feedback
Zikun Qu
Min Zhang
Mingze Kong
Xiang Li
Zhiwei Shang
Zhiyong Wang
Yikun Ban
Shuang Qiu
Yao Shu
Zhongxiang Dai
148
0
0
29 Sep 2025
Sotopia-RL: Reward Design for Social Intelligence
Haofei Yu
Zhengyang Qi
Yining Zhao
Kolby Nottingham
Keyang Xuan
Bodhisattwa Prasad Majumder
Hao Zhu
Paul Pu Liang
Jiaxuan You
OffRL
217
6
0
05 Aug 2025
Capturing Individual Human Preferences with Reward Features
André Barreto
Vincent Dumoulin
Yiran Mao
Nicolas Perez-Nieves
Bobak Shahriari
Yann Dauphin
Doina Precup
Hugo Larochelle
ALM
250
4
0
21 Mar 2025
A Survey of Personalized Large Language Models: Progress and Future Directions
Jiahong Liu
Zexuan Qiu
Zhongyang Li
Quanyu Dai
Jieming Zhu
Minda Hu
Menglin Yang
Irwin King
Tat-Seng Chua
LM&MA
337
30
0
17 Feb 2025
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Tao Ge
Xin Chan
Dian Yu
Haitao Mi
Dong Yu
SyDa
573
273
0
28 Jun 2024
Reinforcement Learning from Human Feedback with Active Queries
Kaixuan Ji
Jiafan He
Quanquan Gu
411
32
0
14 Feb 2024