Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.03987
Cited By
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
6 May 2023
Zhoujian Sun
Chenyang Zhao
Zheng-Wei Huang
Nai Ding
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization"
2 / 2 papers shown
Title
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
42
10
0
28 Aug 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
1