arXiv: 2309.00230
JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning
1 September 2023
Wai-Chung Kwan
Huimin Wang
Hongru Wang
Zezhong Wang
Xian Wu
Yefeng Zheng
Kam-Fai Wong
OffRL
Papers citing "JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning"
2 / 2 papers shown
Title
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,953
0
04 Mar 2022
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System
Yixuan Su
Lei Shu
Elman Mansimov
Arshit Gupta
Deng Cai
Yi-An Lai
Yi Zhang
150
192
0
29 Sep 2021