ResearchTrend.AI
JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning

1 September 2023
Wai-Chung Kwan
Huimin Wang
Hongru Wang
Zezhong Wang
Xian Wu
Yefeng Zheng
Kam-Fai Wong
    OffRL

Papers citing "JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning"

2 / 2 papers shown
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,953
0
04 Mar 2022
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System
Yixuan Su
Lei Shu
Elman Mansimov
Arshit Gupta
Deng Cai
Yi-An Lai
Yi Zhang
150
192
0
29 Sep 2021