ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.07550
  4. Cited By
Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for
  Task-Completion Dialogue Policy Learning

Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning

19 November 2018
Yuexin Wu
Xiujun Li
Jingjing Liu
Jianfeng Gao
Yiming Yang
ArXivPDFHTML

Papers citing "Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning"

6 / 6 papers shown
Title
A Survey on Recent Advances and Challenges in Reinforcement Learning
  Methods for Task-Oriented Dialogue Policy Learning
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
38
43
0
28 Feb 2022
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue
  Systems
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems
Andrea Madotto
Samuel Cahyawijaya
Genta Indra Winata
Yan Xu
Zihan Liu
Zhaojiang Lin
Pascale Fung
36
59
0
28 Sep 2020
A Survey on Dialog Management: Recent Advances and Challenges
A Survey on Dialog Management: Recent Advances and Challenges
Yinpei Dai
Huihua Yu
Yixuan Jiang
Chengguang Tang
Yongbin Li
Jian Sun
OffRL
VLM
30
20
0
05 May 2020
Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
Zheng Zhang
Lizi Liao
Xiaoyan Zhu
Tat-Seng Chua
Zitao Liu
Yan Huang
Minlie Huang
LLMAG
30
19
0
21 Apr 2020
Budgeted Policy Learning for Task-Oriented Dialogue Systems
Budgeted Policy Learning for Task-Oriented Dialogue Systems
Zhirui Zhang
Xiujun Li
Jianfeng Gao
Enhong Chen
OffRL
35
36
0
02 Jun 2019
Dyna-AIL : Adversarial Imitation Learning by Planning
Dyna-AIL : Adversarial Imitation Learning by Planning
Vaibhav Saxena
Srinivasan Sivanandan
Pulkit Mathur
11
1
0
08 Mar 2019
1