Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning

18 January 2018
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Kam-Fai Wong
Shang-Yu Su
    OffRL

Papers citing "Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning"

27 / 27 papers shown
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
26
0
0
13 Jul 2023
A Unified Framework for Slot based Response Generation in a Multimodal Dialogue System
Mauajama Firdaus
Avinash Madasu
Asif Ekbal
47
7
0
27 May 2023
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning
Xiao Yu
Maximillian Chen
Zhou Yu
LLMAG
LM&Ro
32
35
0
23 May 2023
Deep RL with Hierarchical Action Exploration for Dialogue Generation
Itsugun Cho
Ryota Takahashi
Yusaku Yanase
Hiroaki Saito
28
2
0
22 Mar 2023
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
38
43
0
28 Feb 2022
Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks
Sina Shahhosseini
Tianyi Hu
Dongjoo Seo
A. Kanduri
Bryan Donyanavard
Amir M. Rahmani
N. Dutt
26
4
0
21 Feb 2022
Reinforced Natural Language Interfaces via Entropy Decomposition
Xiaoran Wu
Yipeng Kang
LLMAG
30
0
0
23 Sep 2021
TREND: Trigger-Enhanced Relation-Extraction Network for Dialogues
Po-Wei Lin
Shang-Yu Su
Yun-Nung Chen
36
5
0
31 Aug 2021
Task-Oriented Dialogue System as Natural Language Generation
Weizhi Wang
Zhirui Zhang
Junliang Guo
Yinpei Dai
Boxing Chen
Weihua Luo
36
32
0
31 Aug 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Martin Mladenov
Chih-Wei Hsu
Vihan Jain
Eugene Ie
Christopher Colby
Nicolas Mayoraz
H. Pham
Dustin Tran
Ivan Vendrov
Craig Boutilier
BDL
15
32
0
14 Mar 2021
Towards Emotion-Aware User Simulator for Task-Oriented Dialogue
Rui Zhang
Kai-Li Yin
Li-Wei Li
38
2
0
19 Nov 2020
GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented Dialogue Systems
Shiquan Yang
Rui Zhang
S. Erfani
27
60
0
04 Oct 2020
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems
Andrea Madotto
Samuel Cahyawijaya
Genta Indra Winata
Yan Xu
Zihan Liu
Zhaojiang Lin
Pascale Fung
36
59
0
28 Sep 2020
DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue
X. Jiang
Jiahao Yu
Yajing Sun
Zengchang Qin
Zihao Zhu
Yue Hu
Qi Wu
MLLM
46
19
0
07 Jul 2020
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in Dialogue Systems
Zehao Lin
Shaobo Cui
Guodun Li
Xiaoming Kang
Feng Ji
Feng-Lin Li
Zhongzhou Zhao
Haiqing Chen
Yin Zhang
34
1
0
27 May 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
A Survey on Dialog Management: Recent Advances and Challenges
Yinpei Dai
Huihua Yu
Yixuan Jiang
Chengguang Tang
Yongbin Li
Jian Sun
OffRL
VLM
30
20
0
05 May 2020
Towards Unsupervised Language Understanding and Generation by Joint Dual Learning
Shang-Yu Su
Chao-Wei Huang
Yun-Nung Chen
22
26
0
30 Apr 2020
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
30
178
0
11 Sep 2019
A Corpus-free State2Seq User Simulator for Task-oriented Dialogue
Yutai Hou
Meng Fang
Wanxiang Che
Ting Liu
OffRL
24
7
0
10 Sep 2019
Hill Climbing on Value Estimates for Search-control in Dyna
Yangchen Pan
Hengshuai Yao
Amir-massoud Farahmand
Martha White
22
18
0
18 Jun 2019
Budgeted Policy Learning for Task-Oriented Dialogue Systems
Zhirui Zhang
Xiujun Li
Jianfeng Gao
Enhong Chen
OffRL
35
36
0
02 Jun 2019
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
Yonathan Efroni
Nadav Merlis
Mohammad Ghavamzadeh
Shie Mannor
OffRL
24
68
0
27 May 2019
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
49
670
0
21 Sep 2018
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
Shang-Yu Su
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
OffRL
27
67
0
28 Aug 2018
Dialogue Learning With Human-In-The-Loop
Jiwei Li
Alexander H. Miller
S. Chopra
Marc'Aurelio Ranzato
Jason Weston
OffRL
227
134
0
29 Nov 2016
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri
Jing He
Kaheer Suleman
57
117
0
30 Jun 2016