1605.07669
Cited By
On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
24 May 2016
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
OffRL
Papers citing
"On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems"
24 / 24 papers shown
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
26
0
0
13 Jul 2023
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
21
0
0
05 May 2023
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
144
102
0
05 Jun 2022
Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges
Shikib Mehri
Jinho Choi
L. F. D’Haro
Jan Deriu
M. Eskénazi
...
David Traum
Yi-Ting Yeh
Zhou Yu
Yizhe Zhang
Chen Zhang
30
21
0
18 Mar 2022
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
38
43
0
28 Feb 2022
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
Xiaofeng Gao
Qiaozi Gao
Ran Gong
Kaixiang Lin
Govind Thattai
Gaurav Sukhatme
LM&Ro
89
70
0
27 Feb 2022
How to Evaluate Your Dialogue Models: A Review of Approaches
Xinmeng Li
Wansen Wu
Long Qin
Quanjun Yin
ELM
30
8
0
03 Aug 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
47
73
0
01 Jan 2021
GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented Dialogue Systems
Shiquan Yang
Rui Zhang
S. Erfani
27
60
0
04 Oct 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition
Ryuichi Takanobu
Runze Liang
Minlie Huang
LLMAG
19
54
0
08 Apr 2020
Teaching Machines to Converse
Jiwei Li
29
4
0
31 Jan 2020
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
Ryuichi Takanobu
Hanlin Zhu
Minlie Huang
21
89
0
28 Aug 2019
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue
Chien-Sheng Wu
R. Socher
Caiming Xiong
24
165
0
15 Jan 2019
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
21
5
0
24 Dec 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
46
669
0
21 Sep 2018
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning
Li He
Liang Wang
Kaipeng Liu
Bo Wu
Weinan Zhang
29
7
0
20 Mar 2018
Building a Conversational Agent Overnight with Dialogue Self-Play
Pararth Shah
Dilek Z. Hakkani-Tür
Gokhan Tur
Abhinav Rastogi
Ankur Bapna
Neha Nayak Kennard
Larry Heck
34
193
0
15 Jan 2018
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
35
2,824
0
09 Aug 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
14
129
0
01 Jul 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Neural Belief Tracker: Data-Driven Dialogue State Tracking
N. Mrksic
Diarmuid Ó Séaghdha
Tsung-Hsien Wen
Blaise Thomson
S. Young
11
480
0
12 Jun 2016
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
19
122
0
08 Jun 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
21
1,098
0
15 Apr 2016