ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1605.07669
  4. Cited By
On-line Active Reward Learning for Policy Optimisation in Spoken
  Dialogue Systems

On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems

24 May 2016
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
    OffRL
ArXivPDFHTML

Papers citing "On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems"

24 / 24 papers shown
Title
Why Guided Dialog Policy Learning performs well? Understanding the role
  of adversarial learning and its alternative
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
26
0
0
13 Jul 2023
Rescue Conversations from Dead-ends: Efficient Exploration for
  Task-oriented Dialogue Policy Optimization
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
21
0
0
05 May 2023
Offline RL for Natural Language Generation with Implicit Language Q
  Learning
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
144
102
0
05 Jun 2022
Report from the NSF Future Directions Workshop on Automatic Evaluation
  of Dialog: Research Directions and Challenges
Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges
Shikib Mehri
Jinho Choi
L. F. D’Haro
Jan Deriu
M. Eskénazi
...
David Traum
Yi-Ting Yeh
Zhou Yu
Yizhe Zhang
Chen Zhang
30
21
0
18 Mar 2022
A Survey on Recent Advances and Challenges in Reinforcement Learning
  Methods for Task-Oriented Dialogue Policy Learning
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
38
43
0
28 Feb 2022
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
Xiaofeng Gao
Qiaozi Gao
Ran Gong
Kaixiang Lin
Govind Thattai
Gaurav Sukhatme
LM&Ro
89
70
0
27 Feb 2022
How to Evaluate Your Dialogue Models: A Review of Approaches
How to Evaluate Your Dialogue Models: A Review of Approaches
Xinmeng Li
Wansen Wu
Long Qin
Quanjun Yin
ELM
30
8
0
03 Aug 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
47
73
0
01 Jan 2021
GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented
  Dialogue Systems
GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented Dialogue Systems
Shiquan Yang
Rui Zhang
S. Erfani
27
60
0
04 Oct 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward
  Estimation
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward
  Decomposition
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition
Ryuichi Takanobu
Runze Liang
Minlie Huang
LLMAG
19
54
0
08 Apr 2020
Teaching Machines to Converse
Teaching Machines to Converse
Jiwei Li
29
4
0
31 Jan 2020
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain
  Task-Oriented Dialog
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
Ryuichi Takanobu
Hanlin Zhu
Minlie Huang
21
89
0
28 Aug 2019
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue
Chien-Sheng Wu
R. Socher
Caiming Xiong
24
165
0
15 Jan 2019
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for
  Model-based Control
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
21
5
0
24 Dec 2018
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
46
669
0
21 Sep 2018
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement
  Learning
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning
Li He
Liang Wang
Kaipeng Liu
Bo Wu
Weinan Zhang
29
7
0
20 Mar 2018
Building a Conversational Agent Overnight with Dialogue Self-Play
Building a Conversational Agent Overnight with Dialogue Self-Play
Pararth Shah
Dilek Z. Hakkani-Tür
Gokhan Tur
Abhinav Rastogi
Ankur Bapna
Neha Nayak Kennard
Larry Heck
34
193
0
15 Jan 2018
Recent Trends in Deep Learning Based Natural Language Processing
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
35
2,824
0
09 Aug 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised
  Data for Dialogue Management
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
14
129
0
01 Jul 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Neural Belief Tracker: Data-Driven Dialogue State Tracking
Neural Belief Tracker: Data-Driven Dialogue State Tracking
N. Mrksic
Diarmuid Ó Séaghdha
Tsung-Hsien Wen
Blaise Thomson
S. Young
11
480
0
12 Jun 2016
Continuously Learning Neural Dialogue Management
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
19
122
0
08 Jun 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
21
1,098
0
15 Apr 2016
1