ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.11277
  4. Cited By
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue
  Policy Learning

Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning

31 October 2017
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung (Vivian) Chen
Kam-Fai Wong
ArXivPDFHTML

Papers citing "Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning"

6 / 6 papers shown
Title
Foundation Metrics for Evaluating Effectiveness of Healthcare
  Conversations Powered by Generative AI
Foundation Metrics for Evaluating Effectiveness of Healthcare Conversations Powered by Generative AI
Mahyar Abbasian
Elahe Khatibi
Iman Azimi
David Oniani
Zahra Shakeri Hossein Abad
...
Bryant Lin
Olivier Gevaert
Li-Jia Li
Ramesh C. Jain
Amir M. Rahmani
LM&MA
ELM
AI4MH
45
66
0
21 Sep 2023
Rescue Conversations from Dead-ends: Efficient Exploration for
  Task-oriented Dialogue Policy Optimization
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
24
0
0
05 May 2023
A Survey on Recent Advances and Challenges in Reinforcement Learning
  Methods for Task-Oriented Dialogue Policy Learning
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
38
43
0
28 Feb 2022
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC
  Placement Based on Availability and Energy Consumption
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC Placement Based on Availability and Energy Consumption
Guto Leoni Santos
Theo Lynn
J. Kelner
P. Endo
14
0
0
12 Oct 2020
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
Shang-Yu Su
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung (Vivian) Chen
OffRL
27
67
0
28 Aug 2018
1