ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.03809
  4. Cited By
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward
  Decomposition

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

8 April 2020
Ryuichi Takanobu
Runze Liang
Minlie Huang
    LLMAG
ArXivPDFHTML

Papers citing "Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition"

6 / 6 papers shown
Title
Why Guided Dialog Policy Learning performs well? Understanding the role
  of adversarial learning and its alternative
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
18
0
0
13 Jul 2023
Jointly Reinforced User Simulator and Task-oriented Dialog System with
  Simplified Generative Architecture
Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture
Abhishek Sethi
Zhijian Ou
Yi Huang
Junlan Feng
RALM
16
1
0
13 Oct 2022
A Survey on Recent Advances and Challenges in Reinforcement Learning
  Methods for Task-Oriented Dialogue Policy Learning
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
28
42
0
28 Feb 2022
Reinforced Natural Language Interfaces via Entropy Decomposition
Reinforced Natural Language Interfaces via Entropy Decomposition
Xiaoran Wu
Yipeng Kang
LLMAG
19
0
0
23 Sep 2021
WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation
  for Multi-turn Dialogue
WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation for Multi-turn Dialogue
Anant Khandelwal
OffRL
10
6
0
01 Aug 2021
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue
  Systems
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri
Jing He
Kaheer Suleman
49
117
0
30 Jun 2016
1