ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.03809
  4. Cited By
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward
  Decomposition
v1v2 (latest)

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Annual Meeting of the Association for Computational Linguistics (ACL), 2020
8 April 2020
Ryuichi Takanobu
Runze Liang
Shiyu Huang
    LLMAG
ArXiv (abs)PDFHTML

Papers citing "Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition"

27 / 27 papers shown
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit ProfilesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Kuang Wang
Xianrui Li
Steve Yang
Li Zhou
Feng Jiang
Haoyang Li
506
17
0
26 Feb 2025
Rewarding What Matters: Step-by-Step Reinforcement Learning for
  Task-Oriented Dialogue
Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Huifang Du
Shuqin Li
Minghao Wu
Xuejing Feng
Yuan-Fang Li
Haofen Wang
OffRL
348
5
0
20 Jun 2024
Planning Like Human: A Dual-process Framework for Dialogue Planning
Planning Like Human: A Dual-process Framework for Dialogue PlanningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Tao He
Lizi Liao
Yixin Cao
Yuanxing Liu
Ming Liu
Zerui Chen
Bing Qin
386
48
0
08 Jun 2024
A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems
A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems
Zihao Yi
Jiarui Ouyang
Yuwen Liu
Yuwen Liu
Tianhao Liao
Haohao Luo
Ying Shen
LRMLLMAG
540
180
0
28 Feb 2024
A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
Hongru Wang
Lingzhi Wang
Yiming Du
Liang Chen
Jing Zhou
Yufei Wang
Kam-Fai Wong
LRM
576
23
0
28 Nov 2023
A Task-oriented Dialog Model with Task-progressive and Policy-aware
  Pre-training
A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-trainingNatural Language Processing and Chinese Computing (NLPCC), 2023
Lucen Zhong
Hengtong Lu
Caixia Yuan
Fangkun Zhao
Jiashen Sun
Ke Zeng
Guanglu Wan
VLM
220
1
0
01 Oct 2023
JoTR: A Joint Transformer and Reinforcement Learning Framework for
  Dialog Policy Learning
JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning
Wai-Chung Kwan
Huimin Wang
Hongru Wang
Zezhong Wang
Xian Wu
Yefeng Zheng
Kam-Fai Wong
OffRL
236
1
0
01 Sep 2023
PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator
PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User SimulatorAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Chuyi Kong
Yaxin Fan
Xiang Wan
Feng Jiang
Benyou Wang
290
26
0
21 Aug 2023
Why Guided Dialog Policy Learning performs well? Understanding the role
  of adversarial learning and its alternative
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
178
1
0
13 Jul 2023
Rescue Conversations from Dead-ends: Efficient Exploration for
  Task-oriented Dialogue Policy Optimization
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy OptimizationTransactions of the Association for Computational Linguistics (TACL), 2023
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
230
3
0
05 May 2023
An Asynchronous Updating Reinforcement Learning Framework for
  Task-oriented Dialog System
An Asynchronous Updating Reinforcement Learning Framework for Task-oriented Dialog SystemIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Bengisu Cagiltay
Bilge Mutlu
Xiaojie Wang
Caixia Yuan
OffRL
133
0
0
04 May 2023
Reinforced Approximate Exploratory Data Analysis
Reinforced Approximate Exploratory Data AnalysisAAAI Conference on Artificial Intelligence (AAAI), 2022
Shaddy Garg
Subrata Mitra
Tong Yu
Yash Gadhia
A. Kashettiwar
236
6
0
12 Dec 2022
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog
  with Reinforced Keywords Learning
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Xiao Yu
Qingyang Wu
Kun Qian
Zhou Yu
OffRL
329
13
0
30 Nov 2022
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with
  User Simulator
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User SimulatorConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Qinyuan Cheng
Linyang Li
Guofeng Quan
Feng Gao
Xiaofeng Mou
Xipeng Qiu
216
20
0
26 Oct 2022
A Generative User Simulator with GPT-based Architecture and Goal State
  Tracking for Reinforced Multi-Domain Dialog Systems
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
Hong Liu
Yucheng Cai
Zhijian Ou
Yi Huang
Junlan Feng
ELM
219
7
0
17 Oct 2022
Jointly Reinforced User Simulator and Task-oriented Dialog System with
  Simplified Generative Architecture
Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture
Abhishek Sethi
Zhijian Ou
Yi Huang
Junlan Feng
RALM
145
1
0
13 Oct 2022
A Survey on Recent Advances and Challenges in Reinforcement Learning
  Methods for Task-Oriented Dialogue Policy Learning
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy LearningMachine Intelligence Research (MIR), 2022
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
305
62
0
28 Feb 2022
Reinforced Natural Language Interfaces via Entropy Decomposition
Reinforced Natural Language Interfaces via Entropy Decomposition
Xiaoran Wu
Yipeng Kang
LLMAG
215
0
0
23 Sep 2021
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking
  Consistency for Task-oriented Dialogue System
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue SystemConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Libo Qin
Tianbao Xie
Shijue Huang
Qiguang Chen
Xiao Xu
Wanxiang Che
253
21
0
23 Sep 2021
WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation
  for Multi-turn Dialogue
WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation for Multi-turn Dialogue
Anant Khandelwal
OffRL
359
6
0
01 Aug 2021
Transferable Dialogue Systems and User Simulators
Transferable Dialogue Systems and User SimulatorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Bo-Hsiang Tseng
Yinpei Dai
Florian Kreyssig
Bill Byrne
284
62
0
25 Jul 2021
High-Quality Diversification for Task-Oriented Dialogue Systems
High-Quality Diversification for Task-Oriented Dialogue SystemsFindings (Findings), 2021
Zhiwen Tang
Hrishikesh Kulkarni
Grace Hui Yang
263
11
0
02 Jun 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic
  Survey
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic SurveyArtificial Intelligence Review (AIR), 2021
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Xiaoshi Zhong
996
336
0
10 May 2021
CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for
  Conversational Recommendation
CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for Conversational RecommendationConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Wenchang Ma
Ryuichi Takanobu
Shiyu Huang
443
67
0
20 Oct 2020
Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical
  Analysis of System-wise Evaluation
Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical Analysis of System-wise Evaluation
Ryuichi Takanobu
Qi Zhu
Jinchao Li
Baolin Peng
Jianfeng Gao
Shiyu Huang
168
51
0
15 May 2020
Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
Zheng Zhang
Lizi Liao
Xiaoyan Zhu
Tat-Seng Chua
Zitao Liu
Yan Huang
Shiyu Huang
LLMAG
310
25
0
21 Apr 2020
Recent Advances and Challenges in Task-oriented Dialog System
Recent Advances and Challenges in Task-oriented Dialog SystemScience China Technological Sciences (Sci China Technol Sci), 2020
Zheng Zhang
Ryuichi Takanobu
Qi Zhu
Shiyu Huang
Xiaoyan Zhu
LLMAG
636
201
0
17 Mar 2020
1
Page 1 of 1