Adaptive Dialog Policy Learning with Hindsight and User Modeling

7 May 2020

Papers citing "Adaptive Dialog Policy Learning with Hindsight and User Modeling"

Title
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative Sho Shimoyama Tetsuro Morimura Kenshi Abe Toda Takamichi Yuta Tomomatsu Masakazu Sugiyama Asahi Hentona Yuuki Azuma Hirotaka Ninomiya OffRL 26 0 0 13 Jul 2023
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning Xiao Yu Maximillian Chen Zhou Yu LLMAG LM&Ro 32 35 0 23 May 2023
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization Yangyang Zhao Zhenyu Wang Mehdi Dastani Shihan Wang 24 0 0 05 May 2023
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning Wai-Chung Kwan Hongru Wang Huimin Wang Kam-Fai Wong OffRL 38 43 0 28 Feb 2022
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems E. Razumovskaia Goran Glavaš Olga Majewska Edoardo Ponti Anna Korhonen Ivan Vulić 33 32 0 17 Apr 2021
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems Layla El Asri Jing He Kaheer Suleman 57 117 0 30 Jun 2016