Reinforcement Learning for Personalized Dialogue Management

International Conference on Wirtschaftsinformatik (WI), 2019
1 August 2019
Floris den Hengst
Mark Hoogendoorn
F. V. Harmelen
Joost Bosman
    OffRL

Papers citing "Reinforcement Learning for Personalized Dialogue Management"

5 / 5 papers shown
Crafting Customisable Characters with LLMs: Introducing SimsChat, a Persona-Driven Role-Playing Agent Framework
Bohao Yang
Dong Liu
Chen Tang
Chenghao Xiao
Kun Zhao
Chao Li
Lin Yuan
Guang Yang
Lanxiao Huang
Chenghua Lin
306
0
0
25 Jun 2024
Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition
Floris den Hengst
Ralf Wolter
Patrick Altmeyer
Arda Kaygan
369
4
0
27 Mar 2024
CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Quan Tu
Shilong Fan
Zihang Tian
Rui Yan
491
121
0
02 Jan 2024
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hannah Rose Kirk
Andrew M. Bean
Bertie Vidgen
Paul Röttger
Scott A. Hale
ALM
398
65
0
11 Oct 2023
Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback
Hannah Rose Kirk
Bertie Vidgen
Paul Röttger
Scott A. Hale
343
125
0
09 Mar 2023