ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.10735
  4. Cited By
Building Persona Consistent Dialogue Agents with Offline Reinforcement
  Learning

Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning

16 October 2023
Ryan Shea
Zhou Yu
    OffRL
ArXivPDFHTML

Papers citing "Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning"

8 / 8 papers shown
Title
Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning
Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning
Ke Ji
Yixin Lian
Linxu Li
Jingsheng Gao
Weiyuan Li
Bin Dai
36
0
0
22 Mar 2025
From Individual to Society: A Survey on Social Simulation Driven by
  Large Language Model-based Agents
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Xinyi Mou
Xuanwen Ding
Qi He
Liang Wang
Jingcong Liang
...
L. Sun
Jiayu Lin
Jie Zhou
Xuanjing Huang
Zhongyu Wei
LLMAG
LM&Ro
AI4CE
86
13
0
04 Dec 2024
The Oscars of AI Theater: A Survey on Role-Playing with Language Models
The Oscars of AI Theater: A Survey on Role-Playing with Language Models
Nuo Chen
Yan Wang
Yang Deng
Jia Li
30
15
0
16 Jul 2024
From Persona to Personalization: A Survey on Role-Playing Language
  Agents
From Persona to Personalization: A Survey on Role-Playing Language Agents
Jiangjie Chen
Xintao Wang
Rui Xu
Siyu Yuan
Yikai Zhang
...
Caiyu Hu
Siye Wu
Scott Ren
Ziquan Fu
Yanghua Xiao
50
76
0
28 Apr 2024
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Shaoteng Liu
Haoqi Yuan
Minda Hu
Yanwei Li
Yukang Chen
Shu Liu
Zongqing Lu
Jiaya Jia
LLMAG
40
14
0
29 Feb 2024
Offline RL for Natural Language Generation with Implicit Language Q
  Learning
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
123
101
0
05 Jun 2022
Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion
  Dialogues via Reinforcement Learning and Human Demonstration
Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration
Weiyan Shi
Yu Li
Saurav Sahay
Zhou Yu
OffRL
112
26
0
31 Dec 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020
1