ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.11995
  4. Cited By
Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations

Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations

23 August 2023
Karthik Gopalakrishnan
Behnam Hedayatnia
Qinlang Chen
Anna Gottardi
Sanjeev Kwatra
Anu Venkatesh
Raefer Gabriel
Dilek Z. Hakkani-Tür
    AI4MH
    BDL
ArXivPDFHTML

Papers citing "Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations"

50 / 203 papers shown
Title
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts
Hanhua Hong
Chenghao Xiao
Yang Wang
Y. Liu
Wenge Rong
Chenghua Lin
26
0
0
29 Apr 2025
Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection
Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection
Atharva Kulkarni
Yuan-kang Zhang
Joel Ruben Antony Moniz
Xiou Ge
Bo-Hsiang Tseng
Dhivya Piraviperumal
S.
Hong-ye Yu
HILM
76
0
0
25 Apr 2025
A Scalable Framework for Evaluating Health Language Models
A Scalable Framework for Evaluating Health Language Models
Neil Mallinar
A. Heydari
Xin Liu
Anthony Z. Faranesh
Brent Winslow
...
Mark Malhotra
Shwetak N. Patel
Javier L. Prieto
Daniel J. McDuff
Ahmed A. Metwally
LM&MA
56
2
0
30 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
49
1
0
14 Mar 2025
Grammar Control in Dialogue Response Generation for Language Learning Chatbots
Grammar Control in Dialogue Response Generation for Language Learning Chatbots
Dominik Glandorf
Peng Cui
Detmar Meurers
Mrinmaya Sachan
KELM
58
1
0
11 Feb 2025
Measuring the Robustness of Reference-Free Dialogue Evaluation Systems
Measuring the Robustness of Reference-Free Dialogue Evaluation Systems
Justin Vasselli
Adam Nohejl
Taro Watanabe
AAML
44
0
0
12 Jan 2025
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic
  Survey
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
40
16
0
14 Nov 2024
Policy-driven Knowledge Selection and Response Generation for
  Document-grounded Dialogue
Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue
Longxuan Ma
Jiapeng Li
Mingda Li
W. Zhang
Ting Liu
33
1
0
21 Oct 2024
Multi-Facet Counterfactual Learning for Content Quality Evaluation
Multi-Facet Counterfactual Learning for Content Quality Evaluation
Jiasheng Zheng
Hongyu Lin
Boxi Cao
M. Liao
Y. Lu
Xianpei Han
Le Sun
28
0
0
10 Oct 2024
From Pixels to Personas: Investigating and Modeling
  Self-Anthropomorphism in Human-Robot Dialogues
From Pixels to Personas: Investigating and Modeling Self-Anthropomorphism in Human-Robot Dialogues
Yu Li
Devamanyu Hazarika
Di Jin
Julia Hirschberg
Yang Liu
26
0
0
04 Oct 2024
Pre-trained Language Models Return Distinguishable Probability
  Distributions to Unfaithfully Hallucinated Texts
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts
Taehun Cha
Donghun Lee
HILM
29
1
0
25 Sep 2024
Large Language Models for Automatic Detection of Sensitive Topics
Large Language Models for Automatic Detection of Sensitive Topics
Ruoyu Wen
Stephanie Elena Crowe
Kunal Gupta
Xinyue Li
Mark Billinghurst
S. Hoermann
Dwain Allan
Alaeddin Nassani
Thammathip Piumsomboon
AI4MH
29
0
0
02 Sep 2024
Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs
Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs
John Mendonça
Isabel Trancoso
A. Lavie
ALM
29
1
0
20 Aug 2024
Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation
  Instructions
Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions
Bhuvanashree Murugadoss
Christian Poelitz
Ian Drosos
Vu Le
Nick McKenna
Carina Negreanu
Chris Parnin
Advait Sarkar
ELM
ALM
35
13
0
16 Aug 2024
Enhancing Hallucination Detection through Perturbation-Based Synthetic
  Data Generation in System Responses
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
Dongxu Zhang
Varun Gangal
B. Lattimer
Yi Yang
35
6
0
07 Jul 2024
LLM Roleplay: Simulating Human-Chatbot Interaction
LLM Roleplay: Simulating Human-Chatbot Interaction
Hovhannes Tamoyan
Hendrik Schuff
Iryna Gurevych
44
8
0
04 Jul 2024
On the Benchmarking of LLMs for Open-Domain Dialogue Evaluation
On the Benchmarking of LLMs for Open-Domain Dialogue Evaluation
John Mendonça
A. Lavie
Isabel Trancoso
ELM
43
2
0
04 Jul 2024
LLMs instead of Human Judges? A Large Scale Empirical Study across 20
  NLP Evaluation Tasks
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
A. Bavaresco
Raffaella Bernardi
Leonardo Bertolazzi
Desmond Elliott
Raquel Fernández
...
David Schlangen
Alessandro Suglia
Aditya K Surikuchi
Ece Takmaz
A. Testoni
ALM
ELM
46
62
0
26 Jun 2024
Themis: Towards Flexible and Interpretable NLG Evaluation
Themis: Towards Flexible and Interpretable NLG Evaluation
Xinyu Hu
Li Lin
Mingqi Gao
Xunjian Yin
Xiaojun Wan
ELM
34
6
0
26 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and
  Metrics for Open Domain Question Answering in the Era of Large Language
  Models
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
Akchay Srivastava
Atif Memon
ELM
40
1
0
19 Jun 2024
ComperDial: Commonsense Persona-grounded Dialogue Dataset and Benchmark
ComperDial: Commonsense Persona-grounded Dialogue Dataset and Benchmark
Hiromi Wakaki
Yuki Mitsufuji
Yoshinori Maeda
Yukiko Nishimura
Silin Gao
Mengjie Zhao
Keiichi Yamada
Antoine Bosselut
37
0
0
17 Jun 2024
Detecting Response Generation Not Requiring Factual Judgment
Detecting Response Generation Not Requiring Factual Judgment
Ryohei Kamei
Daiki Shiono
Reina Akama
Jun Suzuki
HILM
32
0
0
14 Jun 2024
Designing a Dashboard for Transparency and Control of Conversational AI
Designing a Dashboard for Transparency and Control of Conversational AI
Yida Chen
Aoyu Wu
Trevor DePodesta
Catherine Yeh
Kenneth Li
...
Jan Riecke
Shivam Raval
Olivia Seow
Martin Wattenberg
Fernanda Viégas
44
16
0
12 Jun 2024
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
Se Jin Park
Chae Won Kim
Hyeongseop Rha
Minsu Kim
Joanna Hong
Jeong Hun Yeo
Yong Man Ro
CVBM
AuLLM
40
6
0
12 Jun 2024
Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt
  LLMs for Dialogue
Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue
Simone Alghisi
Massimo Rizzoli
Gabriel Roccabruna
Seyed Mahed Mousavi
Giuseppe Riccardi
OffRL
36
8
0
10 Jun 2024
SLIDE: A Framework Integrating Small and Large Language Models for
  Open-Domain Dialogues Evaluation
SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation
Kun Zhao
Bohao Yang
Chen Tang
Chenghua Lin
Liang Zhan
41
5
0
24 May 2024
CHARP: Conversation History AwaReness Probing for Knowledge-grounded
  Dialogue Systems
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Abbas Ghaddar
David Alfonso-Hermelo
Philippe Langlais
Mehdi Rezagholizadeh
Boxing Chen
Prasanna Parthasarathi
34
0
0
24 May 2024
Efficient Data Generation for Source-grounded Information-seeking
  Dialogs: A Use Case for Meeting Transcripts
Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts
Lotem Golany
Filippo Galgani
Maya Mamo
Nimrod Parasol
Omer Vandsburger
Nadav Bar
Ido Dagan
27
2
0
02 May 2024
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination
  Evaluation on Natural Language Generation
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation
Siya Qi
Yulan He
Zheng Yuan
LRM
HILM
38
1
0
18 Apr 2024
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded
  Dialogue Generation
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation
Jifan Yu
Xiaohan Zhang
Yifan Xu
Xuanyu Lei
Zijun Yao
Jing Zhang
Lei Hou
Juanzi Li
HILM
28
1
0
04 Apr 2024
PairEval: Open-domain Dialogue Evaluation with Pairwise Comparison
PairEval: Open-domain Dialogue Evaluation with Pairwise Comparison
chaeHun Park
Minseok Choi
Dohyun Lee
Jaegul Choo
35
5
0
01 Apr 2024
CheckEval: A reliable LLM-as-a-Judge framework for evaluating text generation using checklists
CheckEval: A reliable LLM-as-a-Judge framework for evaluating text generation using checklists
Yukyung Lee
Joonghoon Kim
Jaehee Kim
Hyowon Cho
Pilsung Kang
Pilsung Kang
Najoung Kim
ELM
47
4
0
27 Mar 2024
A Large Collection of Model-generated Contradictory Responses for
  Consistency-aware Dialogue Systems
A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems
Shiki Sato
Reina Akama
Jun Suzuki
Kentaro Inui
15
0
0
19 Mar 2024
StreamingDialogue: Prolonged Dialogue Learning via Long Context
  Compression with Minimal Losses
StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses
Jia-Nan Li
Quan Tu
Cunli Mao
Zhengtao Yu
Ji-Rong Wen
Rui Yan
OffRL
24
3
0
13 Mar 2024
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification,
  Retrieval, and Synthesis in Question Answering
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
Yiming Du
Hongru Wang
Zhengyi Zhao
Bin Liang
Baojun Wang
Wanjun Zhong
Zezhong Wang
Kam-Fai Wong
RALM
37
7
0
26 Feb 2024
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical
  Criteria Decomposition
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition
Yuxuan Liu
Tianchi Yang
Shaohan Huang
Zihan Zhang
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
26
12
0
24 Feb 2024
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on
  Zero-shot LLM Assessment
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Vyas Raina
Adian Liusie
Mark J. F. Gales
AAML
ELM
24
52
0
21 Feb 2024
Investigating Content Planning for Navigating Trade-offs in
  Knowledge-Grounded Dialogue
Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue
Kushal Chawla
Hannah Rashkin
Gaurav Singh Tomar
David Reitter
26
1
0
03 Feb 2024
Making a Long Story Short in Conversation Modeling
Making a Long Story Short in Conversation Modeling
Yufei Tao
Tiernan Mines
Ameeta Agrawal
22
0
0
31 Jan 2024
Leveraging Large Language Models for NLG Evaluation: Advances and
  Challenges
Leveraging Large Language Models for NLG Evaluation: Advances and Challenges
Zhen Li
Xiaohan Xu
Tao Shen
Can Xu
Jia-Chen Gu
Yuxuan Lai
Chongyang Tao
Shuai Ma
LM&MA
ELM
34
9
0
13 Jan 2024
$\textit{Dial BeInfo for Faithfulness}$: Improving Factuality of
  Information-Seeking Dialogue via Behavioural Fine-Tuning
Dial BeInfo for Faithfulness\textit{Dial BeInfo for Faithfulness}Dial BeInfo for Faithfulness: Improving Factuality of Information-Seeking Dialogue via Behavioural Fine-Tuning
E. Razumovskaia
Ivan Vulić
Pavle Marković
Tomasz Cichy
Qian Zheng
Tsung-Hsien Wen
Paweł Budzianowski
HILM
32
10
0
16 Nov 2023
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented
  Instruction Tuning with Auxiliary Evaluation Aspects
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects
Minqian Liu
Ying Shen
Zhiyang Xu
Yixin Cao
Eunah Cho
Vaibhav Kumar
Reza Ghanadan
Lifu Huang
ELM
LM&MA
ALM
44
25
0
15 Nov 2023
Multi-User MultiWOZ: Task-Oriented Dialogues among Multiple Users
Multi-User MultiWOZ: Task-Oriented Dialogues among Multiple Users
Yohan Jo
Xinyan Zhao
Arijit Biswas
Nikoletta Basiou
Vincent Auvray
Nikolaos Malandrakis
A. Metallinou
Alexandros Potamianos
24
2
0
31 Oct 2023
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded
  Dialogue Generation
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation
Yixin Wan
Fanyou Wu
Weijie Xu
Srinivasan H. Sengamedu
HILM
24
5
0
28 Oct 2023
Topic Segmentation of Semi-Structured and Unstructured Conversational
  Datasets using Language Models
Topic Segmentation of Semi-Structured and Unstructured Conversational Datasets using Language Models
Reshmi Ghosh
Harjeet Singh Kajal
Sharanya Kamath
Dhuri Shrivastava
Samyadeep Basu
Hansi Zeng
Soundararajan Srinivasan
10
0
0
26 Oct 2023
DiQAD: A Benchmark Dataset for End-to-End Open-domain Dialogue
  Assessment
DiQAD: A Benchmark Dataset for End-to-End Open-domain Dialogue Assessment
Yukun Zhao
Lingyong Yan
Weiwei Sun
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
Dawei Yin
ELM
14
0
0
25 Oct 2023
Alquist 5.0: Dialogue Trees Meet Generative Models. A Novel Approach for
  Enhancing SocialBot Conversations
Alquist 5.0: Dialogue Trees Meet Generative Models. A Novel Approach for Enhancing SocialBot Conversations
Ondrej Kobza
Jan Cuhel
Tommaso Gargiani
David Herel
Petr Marek
16
3
0
24 Oct 2023
Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenario
  Multi-Domain Dialogue Summarization
Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenario Multi-Domain Dialogue Summarization
Weixiao Zhou
Gengyao Li
Xianfu Cheng
Xinnian Liang
Junnan Zhu
Feifei Zhai
Zhoujun Li
16
5
0
16 Oct 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems
  via Knowledge Enhancement and Alignment
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
Boyang Xue
Weichao Wang
Hongru Wang
Fei Mi
Rui Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
KELM
HILM
211
15
0
12 Oct 2023
A Closer Look into Automatic Evaluation Using Large Language Models
A Closer Look into Automatic Evaluation Using Large Language Models
Cheng-Han Chiang
Hunghuei Lee
ELM
ALM
LM&MA
25
13
0
09 Oct 2023
12345
Next