Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

19 July 2021

Papers citing "Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images"

Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models

226

03 Dec 2025

MDSEval: A Meta-Evaluation Benchmark for Multimodal Dialogue Summarization

Srikanth Vishnubhotla

Robinson Piramuthu

Saab Mansour

104

02 Oct 2025

F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model

185

25 Aug 2025

On the Effectiveness of Integration Methods for Multimodal Dialogue Response Retrieval

190

13 Jun 2025

Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

191

31 May 2025

KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

...

468

10 Mar 2025

MuDoC: An Interactive Multimodal Document-grounded Conversational AI System

AAAI Spring Symposia (ASS), 2025

Karan Taneja

Ashok K. Goel

662

14 Feb 2025

MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

Wanqi Yang

Yongqian Li

Meng Fang

Lawrence Yunliang Chen

279

09 Feb 2025

'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue

335

31 Oct 2024

An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation

Meishan Zhang

330

16 Aug 2024

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

European Conference on Computer Vision (ECCV), 2024

Yu-Jung Heo

Chang D. Yoo

222

12 Aug 2024

Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

293

04 Jul 2024

MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

280

05 Mar 2024

Large Language Models can Share Images, Too!

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

384

23 Oct 2023

Teaching Text-to-Image Models to Communicate in Dialog

Yuxuan Wang

Dongyan Zhao

158

27 Sep 2023

Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models

182

31 Aug 2023

MPCHAT: Towards Multimodal Persona-Grounded Conversation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

174

27 May 2023

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Yunshui Li

Binyuan Hui

Zhichao Yin

Min Yang

Fei Huang

Yongbin Li

197

24 May 2023

IMAD: IMage-Augmented multi-modal Dialogue

Journal of Mathematical Sciences (J. Math. Sci.), 2023

Viktor Moskvoretskii

Anton Frolov

Denis Kuznetsov

169

17 May 2023

Retrieving Multimodal Information for Augmented Generation: A Survey

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Hailin Chen

...

394

125

20 Mar 2023

TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World

ACM Multimedia (ACM MM), 2023

...

Qin Jin

201

14 Jan 2023

DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

296

08 Dec 2022

MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Dongyan Zhao

251

10 Nov 2022

BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasets

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Jinyoung Yeo

210

23 Oct 2022