Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

19 July 2021

ArXiv (abs)PDF HTML Github (22★)

Papers citing "Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images"

24 / 24 papers shown

Title
Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models Shojiro Yamabe Futa Waseda Daiki Shiono Tsubasa Takahashi DiffM MLLM VLM 202 0 0 03 Dec 2025
MDSEval: A Meta-Evaluation Benchmark for Multimodal Dialogue Summarization Yinhong Liu Jianfeng He Hang Su Ruixue Lian Yi Nian Jake W. Vincent Srikanth Vishnubhotla Robinson Piramuthu Saab Mansour 104 0 0 02 Oct 2025
F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model Hanbo Bi Zhiqiang Yuan Zexi Jia Jiapei Zhang Chongyang Li Peixiang Luo Ying Deng Xiaoyue Duan Jinchao Zhang VLM 181 0 0 25 Aug 2025
On the Effectiveness of Integration Methods for Multimodal Dialogue Response Retrieval Seongbo Jang Seonghyeon Lee Dongha Lee Hwanjo Yu 186 0 0 13 Jun 2025
Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic InteractionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Jihyoung Jang Minwook Bae Minji Kim Dilek Z. Hakkani-Tür Hyounghun Kim 183 1 0 31 May 2025
KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue CorpusNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 Xiaoming Shi Zeming Liu Chenkai Zhang Yiming Lei Haitao Leng ... Qingjie Liu Wanxiang Che Shaoguo Liu Size Li Yanjie Wang 440 1 0 10 Mar 2025
MuDoC: An Interactive Multimodal Document-grounded Conversational AI SystemAAAI Spring Symposia (ASS), 2025 Karan Taneja Ashok K. Goel 638 4 0 14 Feb 2025
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational AgentsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 Wanqi Yang Yongqian Li Meng Fang Lawrence Yunliang Chen 271 1 0 09 Feb 2025
Ño' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue Rena Gao Xuetong Wu Siwen Luo Caren Han Feng Liu OODD 311 1 0 31 Oct 2024
An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation Peiming Guo Sinuo Liu Yanzhao Zhang Dingkun Long Pengjun Xie Meishan Zhang Hao Fei DiffM 322 1 0 16 Aug 2024
BI-MDRG: Bridging Image History in Multimodal Dialogue Response GenerationEuropean Conference on Computer Vision (ECCV), 2024 Hee Suk Yoon Eunseop Yoon Joshua Tian Jin Tee Kang Zhang Yu-Jung Heo Du-Seong Chang Chang D. Yoo 210 6 0 12 Aug 2024
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge Young-Jun Lee Dokyong Lee Junyoung Youn Kyeongjin Oh ByungSoo Ko Jonghwan Hyeon Ho-Jin Choi 289 7 0 04 Jul 2024
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi Hwanjun Song Yusheng Xie Arshit Gupta Justin Sun Hang Su Igor Shalyminov Nikolaos Pappas Siffi Singh Saab Mansour DiffM EGVM 260 9 0 05 Mar 2024
Large Language Models can Share Images, Too!Annual Meeting of the Association for Computational Linguistics (ACL), 2023 Young-Jun Lee Dokyong Lee Joo Won Sung Jonghwan Hyeon Ho-Jin Choi MLLM 376 4 0 23 Oct 2023
Teaching Text-to-Image Models to Communicate in Dialog Xiaowen Sun Jiazhan Feng Yuxuan Wang Yuxuan Lai Xingyu Shen Dongyan Zhao DiffM 150 1 0 27 Sep 2023
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models Yupan Huang Zaiqiao Meng Fangyu Liu Yixuan Su Nigel Collier Yutong Lu MLLM 154 32 0 31 Aug 2023
MPCHAT: Towards Multimodal Persona-Grounded ConversationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Jaewoo Ahn Yeda Song Sangdoo Yun Gunhee Kim 166 26 0 27 May 2023
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional ExpertsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Yunshui Li Binyuan Hui Zhichao Yin Min Yang Fei Huang Yongbin Li MoE 181 23 0 24 May 2023
IMAD: IMage-Augmented multi-modal DialogueJournal of Mathematical Sciences (J. Math. Sci.), 2023 Viktor Moskvoretskii Anton Frolov Denis Kuznetsov 157 6 0 17 May 2023
Retrieving Multimodal Information for Augmented Generation: A SurveyConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Ruochen Zhao Hailin Chen Weishi Wang Fangkai Jiao Do Xuan Long ... Bosheng Ding Xiaobao Guo Minzhi Li Xingxuan Li Shafiq Joty 386 125 0 20 Mar 2023
TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real WorldACM Multimedia (ACM MM), 2023 Hongpeng Lin Ludan Ruan Wenke Xia Peiyu Liu Jing Wen ... Di Hu Ruihua Song Wayne Xin Zhao Qin Jin Zhiwu Lu VGen 193 13 0 14 Jan 2023
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue DatasetNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022 Young-Jun Lee ByungSoo Ko Han-Gyu Kim Jonghwan Hyeon Ho-Jin Choi 288 12 0 08 Dec 2022
MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain ConversationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Jiazhan Feng Qingfeng Sun Can Xu Lu Wang Yaming Yang Chongyang Tao Dongyan Zhao Qingwei Lin 235 67 0 10 Nov 2022
BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue DatasetsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Minju Kim Chaehyeong Kim Yongho Song Seung-won Hwang Jinyoung Yeo 206 18 0 23 Oct 2022