Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images

Annual Meeting of the Association for Computational Linguistics (ACL), 2021
19 July 2021
Nyoungwoo Lee
Suwon Shin
Jaegul Choo
Ho‐Jin Choi
S. Myaeng

Papers citing "Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images"

24 / 24 papers shown
Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
Shojiro Yamabe
Futa Waseda
Daiki Shiono
Tsubasa Takahashi
DiffM, MLLM, VLM
226
0
0
03 Dec 2025
MDSEval: A Meta-Evaluation Benchmark for Multimodal Dialogue Summarization
Yinhong Liu
Jianfeng He
Hang Su
Ruixue Lian
Yi Nian
Jake W. Vincent
Srikanth Vishnubhotla
Robinson Piramuthu
Saab Mansour
104
0
0
02 Oct 2025
F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model
Hanbo Bi
Zhiqiang Yuan
Zexi Jia
Jiapei Zhang
Chongyang Li
Peixiang Luo
Ying Deng
Xiaoyue Duan
Jinchao Zhang
VLM
185
0
0
25 Aug 2025
On the Effectiveness of Integration Methods for Multimodal Dialogue Response Retrieval
Seongbo Jang
Seonghyeon Lee
Dongha Lee
Hwanjo Yu
190
0
0
13 Jun 2025
Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jihyoung Jang
Minwook Bae
Minji Kim
Dilek Z. Hakkani-Tür
Hyounghun Kim
191
1
0
31 May 2025
KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Xiaoming Shi
Zeming Liu
Chenkai Zhang
Yiming Lei
Haitao Leng
...
Qingjie Liu
Wanxiang Che
Shaoguo Liu
Size Li
Yanjie Wang
468
1
0
10 Mar 2025
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System
AAAI Spring Symposia (ASS), 2025
Karan Taneja
Ashok K. Goel
662
4
0
14 Feb 2025
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Wanqi Yang
Yongqian Li
Meng Fang
Lawrence Yunliang Chen
279
1
0
09 Feb 2025
'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
Rena Gao
Xuetong Wu
Siwen Luo
Caren Han
Feng Liu
OODD
335
1
0
31 Oct 2024
An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation
Peiming Guo
Sinuo Liu
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
Hao Fei
DiffM
330
1
0
16 Aug 2024
BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation
European Conference on Computer Vision (ECCV), 2024
Hee Suk Yoon
Eunseop Yoon
Joshua Tian Jin Tee
Kang Zhang
Yu-Jung Heo
Du-Seong Chang
Chang D. Yoo
222
7
0
12 Aug 2024
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Young-Jun Lee
Dokyong Lee
Junyoung Youn
Kyeongjin Oh
ByungSoo Ko
Jonghwan Hyeon
Ho-Jin Choi
293
7
0
04 Jul 2024
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Hossein Aboutalebi
Hwanjun Song
Yusheng Xie
Arshit Gupta
Justin Sun
Hang Su
Igor Shalyminov
Nikolaos Pappas
Siffi Singh
Saab Mansour
DiffM, EGVM
280
9
0
05 Mar 2024
Large Language Models can Share Images, Too!
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Young-Jun Lee
Dokyong Lee
Joo Won Sung
Jonghwan Hyeon
Ho-Jin Choi
MLLM
384
4
0
23 Oct 2023
Teaching Text-to-Image Models to Communicate in Dialog
Xiaowen Sun
Jiazhan Feng
Yuxuan Wang
Yuxuan Lai
Xingyu Shen
Dongyan Zhao
DiffM
158
1
0
27 Sep 2023
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
Yupan Huang
Zaiqiao Meng
Fangyu Liu
Yixuan Su
Nigel Collier
Yutong Lu
MLLM
182
32
0
31 Aug 2023
MPCHAT: Towards Multimodal Persona-Grounded Conversation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jaewoo Ahn
Yeda Song
Sangdoo Yun
Gunhee Kim
174
26
0
27 May 2023
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yunshui Li
Binyuan Hui
Zhichao Yin
Min Yang
Fei Huang
Yongbin Li
MoE
197
23
0
24 May 2023
IMAD: IMage-Augmented multi-modal Dialogue
Journal of Mathematical Sciences (J. Math. Sci.), 2023
Viktor Moskvoretskii
Anton Frolov
Denis Kuznetsov
169
6
0
17 May 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
394
125
0
20 Mar 2023
TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World
ACM Multimedia (ACM MM), 2023
Hongpeng Lin
Ludan Ruan
Wenke Xia
Peiyu Liu
Jing Wen
...
Di Hu
Ruihua Song
Wayne Xin Zhao
Qin Jin
Zhiwu Lu
VGen
201
13
0
14 Jan 2023
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Young-Jun Lee
ByungSoo Ko
Han-Gyu Kim
Jonghwan Hyeon
Ho-Jin Choi
296
12
0
08 Dec 2022
MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jiazhan Feng
Qingfeng Sun
Can Xu
Lu Wang
Yaming Yang
Chongyang Tao
Dongyan Zhao
Qingwei Lin
251
67
0
10 Nov 2022
BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasets
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Minju Kim
Chaehyeong Kim
Yongho Song
Seung-won Hwang
Jinyoung Yeo
210
18
0
23 Oct 2022