BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

12 August 2024

Joshua Tian Jin Tee

Yu-Jung Heo

Chang D. Yoo

Papers citing "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"

TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation
Sunjae Yoon, Gwanhyeong Koo, Younghwan Lee, Chang-Dong Yoo
VGen, 59 3 0, 31 Oct 2024

ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
Zijia Zhao, Longteng Guo, Tongtian Yue, Erdong Hu, Shuai Shao, Zehuan Yuan, Hua Huang, J. Liu
16 1 0, 24 Oct 2024

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
T. Pham, Tri Ton, Chang D. Yoo
36 3 0, 03 Oct 2024

HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue
Sunjae Yoon, Dahyun Kim, Eunseop Yoon, Hee Suk Yoon, Junyeong Kim, C. Yoo
29 6 0, 15 Dec 2023

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li, Dongxu Li, Silvio Savarese, Steven C. H. Hoi
VLM, MLLM, 244 4,186 0, 30 Jan 2023