ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.06354
  4. Cited By
Modality-Balanced Models for Visual Dialogue

Modality-Balanced Models for Visual Dialogue

AAAI Conference on Artificial Intelligence (AAAI), 2020
17 January 2020
Hyounghun Kim
Hao Tan
Joey Tianyi Zhou
ArXiv (abs)PDFHTML

Papers citing "Modality-Balanced Models for Visual Dialogue"

9 / 9 papers shown
Unified Multimodal Model with Unlikelihood Training for Visual Dialog
Unified Multimodal Model with Unlikelihood Training for Visual DialogACM Multimedia (ACM MM), 2022
Zihao Wang
Junli Wang
Changjun Jiang
MLLM
231
13
0
23 Nov 2022
Multimodal Dialogue State Tracking
Multimodal Dialogue State TrackingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Hung Le
Nancy F. Chen
Guosheng Lin
196
10
0
16 Jun 2022
VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution
VD-PCR: Improving Visual Dialog with Pronoun Coreference ResolutionPattern Recognition (Pattern Recogn.), 2022
Xintong Yu
Hongming Zhang
Ruixin Hong
Yangqiu Song
Changshui Zhang
248
17
0
29 May 2022
Modality-Balanced Embedding for Video Retrieval
Modality-Balanced Embedding for Video RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Xun Wang
Bingqing Ke
Xuanping Li
Fangyu Liu
Mingyu Zhang
Xiao Liang
Qi-En Xiao
Cheng Luo
Yue Yu
204
12
0
18 Apr 2022
Echo-Reconstruction: Audio-Augmented 3D Scene Reconstruction
Echo-Reconstruction: Audio-Augmented 3D Scene Reconstruction
Justin Wilson
Nicholas Rewkowski
Ming Lin
Henry Fuchs
185
1
0
05 Oct 2021
VD-BERT: A Unified Vision and Dialog Transformer with BERT
VD-BERT: A Unified Vision and Dialog Transformer with BERTConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Yue Wang
Shafiq Joty
Michael R. Lyu
Irwin King
Caiming Xiong
Guosheng Lin
471
110
0
28 Apr 2020
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge
  Transfer
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge TransferConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Gi-Cheon Kang
Junseok Park
Hwaran Lee
Byoung-Tak Zhang
Jin-Hwa Kim
VLM
290
10
0
14 Apr 2020
Guessing State Tracking for Visual Dialogue
Guessing State Tracking for Visual DialogueEuropean Conference on Computer Vision (ECCV), 2020
Wei Pang
Xiaojie Wang
OOD
498
10
0
24 Feb 2020
Efficient Attention Mechanism for Visual Dialog that can Handle All the
  Interactions between Multiple Inputs
Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
384
7
0
26 Nov 2019
1
Page 1 of 1