DSTC8-AVSD: Multimodal Semantic Transformer Network with Retrieval Style Word Generator

1 April 2020

Papers citing "DSTC8-AVSD: Multimodal Semantic Transformer Network with Retrieval Style Word Generator"

HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

15 Dec 2023

Uncovering Hidden Connections: Iterative Search and Reasoning for Video-grounded Dialog

Haoyu Zhang

11 Oct 2023

Information-Theoretic Text Hallucination Reduction for Video-grounded Dialogue

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

12 Dec 2022

End-to-End Multimodal Representation Learning for Video Dialog

26 Oct 2022

Video Dialog as Conversation about Objects Living in Space-Time

European Conference on Computer Vision (ECCV), 2022

08 Jul 2022

C^3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues

Hung Le

Nancy F. Chen

Guosheng Lin

16 Jun 2021

VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Hung Le

Nancy F. Chen

Guosheng Lin

16 Apr 2021

Structured Co-reference Graph Attention for Video-grounded Dialogue

AAAI Conference on Artificial Intelligence (AAAI), 2021

24 Mar 2021

Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues

International Conference on Learning Representations (ICLR), 2021

Hung Le

Nancy F. Chen

Guosheng Lin

01 Mar 2021

Look Before you Speak: Visually Contextualized Utterances

Computer Vision and Pattern Recognition (CVPR), 2020

Paul Hongsuck Seo

Arsha Nagrani

Cordelia Schmid

10 Dec 2020

TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog

Interspeech, 2020

Wubo Li

Dongwei Jiang

Wei Zou

Xiangang Li

21 Oct 2020