HEAR: Hearing Enhanced Audio Response for Video-grounded DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
Information-Theoretic Text Hallucination Reduction for Video-grounded
DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
Video Dialog as Conversation about Objects Living in Space-TimeEuropean Conference on Computer Vision (ECCV), 2022 |
VGNMN: Video-grounded Neural Module Network to Video-Grounded Language
TasksNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 |
Structured Co-reference Graph Attention for Video-grounded DialogueAAAI Conference on Artificial Intelligence (AAAI), 2021 |
Learning Reasoning Paths over Semantic Graphs for Video-grounded
DialoguesInternational Conference on Learning Representations (ICLR), 2021 |
Look Before you Speak: Visually Contextualized UtterancesComputer Vision and Pattern Recognition (CVPR), 2020 |