HEAR: Hearing Enhanced Audio Response for Video-grounded DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded MemoryIEEE International Conference on Computer Vision (ICCV), 2023 |
Information-Theoretic Text Hallucination Reduction for Video-grounded
DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
Structured Co-reference Graph Attention for Video-grounded DialogueAAAI Conference on Artificial Intelligence (AAAI), 2021 |
HSCJN: A Holistic Semantic Constraint Joint Network for Diverse Response
GenerationComputer Speech and Language (CSL), 2019 |