Unified Multimodal Model with Unlikelihood Training for Visual DialogACM Multimedia (ACM MM), 2022 |
Multimodal Dialogue State TrackingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022 |
VD-PCR: Improving Visual Dialog with Pronoun Coreference ResolutionPattern Recognition (Pattern Recogn.), 2022 |
Modality-Balanced Embedding for Video RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022 |
VD-BERT: A Unified Vision and Dialog Transformer with BERTConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge
TransferConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |
Guessing State Tracking for Visual DialogueEuropean Conference on Computer Vision (ECCV), 2020 |