Sign Language Video Retrieval with Free-Form Textual QueriesComputer Vision and Pattern Recognition (CVPR), 2022 |
TEACHTEXT: CrossModal Generalized Distillation for Text-Video RetrievalIEEE International Conference on Computer Vision (ICCV), 2021 |
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via
Multi-Task TrainingConference on Computational Natural Language Learning (CoNLL), 2021 |
Telling the What while Pointing to the Where: Multimodal Queries for
Image RetrievalIEEE International Conference on Computer Vision (ICCV), 2021 |
Learning to Scale Multilingual Representations for Vision-Language TasksEuropean Conference on Computer Vision (ECCV), 2020 |
Component Analysis for Visual Question Answering ArchitecturesIEEE International Joint Conference on Neural Network (IJCNN), 2020 |
MULE: Multimodal Universal Language EmbeddingAAAI Conference on Artificial Intelligence (AAAI), 2019 |