Test of Time: Instilling Video-Language Models with a Sense of Time. Computer Vision and Pattern Recognition (CVPR), 2023
Look Before you Speak: Visually Contextualized Utterances. Computer Vision and Pattern Recognition (CVPR), 2020
Clustering based Contrastive Learning for Improving Face Representations. IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2020