
Title |
|---|
![]() DiffusionRet: Generative Text-Video Retrieval with Diffusion ModelIEEE International Conference on Computer Vision (ICCV), 2023 |
![]() FTM: A Frame-level Timeline Modeling Method for Temporal Graph
Representation LearningAAAI Conference on Artificial Intelligence (AAAI), 2023 |
![]() Expectation-Maximization Contrastive Learning for Compact
Video-and-Language RepresentationsNeural Information Processing Systems (NeurIPS), 2022 |
![]() Toward 3D Spatial Reasoning for Human-like Text-based Visual Question
AnsweringIEEE Transactions on Image Processing (IEEE TIP), 2022 |
![]() Locality Guidance for Improving Vision Transformers on Tiny DatasetsEuropean Conference on Computer Vision (ECCV), 2022 |