Vision-Language Models Learn Super Images for Efficient Partially Relevant Video Retrieval. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) (TOMM), 2023 |
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning. Neural Information Processing Systems (NeurIPS), 2022 |
Efficient Cross-Modal Video Retrieval with Meta-Optimized Frames. IEEE Transactions on Multimedia (IEEE TMM), 2022 |
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing. Computer Vision and Pattern Recognition (CVPR), 2022 |
A Coding Framework and Benchmark towards Compressed Video Understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 |
Can An Image Classifier Suffice For Action Recognition? International Conference on Learning Representations (ICLR), 2021 |