
Title |
|---|
More performant and scalable: Rethinking contrastive vision-language pre-training of radiology in the LLM era. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025 |
DIVA-VQA: Detecting Inter-frame Variations in UGC Video Quality. IEEE International Conference on Image Processing (ICIP), 2025 |
StepAL: Step-aware Active Learning for Cataract Surgical Videos. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025 |
Improving Token-based Object Detection with Video. IEEE Access, 2025 |
Can Vision Language Models Understand Mimed Actions? Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification. Computer Vision and Pattern Recognition (CVPR), 2025 |
An Effective End-to-End Solution for Multimodal Action Recognition. International Conference on Pattern Recognition (ICPR), 2025 |