
Title |
|---|
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQAInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025 |
![]() DocVLM: Make Your VLM an Efficient ReaderComputer Vision and Pattern Recognition (CVPR), 2024 |
![]() Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
RecognitionACM Multimedia (ACM MM), 2023 |
![]() Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards
Enhancing Text Spotting PerformanceIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023 |
![]() FuseCap: Leveraging Large Language Models for Enriched Fused Image
CaptionsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023 |
![]() CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained
Vision-Language ModelIEEE Transactions on Image Processing (IEEE TIP), 2023 |