
![]() PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-ResolutionComputer Vision and Pattern Recognition (CVPR), 2025 |
![]() EffiVLM-BENCH: A Comprehensive Benchmark for Evaluating Training-Free Acceleration in Large Vision-Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() TinyRS-R1: Compact Multimodal Language Model for Remote SensingIEEE Geoscience and Remote Sensing Letters (GRSL), 2025 |
![]() SAMChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Small Scale Remote SensingIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE J-STARS), 2025 |
![]() CM1 - A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language ModelsIEEE International Conference on Document Analysis and Recognition (ICDAR), 2025 |