Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives. IEEE Geoscience and Remote Sensing Magazine (GRSM), 2025 |
Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models. Conference on Uncertainty in Artificial Intelligence (UAI), 2025 |
Adaptive Token Boundaries: Integrating Human Chunking Mechanisms into Multimodal LLMs. Social Science Research Network (SSRN), 2025 |
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders. International Conference on Text, Speech and Dialogue (TSD), 2025 |
Mimic In-Context Learning for Multimodal Tasks. Computer Vision and Pattern Recognition (CVPR), 2025 |
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery? Computer Vision and Pattern Recognition (CVPR), 2025 |