ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.09868
  4. Cited By
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote
  Sensing Image Retrieval

Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval

IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021
21 April 2022
Zhiqiang Yuan
Wenkai Zhang
Kun Fu
Xuan Li
Chubo Deng
Hongqi Wang
Xian Sun
ArXiv (abs)PDFHTML

Papers citing "Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval"

50 / 51 papers shown
Title
FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding
FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding
Z. Li
W. Yu
Dilxat Muhtar
X. Zhang
Pengfeng Xiao
Pedram Ghamisi
Xiao Xiang Zhu
CLIPVLM
132
0
0
18 Nov 2025
Survey of Multimodal Geospatial Foundation Models: Techniques, Applications, and Challenges
Survey of Multimodal Geospatial Foundation Models: Techniques, Applications, and Challenges
Liling Yang
Ning Chen
Jun Yue
Yidan Liu
Jiayi Ma
Pedram Ghamisi
Antonio J. Plaza
Leyuan Fang
AI4TS
96
0
0
27 Oct 2025
TCMA: Text-Conditioned Multi-granularity Alignment for Drone Cross-Modal Text-Video Retrieval
TCMA: Text-Conditioned Multi-granularity Alignment for Drone Cross-Modal Text-Video Retrieval
Zixu Zhao
Yang Zhan
VGenAI4TS
98
0
0
11 Oct 2025
A Synthetic-to-Real Dehazing Method based on Domain Unification
A Synthetic-to-Real Dehazing Method based on Domain Unification
Zhiqiang Yuan
Jinchao Zhang
Jie Zhou
56
0
0
04 Sep 2025
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Haifeng Li
Wang Guo
Haiyang Wu
Mengwei Wu
Jipeng Zhang
Qing Zhu
Yu Liu
Xin Huang
Chao Tao
98
0
0
09 Aug 2025
CLOSP: A Unified Semantic Space for SAR, MSI, and Text in Remote Sensing
CLOSP: A Unified Semantic Space for SAR, MSI, and Text in Remote Sensing
Daniele Rege Cambrin
Lorenzo Vaiani
Giuseppe Gallipoli
Luca Cagliero
Paolo Garza
102
0
0
14 Jul 2025
Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image Captions
Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image Captions
Yijun Shen
Delong Chen
Fan Liu
Xingyu Wang
Chuanyi Zhang
Liang Yao
Yuhui Zheng
211
1
0
28 May 2025
Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval
Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval
Hailong Ning
Siying Wang
Tao Lei
Xiaopeng Cao
Huanmin Dou
Bin Zhao
Asoke K. Nandi
Petia Radeva
133
1
0
22 May 2025
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and PerspectivesIEEE Geoscience and Remote Sensing Magazine (GRSM), 2025
Xingxing Weng
Chao Pang
Gui-Song Xia
VLM
276
8
0
20 May 2025
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery
Jerome Quenum
Wen-Han Hsieh
Tsung-Han Wu
Ritwik Gupta
Trevor Darrell
David M. Chan
MLLMVLM
204
3
0
05 May 2025
XeMap: Contextual Referring in Large-Scale Remote Sensing Environments
XeMap: Contextual Referring in Large-Scale Remote Sensing Environments
Yongqian Li
Lu Si
Y. T. Hou
Chengaung Liu
Yangqiu Song
Hongjian Fang
Jing Zhang
230
0
0
30 Apr 2025
FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing
FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing
Hariseetharam Gunduboina
Muhammad Haris Khan
Biplab Banerjee
VLM
227
0
0
23 Apr 2025
SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding
SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding
Yimin Wei
Aoran Xiao
Yexian Ren
Yuting Zhu
Hongruixuan Chen
J. Xia
Xiangwei Zhu
VLM
337
6
0
04 Apr 2025
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
Ziyue Huang
Hongxi Yan
Qiqi Zhan
Shuai Yang
Mingming Zhang
Yiming Lei
Chenkai Zhang
Zeming Liu
Qingjie Liu
Longji Xu
280
9
0
28 Mar 2025
DGTRSD & DGTRS-CLIP: A Dual-Granularity Remote Sensing Image-Text Dataset and Vision Language Foundation Model for Alignment
DGTRSD & DGTRS-CLIP: A Dual-Granularity Remote Sensing Image-Text Dataset and Vision Language Foundation Model for AlignmentIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE J-STARS), 2025
Weizhi Chen
Yupeng Deng
Jin Wei
Jingbo Chen
Jiansheng Chen
Yuman Feng
Zhihao Xi
Diyou Liu
Kai Li
Yu Meng
VLM
227
1
0
25 Mar 2025
Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation
Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation
Clive Tinashe Marimo
Benedikt Blumenstiel
Maximilian Nitsche
Johannes Jakubik
Thomas Brunschwiler
241
3
0
20 Mar 2025
GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis
GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis
Angelos Zavras
Dimitrios Michail
Xiao Xiang Zhu
Tim Siebert
Ioannis Papoutsis
VLM
390
3
0
13 Feb 2025
A Resource-Efficient Training Framework for Remote Sensing Text--Image Retrieval
A Resource-Efficient Training Framework for Remote Sensing Text--Image Retrieval
Weihang Zhang
Jihao Li
Shuoke Li
Ziqing Niu
Jialiang Chen
Wenkai Zhang
VLM
152
1
0
18 Jan 2025
On Domain-Adaptive Post-Training for Multimodal Large Language Models
On Domain-Adaptive Post-Training for Multimodal Large Language Models
Daixuan Cheng
Shaohan Huang
Ziyu Zhu
Xintong Zhang
Wayne Xin Zhao
Zhongzhi Luan
Bo Dai
Zhenliang Zhang
VLM
363
6
0
29 Nov 2024
Cross-Modal Pre-Aligned Method with Global and Local Information for
  Remote-Sensing Image and Text Retrieval
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text RetrievalIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Zengbao Sun
Ming Zhao
Gaorui Liu
Andre Kaup
219
11
0
22 Nov 2024
Pushing the Limits of Vision-Language Models in Remote Sensing without
  Human Annotations
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations
Keumgang Cha
Donggeun Yu
Junghoon Seo
VLM
195
2
0
11 Sep 2024
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language ModelsIsprs Journal of Photogrammetry and Remote Sensing (ISPRS J. Photogramm. Remote Sens.), 2024
Junyao Ge
Xu Zhang
Yang Zheng
Kaitai Guo
Jimin Liang
496
5
0
27 Aug 2024
EarthMarker: Visual Prompt Learning for Region-level and Point-level
  Remote Sensing Imagery Comprehension
EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension
Wei Zhang
Miaoxin Cai
Tong Zhang
Jun Li
Zhuang Yin
Xuerui Mao
302
3
0
18 Jul 2024
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote
  Sensing Image Understanding
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding
Linrui Xu
Ling Zhao
Wang Guo
Qiujun Li
Kewang Long
Kaiqi Zou
Yuhan Wang
Haifeng Li
AI4TS
209
7
0
18 Jun 2024
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for
  Remote Sensing Vision-Language Understanding
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding
Junwei Luo
Zhen Pang
Yongjun Zhang
Tingzhu Wang
Linlin Wang
...
Jiangwei Lao
Jian Wang
Jingdong Chen
Yihua Tan
Yansheng Li
280
56
0
14 Jun 2024
Towards Vision-Language Geo-Foundation Model: A Survey
Towards Vision-Language Geo-Foundation Model: A Survey
Yue Zhou
Xue Jiang
Yiping Ke
Yiping Ke
Junchi Yan
Xue Yang
Wayne Zhang
160
29
0
13 Jun 2024
Zoom and Shift are All You Need
Zoom and Shift are All You Need
Jiahao Qin
155
2
0
13 Jun 2024
Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing
  Image-Text Retrieval
Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval
Rui Yang
Shuang Wang
Yi Han
Yuanheng Li
Dong Zhao
Dou Quan
Yanhe Guo
Licheng Jiao
211
8
0
29 May 2024
Composed Image Retrieval for Remote Sensing
Composed Image Retrieval for Remote Sensing
Bill Psomas
Ioannis Kakogeorgiou
Nikos Efthymiadis
Giorgos Tolias
Ondřej Chum
Yannis Avrithis
Konstantinos Karantzalos
317
12
0
24 May 2024
PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval
PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval
Jiancheng Pan
Muyuan Ma
Qing Ma
Cong Bai
Shengyong Chen
204
12
0
16 May 2024
On the Foundations of Earth and Climate Foundation Models
On the Foundations of Earth and Climate Foundation Models
Xiao Xiang Zhu
Zhitong Xiong
Yi Wang
Adam J. Stewart
Konrad Heidler
Yuanyuan Wang
Zhenghang Yuan
Thomas Dujardin
Qingsong Xu
Yilei Shi
AI4ClAI4CE
228
38
0
07 May 2024
Knowledge-aware Text-Image Retrieval for Remote Sensing Images
Knowledge-aware Text-Image Retrieval for Remote Sensing ImagesIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Li Mi
Xianjie Dai
J. Castillo-Navarro
D. Tuia
161
18
0
06 May 2024
LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for
  Remote Sensing Image-Text Retrival
LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival
Yuanxin Zhao
Mi Zhang
Bingnan Yang
Zhan Zhang
Jiaju Kang
Jianya Gong
139
2
0
16 Mar 2024
Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment
Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment
Angelos Zavras
Dimitrios Michail
Tim Siebert
Ioannis Papoutsis
VLM
331
15
0
15 Feb 2024
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal
  Language Model
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model
Dilxat Muhtar
Zhenshi Li
Feng-Xue Gu
Xue-liang Zhang
Pengfeng Xiao
372
114
0
04 Feb 2024
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor
  Image Comprehension in Remote Sensing Domain
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Wei Zhang
Miaoxin Cai
Tong Zhang
Zhuang Yin
Xuerui Mao
295
188
0
30 Jan 2024
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction
  Tuning with Large Language Model
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
Yangfan Zhan
Zhitong Xiong
Yuan. Yuan
MLLM
199
103
0
18 Jan 2024
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing
  Image-text Retrieval
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text RetrievalIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Qing Ma
Jiancheng Pan
Cong Bai
200
40
0
12 Oct 2023
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text
  Retrieval
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text RetrievalIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Yuan. Yuan
Yangfan Zhan
Zhitong Xiong
VLM
167
60
0
24 Aug 2023
RSGPT: A Remote Sensing Vision Language Model and Benchmark
RSGPT: A Remote Sensing Vision Language Model and BenchmarkIsprs Journal of Photogrammetry and Remote Sensing (ISPRS J. Photogramm. Remote Sens.), 2023
Yuan Hu
Jianlong Yuan
Congcong Wen
Xiaonan Lu
Xiang Li
VLM
197
186
0
28 Jul 2023
Visual Instruction Tuning with Polite Flamingo
Visual Instruction Tuning with Polite FlamingoAAAI Conference on Artificial Intelligence (AAAI), 2023
Delong Chen
Jianfeng Liu
Wenliang Dai
Baoyuan Wang
MLLM
312
51
0
03 Jul 2023
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large
  Vision-Language Model for Remote Sensing
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote SensingIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Zilun Zhang
Tiancheng Zhao
Yulong Guo
Yuxiang Cai
DiffMVLM
959
131
0
20 Jun 2023
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
RemoteCLIP: A Vision Language Foundation Model for Remote SensingIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Fan Liu
Delong Chen
Zhan-Rong Guan
Xiaocong Zhou
Jiale Zhu
Qiaolin Ye
Liyong Fu
Jun Zhou
VLM
389
414
0
19 Jun 2023
Vision-Language Models in Remote Sensing: Current Progress and Future
  Trends
Vision-Language Models in Remote Sensing: Current Progress and Future TrendsIEEE Geoscience and Remote Sensing Magazine (GRSM), 2023
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
262
140
0
09 May 2023
Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation
  Incorporating Gloss Information
Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss InformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Sunjae Kwon
Rishabh Garodia
Linghe Wang
Zhichao Yang
Hong-ye Yu
CoGe
261
5
0
02 May 2023
Scale-Semantic Joint Decoupling Network for Image-text Retrieval in
  Remote Sensing
Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing
Chengyu Zheng
Ning Song
Ruoyu Zhang
Lei Huang
Zhiqiang Wei
Jie Nie
141
17
0
12 Dec 2022
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing
  Data
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing DataIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Yangfan Zhan
Zhitong Xiong
Yuan. Yuan
197
170
0
23 Oct 2022
EarthNets: Empowering AI in Earth Observation
EarthNets: Empowering AI in Earth Observation
Zhitong Xiong
Fahong Zhang
Yi Wang
Yilei Shi
Xiao Xiang Zhu
357
77
0
10 Oct 2022
Learning to Evaluate Performance of Multi-modal Semantic Localization
Learning to Evaluate Performance of Multi-modal Semantic LocalizationIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Zhiqiang Yuan
Wenkai Zhang
Chongyang Li
Zhaoying Pan
Yongqiang Mao
Jialiang Chen
Shuoke Li
Hongqi Wang
Xian Sun
202
28
0
14 Sep 2022
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and
  Local Information
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local InformationIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Zhiqiang Yuan
Wenkai Zhang
Changyuan Tian
Xuee Rong
Zhengyuan Zhang
Hongqi Wang
Kun Fu
Xian Sun
203
164
0
21 Apr 2022
12
Next