2410.06234
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data
International Conference on Learning Representations (ICLR), 2024
28 January 2025
Jeremy Irvin
Emily Ruoyu Liu
Joyce Chuyi Chen
Ines Dormoy
Jinyoung Kim
Samar Khanna
Zhuo Zheng
Stefano Ermon
MLLM
VLM
Papers citing
"TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data"
50 / 65 papers shown
Multilingual Training-Free Remote Sensing Image Captioning
Carlos Rebelo
Gil Rocha
João Daniel Silva
Bruno Martins
48
0
0
30 Nov 2025
GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes
Di Wang
Shunyu Liu
Wentao Jiang
Fengxiang Wang
Yi Liu
...
Haonan Guo
Jing Zhang
Bo Du
Dacheng Tao
L. Zhang
LRM
96
0
0
27 Nov 2025
Co-Training Vision Language Models for Remote Sensing Multi-task Learning
Qingyun Li
Shuran Ma
Junwei Luo
Yi Yu
Yue Zhou
...
Xiaoxing Wang
Xin He
Yushi Chen
Xue Yang
Junchi Yan
168
0
0
26 Nov 2025
Think First, Assign Next (ThiFAN-VQA): A Two-stage Chain-of-Thought Framework for Post-Disaster Damage Assessment
Ehsan Karimi
Nhut Le
Maryam Rahnemoonfar
LRM
84
0
0
24 Nov 2025
REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing
Binger Chen
Tacettin Emre Bök
Behnood Rasti
Volker Markl
Begüm Demir
100
0
0
21 Nov 2025
The Potential of Copernicus Satellites for Disaster Response: Retrieving Building Damage from Sentinel-1 and Sentinel-2
Olivier Dietrich
Merlin Alfredsson
Emilia Arens
Nando Metzger
T. Peters
L. Scheibenreif
Jan Dirk Wegner
Konrad Schindler
92
0
0
07 Nov 2025
DescribeEarth: Describe Anything for Remote Sensing Images
Kaiyu Li
Zixuan Jiang
Xiangyong Cao
Jiayu Wang
Yuchen Xiao
Deyu Meng
Zhi Wang
125
1
0
30 Sep 2025
Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning
Chenhui Xu
F. Yu
Michael J. Bianco
Jacob Kovarskiy
Raphael Tang
...
Rupanjali Kukal
Mikael Figueroa
Rishi Madhok
Nikolaos Karianakis
Jinjun Xiong
ObjD
ReLM
LRM
123
0
0
29 Sep 2025
GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning
Mustansar Fiaz
Hiyam Debary
P. Fraccaro
D. Paudel
Luc Van Gool
Fahad Shahbaz Khan
Salman Khan
ObjD
OffRL
VLM
LRM
305
2
0
29 Sep 2025
BTCChat: Advancing Remote Sensing Bi-temporal Change Captioning with Multimodal Large Language Model
Yujie Li
Wenjia Xu
Yuanben Zhang
Zhiwei Wei
Mugen Peng
100
0
0
07 Sep 2025
RSCC: A Large-Scale Remote Sensing Change Caption Dataset for Disaster Events
Z. Chen
Chenxi Wang
Ningyu Zhang
Feng Zhang
182
2
0
02 Sep 2025
ChatENV: An Interactive Vision-Language Model for Sensor-Guided Environmental Monitoring and Scenario Simulation
Hosam Elgendy
Ahmed Sharshar
Ahmed Aboeitta
Mohsen Guizani
VLM
132
0
0
14 Aug 2025
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Haifeng Li
Wang Guo
Haiyang Wu
Mengwei Wu
Jipeng Zhang
Qing Zhu
Yu Liu
Xin Huang
Chao Tao
131
1
0
09 Aug 2025
MONITRS: Multimodal Observations of Natural Incidents Through Remote Sensing
Shreelekha Revankar
Utkarsh Mall
Cheng Perng Phoo
Kavita Bala
Bharath Hariharan
114
0
0
22 Jul 2025
TAMMs: Temporal-Aware Multimodal Model for Satellite Image Change Understanding and Forecasting
Zhongbin Guo
Yuhao Wang
Ping Jian
Chengzhi Li
Xinyue Chen
Zhen Yang
Ertai E
243
0
0
23 Jun 2025
Domain Specific Benchmarks for Evaluating Multimodal Large Language Models
Khizar Anjuma
Muhammad Arbab Arshad
Kadhim Hayawi
Efstathios Polyzos
A. Tariq
...
Nishith Reddy Mannuru
Ravi Varma Kumar Bevara
Taslim Mahbub
Muhammad Zeeshan Akram
Sakib Shahriar
ELM
LRM
389
2
0
15 Jun 2025
ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks
Akashah Shabbir
Muhammad Akhtar Munir
Akshay Dudhane
Muhammad Umer Sheikh
M. H. Khan
Paolo Fraccaro
Juan Bernabé-Moreno
Fahad Shahbaz Khan
Salman Khan
LLMAG
ELM
191
3
0
29 May 2025
DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response
Junjue Wang
Weihao Xuan
Heli Qi
Zhihao Liu
Kunyi Liu
...
Hongruixuan Chen
Jian Song
J. Xia
Zhuo Zheng
Xiangwei Zhu
422
9
0
27 May 2025
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives
IEEE Geoscience and Remote Sensing Magazine (GRSM), 2025
Xingxing Weng
Chao Pang
Gui-Song Xia
VLM
316
10
0
20 May 2025
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery
Jerome Quenum
Wen-Han Hsieh
Tsung-Han Wu
Ritwik Gupta
Trevor Darrell
David M. Chan
MLLM
VLM
252
4
0
05 May 2025
Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization
Darryl Hannan
John Cooper
Dylan White
Timothy Doster
Henry Kvinge
Y. Watkins
205
1
0
14 Apr 2025
Operational Change Detection for Geographical Information: Overview and Challenges
Nicolas Gonthier
335
0
0
18 Mar 2025
Quality-Driven Curation of Remote Sensing Vision-Language Data via Learned Scoring Models
Dilxat Muhtar
Enzhuo Zhang
Zhenshi Li
Feng-Xue Gu
Yanglangxing He
Pengfeng Xiao
Xueliang Zhang
262
7
0
02 Mar 2025
Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Weihong Zhong
Xiaocheng Feng
Liang Zhao
Qiming Li
Lei Huang
Yuxuan Gu
Weitao Ma
Yuan Xu
Bing Qin
MLLM
450
19
0
30 Jun 2024
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding
Linrui Xu
Ling Zhao
Wang Guo
Qiujun Li
Kewang Long
Kaiqi Zou
Yuhan Wang
Haifeng Li
AI4TS
241
9
0
18 Jun 2024
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding
Junwei Luo
Zhen Pang
Yongjun Zhang
Tingzhu Wang
Linlin Wang
...
Jiangwei Lao
Jian Wang
Jingdong Chen
Yihua Tan
Yansheng Li
348
64
0
14 Jun 2024
ST-LLM: Large Language Models Are Effective Temporal Learners
Ruyang Liu
Chen Li
Haoran Tang
Yixiao Ge
Ying Shan
Ge Li
181
124
0
30 Mar 2024
ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models
Zhenghang Yuan
Zhitong Xiong
Lichao Mou
Xiao Xiang Zhu
206
19
0
17 Feb 2024
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model
Dilxat Muhtar
Zhenshi Li
Feng-Xue Gu
Xue-liang Zhang
Pengfeng Xiao
448
122
0
04 Feb 2024
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Wei Zhang
Miaoxin Cai
Tong Zhang
Zhuang Yin
Xuerui Mao
403
206
0
30 Jan 2024
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
Yangfan Zhan
Zhitong Xiong
Yuan Yuan
MLLM
235
109
0
18 Jan 2024
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Anku Rani
Vipula Rawte
Vasu Sharma
Amitava Das
HILM
410
337
0
02 Jan 2024
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing
Zhecheng Wang
R. Prabha
Tianyuan Huang
Jiajun Wu
Ram Rajagopal
210
124
0
20 Dec 2023
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
European Conference on Computer Vision (ECCV), 2023
Yanwei Li
Chengyao Wang
Jiaya Jia
VLM
MLLM
303
470
0
28 Nov 2023
GeoChat: Grounded Large Vision-Language Model for Remote Sensing
Computer Vision and Pattern Recognition (CVPR), 2023
Kartik Kuckreja
M. S. Danish
Muzammal Naseer
Abhijit Das
Salman Khan
Fahad Shahbaz Khan
313
287
0
24 Nov 2023
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin
Yang Ye
Bin Zhu
Jiaxi Cui
Munan Ning
Peng Jin
Li-ming Yuan
VLM
MLLM
1.5K
1,154
0
16 Nov 2023
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Peng Jin
Ryuichi Takanobu
Caiwan Zhang
Xiaochun Cao
Li-ming Yuan
MLLM
488
348
0
14 Nov 2023
NExT-Chat: An LMM for Chat, Detection and Segmentation
Ao Zhang
Yuan Yao
Wei Ji
Zhiyuan Liu
Tat-Seng Chua
MLLM
VLM
335
73
0
08 Nov 2023
Improved Baselines with Visual Instruction Tuning
Computer Vision and Pattern Recognition (CVPR), 2023
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
596
4,087
0
05 Oct 2023
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
International Conference on Learning Representations (ICLR), 2023
Bin Zhu
Bin Lin
Munan Ning
Yang Yan
Jiaxi Cui
...
Zongwei Li
Wancai Zhang
Zhifeng Li
Wei Liu
Liejie Yuan
VLM
MLLM
652
331
0
03 Oct 2023
RSGPT: A Remote Sensing Vision Language Model and Benchmark
ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS J. Photogramm. Remote Sens.), 2023
Yuan Hu
Jianlong Yuan
Congcong Wen
Xiaonan Lu
Xiang Li
VLM
233
204
0
28 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
6.9K
15,103
0
18 Jul 2023
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Muhammad Maaz
H. Rasheed
Salman Khan
Fahad Shahbaz Khan
MLLM
385
940
0
08 Jun 2023
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hang Zhang
Xin Li
Lidong Bing
MLLM
542
1,466
0
05 Jun 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Artificial Intelligence Review (AIR), 2023
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
339
142
0
19 May 2023
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review
Remote Sensing (RS), 2023
Guangliang Cheng
Yun-Min Huang
Xiangtai Li
Shuchang Lyu
Zhaoyang Xu
Qi Zhao
Shiming Xiang
200
155
0
09 May 2023
Visual Instruction Tuning
Neural Information Processing Systems (NeurIPS), 2023
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
1.1K
7,286
0
17 Apr 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
4.3K
20,543
0
15 Mar 2023
SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery
Neural Information Processing Systems (NeurIPS), 2022
Yezhen Cong
Samar Khanna
Chenlin Meng
Patrick Liu
Erik Rozi
Yutong He
Marshall Burke
David B. Lobell
Stefano Ermon
ViT
454
397
0
17 Jul 2022
Change Detection Meets Visual Question Answering
Zhenghang Yuan
Lichao Mou
Zhitong Xiong
Xiaoxiang Zhu
223
59
0
12 Dec 2021