ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.15826
  4. Cited By
GeoChat: Grounded Large Vision-Language Model for Remote Sensing

GeoChat: Grounded Large Vision-Language Model for Remote Sensing

24 November 2023
Kartik Kuckreja
M. S. Danish
Muzammal Naseer
Abhijit Das
Salman Khan
Fahad Shahbaz Khan
ArXivPDFHTML

Papers citing "GeoChat: Grounded Large Vision-Language Model for Remote Sensing"

38 / 88 papers shown
Title
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image
  Understanding
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding
Dawei Dai
Yuanhui Zhang
Long Xu
Qianlan Yang
Xiaojing Shen
Shuyin Xia
Guoyin Wang
LM&MA
VLM
21
6
0
18 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Jiancheng Pan
Yanxing Liu
Yuqian Fu
Muyuan Ma
Jiaohao Li
D. Paudel
Luc Van Gool
Xiaomeng Huang
ObjD
55
7
0
17 Aug 2024
Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height
  Estimation
Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation
Daniele Rege Cambrin
Isaac Corley
Paolo Garza
19
1
0
08 Aug 2024
FlexAttention for Efficient High-Resolution Vision-Language Models
FlexAttention for Efficient High-Resolution Vision-Language Models
Junyan Li
Delin Chen
Tianle Cai
Peihao Chen
Yining Hong
Zhenfang Chen
Yikang Shen
Chuang Gan
VLM
59
4
0
29 Jul 2024
Semantic-CC: Boosting Remote Sensing Image Change Captioning via
  Foundational Knowledge and Semantic Guidance
Semantic-CC: Boosting Remote Sensing Image Change Captioning via Foundational Knowledge and Semantic Guidance
Yongshuo Zhu
Lu Li
Keyan Chen
Chenyang Liu
Fugen Zhou
Z. Shi
37
4
0
19 Jul 2024
EarthMarker: Visual Prompt Learning for Region-level and Point-level
  Remote Sensing Imagery Comprehension
EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension
Wei Zhang
Miaoxin Cai
Tong Zhang
Jun Li
Zhuang Yin
Xuerui Mao
61
5
0
18 Jul 2024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li
Cristina Mata
J. Park
Kumara Kahatapitiya
Yoo Sung Jang
...
Kanchana Ranasinghe
R. Burgert
Mu Cai
Yong Jae Lee
Michael S. Ryoo
LM&Ro
59
23
0
28 Jun 2024
CityBench: Evaluating the Capabilities of Large Language Model as World
  Model
CityBench: Evaluating the Capabilities of Large Language Model as World Model
Jie Feng
Jun Zhang
Junbo Yan
Xin Zhang
Tianjian Ouyang
Tianhui Liu
Yuwei Du
Siqi Guo
Yong Li
ELM
41
0
0
20 Jun 2024
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote
  Sensing Image Understanding
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding
Linrui Xu
Ling Zhao
Wang Guo
Qiujun Li
Kewang Long
Kaiqi Zou
Yuhan Wang
Haifeng Li
AI4TS
21
5
0
18 Jun 2024
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote
  Sensing Image Understanding
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding
Xiang Li
Jian Ding
Mohamed Elhoseiny
CoGe
35
19
0
18 Jun 2024
Preserving Knowledge in Large Language Model with Model-Agnostic
  Self-Decompression
Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression
Zilun Zhang
Yutao Sun
Tiancheng Zhao
Leigang Sha
Ruochen Xu
Kyusong Lee
Jianwei Yin
CLL
KELM
46
0
0
17 Jun 2024
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in
  Multimodal Large Language Model
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model
Jiahao Huo
Yibo Yan
Boren Hu
Yutao Yue
Xuming Hu
LRM
MLLM
32
7
0
17 Jun 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Bo Du
Dacheng Tao
Liangpei Zhang
56
19
0
17 Jun 2024
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for
  Remote Sensing Vision-Language Understanding
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding
Junwei Luo
Zhen Pang
Yongjun Zhang
Tingzhu Wang
Linlin Wang
...
Jiangwei Lao
Jian Wang
Jingdong Chen
Yihua Tan
Yansheng Li
28
20
0
14 Jun 2024
Towards Vision-Language Geo-Foundation Model: A Survey
Towards Vision-Language Geo-Foundation Model: A Survey
Yue Zhou
Litong Feng
Yiping Ke
Xue Jiang
Junchi Yan
Xue Yang
Wayne Zhang
35
15
0
13 Jun 2024
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents
Wenjia Xu
Zijian Yu
Yixu Wang
Jiuniu Wang
Mugen Peng
LLMAG
35
7
0
11 Jun 2024
CityCraft: A Real Crafter for 3D City Generation
CityCraft: A Real Crafter for 3D City Generation
Jie Deng
Wenhao Chai
Junsheng Huang
Zhonghan Zhao
Qixuan Huang
...
Shengyu Hao
Wenhao Hu
Jenq-Neng Hwang
X. Li
Gaoang Wang
31
12
0
07 Jun 2024
MGIMM: Multi-Granularity Instruction Multimodal Model for
  Attribute-Guided Remote Sensing Image Detailed Description
MGIMM: Multi-Granularity Instruction Multimodal Model for Attribute-Guided Remote Sensing Image Detailed Description
Cong Yang
Zuchao Li
Lefei Zhang
32
1
0
07 Jun 2024
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing
  Image Generation
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
Zhiping Yu
Chenyang Liu
Liqin Liu
Z. Shi
Zhengxia Zou
VGen
16
10
0
22 May 2024
Wildfire Risk Prediction: A Review
Wildfire Risk Prediction: A Review
Zhengsen Xu
Jonathan Li
Linlin Xu
20
0
0
02 May 2024
Struggle with Adversarial Defense? Try Diffusion
Struggle with Adversarial Defense? Try Diffusion
Yujie Li
Yanbin Wang
Haitao Xu
Bin Liu
Jianguo Sun
Zhenhao Guo
Wenrui Ma
DiffM
20
1
0
12 Apr 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
VLM
53
31
0
29 Mar 2024
LSKNet: A Foundation Lightweight Backbone for Remote Sensing
LSKNet: A Foundation Lightweight Backbone for Remote Sensing
Yuxuan Li
Xiang Li
Yimain Dai
Qibin Hou
Li Liu
Yongxiang Liu
Ming-Ming Cheng
Jian Yang
29
29
0
18 Mar 2024
Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection
  from Remote Sensing Imagery
Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery
Wei Zhang
Miaoxin Cai
Tong Zhang
Guoqiang Lei
Zhuang Yin
Xuerui Mao
22
6
0
06 Mar 2024
Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy,
  Advances, and Outlook
Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook
Xingchen Zou
Yibo Yan
Xixuan Hao
Yuehong Hu
Haomin Wen
...
Junbo Zhang
Yong Li
Tianrui Li
Yu Zheng
Yuxuan Liang
HAI
AI4TS
43
35
0
29 Feb 2024
ChatEarthNet: A Global-Scale Image-Text Dataset Empowering
  Vision-Language Geo-Foundation Models
ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models
Zhenghang Yuan
Zhitong Xiong
Lichao Mou
Xiao Xiang Zhu
20
7
0
17 Feb 2024
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for
  Medical LVLM
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM
Yutao Hu
Tian-Xin Li
Quanfeng Lu
Wenqi Shao
Junjun He
Yu Qiao
Ping Luo
ELM
LM&MA
23
50
0
14 Feb 2024
Large Language Models for Captioning and Retrieving Remote Sensing
  Images
Large Language Models for Captioning and Retrieving Remote Sensing Images
João Daniel Silva
João Magalhães
D. Tuia
Bruno Martins
32
29
0
09 Feb 2024
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal
  Language Model
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model
Dilxat Muhtar
Zhenshi Li
Feng-Xue Gu
Xue-liang Zhang
P. Xiao
67
46
0
04 Feb 2024
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth
  observation data
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data
Chenhui Zhang
Sherrie Wang
19
17
0
31 Jan 2024
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor
  Image Comprehension in Remote Sensing Domain
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Wei Zhang
Miaoxin Cai
Tong Zhang
Zhuang Yin
Xuerui Mao
11
83
0
30 Jan 2024
When Geoscience Meets Generative AI and Large Language Models:
  Foundations, Trends, and Future Challenges
When Geoscience Meets Generative AI and Large Language Models: Foundations, Trends, and Future Challenges
Abdenour Hadid
Tanujit Chakraborty
Daniel Busby
AI4CE
10
11
0
25 Jan 2024
MiniGPT-v2: large language model as a unified interface for
  vision-language multi-task learning
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
154
280
0
14 Oct 2023
Vision-Language Models in Remote Sensing: Current Progress and Future
  Trends
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
8
69
0
09 May 2023
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment
  Anything Model
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model
Di Wang
Jing Zhang
Bo Du
Minqiang Xu
Lin Liu
Dacheng Tao
L. Zhang
114
71
0
03 May 2023
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing
  Data
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Yangfan Zhan
Zhitong Xiong
Yuan. Yuan
66
103
0
23 Oct 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
RSVQA: Visual Question Answering for Remote Sensing Data
RSVQA: Visual Question Answering for Remote Sensing Data
Sylvain Lobry
Diego Marcos
J. Murray
D. Tuia
60
203
0
16 Mar 2020
Previous
12