Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2111.15174
Cited By
v1
v2 (latest)
CRIS: CLIP-Driven Referring Image Segmentation
30 November 2021
Zhaoqing Wang
Yu Lu
Qiang Li
Xunqiang Tao
Yan Guo
Ming Gong
Tongliang Liu
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CRIS: CLIP-Driven Referring Image Segmentation"
50 / 288 papers shown
Title
Segmentation as A Plug-and-Play Capability for Frozen Multimodal LLMs
Jiazhen Liu
Long Chen
MLLM
VLM
96
2
0
19 Oct 2025
QuASH: Using Natural-Language Heuristics to Query Visual-Language Robotic Maps
Matti Pekkanen
Francesco Verdoja
Ville Kyrki
76
0
0
16 Oct 2025
CoPRS: Learning Positional Prior from Chain-of-Thought for Reasoning Segmentation
Zhenyu Lu
Liupeng Li
Jinpeng Wang
Yan Feng
Bin Chen
Ke Chen
Yaowei Wang
LRM
74
0
0
13 Oct 2025
Unified Open-World Segmentation with Multi-Modal Prompts
Yang Liu
Yufei Yin
Chenchen Jing
M. Zhu
Hao Chen
Yuling Xi
Bo Feng
Hao Wang
Shiyu Li
Chunhua Shen
VLM
74
0
0
12 Oct 2025
SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation
Zhenjie Mao
Yuhuan Yang
Chaofan Ma
Dongsheng Jiang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
80
0
0
11 Oct 2025
UGround: Towards Unified Visual Grounding with Unrolled Transformers
Rui Qian
Xin Yin
Chuanhang Deng
Zhiyuan Peng
Jian Xiong
Wei Zhai
Dejing Dou
99
0
0
04 Oct 2025
CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
Qihua Dong
Luis Figueroa
Handong Zhao
Kushal Kafle
Jason Kuen
Zhihong Ding
Scott D. Cohen
Y. Fu
ObjD
LRM
148
0
0
03 Oct 2025
CoPatch: Zero-Shot Referring Image Segmentation by Leveraging Untapped Spatial Knowledge in CLIP
Na Min An
Inha Kang
Minhyun Lee
Hyunjung Shim
VLM
97
0
0
27 Sep 2025
COLA: Context-aware Language-driven Test-time Adaptation
IEEE Transactions on Image Processing (IEEE TIP), 2025
Aiming Zhang
Tianyuan Yu
Liang Bai
Jun Tang
Yanming Guo
Yirun Ruan
Yun Zhou
Zhihe Lu
TTA
VLM
214
0
0
22 Sep 2025
Prompt-Driven Image Analysis with Multimodal Generative AI: Detection, Segmentation, Inpainting, and Interpretation
Kaleem Ahmad
MLLM
62
0
0
10 Sep 2025
Attribute-based Object Grounding and Robot Grasp Detection with Spatial Reasoning
Houjian Yu
Zheming Zhou
Min Sun
Omid Ghasemalizadeh
Yuyin Sun
Cheng-Hao Kuo
Arnie Sen
Changhyun Choi
78
0
0
09 Sep 2025
VLSM-Ensemble: Ensembling CLIP-based Vision-Language Models for Enhanced Medical Image Segmentation
J. Dietlmeier
Oluwabukola Grace Adegboro
Vayangi Ganepola
Claudia Mazo
Noel E. O'Connor
VLM
60
0
0
05 Sep 2025
EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models
Wenhui Zhu
Xiwen Chen
Zhipeng Wang
Shao Tang
Sayan Ghosh
Xuanzhao Dong
Rajat Koner
Yalin Wang
VLM
92
0
0
16 Aug 2025
Latent Expression Generation for Referring Image Segmentation and Grounding
S. Yu
Joonbeom Hong
Joonseok Lee
Jeany Son
ObjD
145
1
0
07 Aug 2025
Referring Remote Sensing Image Segmentation with Cross-view Semantics Interaction Network
Jiaxing Yang
Lihe Zhang
Huchuan Lu
104
0
0
02 Aug 2025
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
314
10
0
01 Aug 2025
OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration
Junwen Duan
Wei Xue
Ziyao Kang
Shixia Liu
Jiazhi Xia
VLM
111
0
0
26 Jul 2025
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Wentao Xiang
Haoxian Tan
Cong Wei
Yujie Zhong
Dengjie Li
Yujiu Yang
VLM
157
1
0
22 Jul 2025
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
Shiqi Huang
Shuting He
Huaiyuan Qin
Bihan Wen
231
1
0
17 Jul 2025
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation
Shuchang Ye
Usman Naseem
Mingyuan Meng
Jinman Kim
200
1
0
15 Jul 2025
Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
MLLM
LRM
521
1
0
01 Jul 2025
Multi-encoder nnU-Net outperforms transformer models with self-supervised pretraining
Seyedeh Sahar Taheri Otaghsara
Reza Rahmanzadeh
ViT
242
1
0
01 Jul 2025
Unleashing Diffusion and State Space Models for Medical Image Segmentation
Rong Wu
Ziqi Chen
Liming Zhong
Heng Li
Hai Shu
MedIm
237
1
0
15 Jun 2025
Refer to Any Segmentation Mask Group With Vision-Language Prompts
Shengcao Cao
Zijun Wei
Jason Kuen
Kangning Liu
Lingzhi Zhang
Jiuxiang Gu
HyunJoon Jung
Liang-Yan Gui
Yu Wang
VLM
278
2
0
05 Jun 2025
LlamaSeg: Image Segmentation via Autoregressive Mask Generation
Jiru Deng
Tengjin Weng
Tianyu Yang
Tong Lu
Zhiheng Li
Wenhao Jiang
VLM
302
0
0
26 May 2025
Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model
Alaa Dalaq
Muzammil Behzad
VLM
369
0
0
25 May 2025
Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
Zhihua Liu
Amrutha Saseendran
Lei Tong
Xilin He
Fariba Yousefi
...
Dino Oglic
Tom Diethe
Philip Teare
Huiyu Zhou
Chen Jin
VLM
559
3
0
23 May 2025
SynRES: Towards Referring Expression Segmentation in the Wild via Synthetic Data
Dong-Hee Kim
Hyunjee Song
Donghyun Kim
407
1
0
23 May 2025
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels
Computer Vision and Pattern Recognition (CVPR), 2025
Yongshuo Zong
Qin Zhang
Dongsheng An
Zhihua Li
Xiang Xu
Linghan Xu
Zhuowen Tu
Yifan Xing
Onkar Dabeer
ObjD
233
2
0
20 May 2025
Adversarial Robustness Analysis of Vision-Language Models in Medical Image Segmentation
Anjila Budathoki
Manish Dhakal
AAML
251
1
0
05 May 2025
Segment Any RGB-Thermal Model with Language-aided Distillation
Dong Xing
Xianxun Zhu
Wei Zhou
Qika Lin
Hang Yang
Yuqing Wang
VLM
381
0
0
04 May 2025
RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
Ruiqi Wang
Hao Zhang
VLM
225
2
0
03 May 2025
Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification
Neng Dong
Shuanglin Yan
Liyan Zhang
Jinhui Tang
230
0
0
01 May 2025
LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation
Pattern Recognition (Pattern Recogn.), 2025
Jiachen Li
Qing Xie
Xiaohan Yu
Hongyun Wang
Jinyu Xu
Yongjian Liu
ObjD
400
0
0
20 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
International Conference on Learning Representations (ICLR), 2025
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
441
1
0
15 Apr 2025
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Tao Zhang
Xuelong Li
Zilong Huang
Yuchen Ren
Weixian Lei
XueQing Deng
Shihao Chen
Shilin Xu
Jiashi Feng
MLLM
LRM
275
17
0
14 Apr 2025
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
Kaiyu Li
Zepeng Xin
Li Pang
Chao Pang
Yupeng Deng
Jing Yao
Guisong Xia
Deyu Meng
Zhi Wang
Xiangyong Cao
VLM
LRM
240
20
0
13 Apr 2025
DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration
Jiamei Xiong
Xuefeng Yan
Yongzhen Wang
Wei Zhao
Xiao-Ping Zhang
Mingqiang Wei
DiffM
196
2
0
07 Apr 2025
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities
Jing Liu
Wenxuan Wang
Yisi Zhang
Yepeng Tang
Xingjian He
Longteng Guo
Tongtian Yue
Xinlong Wang
ObjD
246
2
0
02 Apr 2025
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Lanyun Zhu
Tianrun Chen
Qianxiong Xu
Xuanyi Liu
Deyi Ji
Haiyang Wu
Na Zhao
Jing Liu
VLM
LRM
250
11
0
01 Apr 2025
BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
Rafi Ibn Sultan
Hui Zhu
Chengyin Li
Dongxiao Zhu
208
0
0
30 Mar 2025
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation
International Conference on Learning Representations (ICLR), 2025
Donggon Jang
Yucheol Cho
Suin Lee
Taehyeon Kim
Dae-Shik Kim
VLM
195
15
0
18 Mar 2025
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
Computer Vision and Pattern Recognition (CVPR), 2025
Chen Liu
Liying Yang
Peike Li
Dadong Wang
Lincheng Li
Xin Yu
VOS
245
2
0
17 Mar 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
Computer Vision and Pattern Recognition (CVPR), 2025
Huanyi Zheng
Yuzhuo Tian
Hao Chen
Chunluan Zhou
Qingpei Guo
Yongxu Liu
M. Yang
Chunhua Shen
MLLM
VLM
221
8
0
11 Mar 2025
RS2-SAM2: Customized SAM2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Qian Zhang
Guang Dai
397
1
0
10 Mar 2025
AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP
Computer Vision and Pattern Recognition (CVPR), 2025
Wenxin Ma
Xu Zhang
Qingsong Yao
Fenghe Tang
Chenxu Wu
Yongbin Li
Rui Yan
Zihang Jiang
S. Kevin Zhou
VLM
271
28
0
09 Mar 2025
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2025
Seil Kang
Jinyeong Kim
Junhyeok Kim
Seong Jae Hwang
VLM
240
27
0
08 Mar 2025
Towards Universal Text-driven CT Image Segmentation
Yuheng Li
Yuxiang Lai
Maria Thor
Deborah Marshall
Zachary Buchwald
D. Yu
Xiaofeng Yang
MedIm
VLM
168
4
0
08 Mar 2025
GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI
Cecilia Diana-Albelda
Roberto Alcover-Couso
Álvaro García-Martín
Jesús Bescós
Marcos Escudero-Viñolo
265
3
0
06 Mar 2025
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Suhwan Cho
Seunghoon Lee
Minhyeok Lee
Jungho Lee
Sangyoun Lee
VOS
396
3
0
05 Mar 2025
1
2
3
4
5
6
Next