Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2111.15174
Cited By
v1
v2 (latest)
CRIS: CLIP-Driven Referring Image Segmentation
30 November 2021
Zhaoqing Wang
Yu Lu
Qiang Li
Xunqiang Tao
Yan Guo
Ming Gong
Tongliang Liu
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CRIS: CLIP-Driven Referring Image Segmentation"
50 / 288 papers shown
CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification
IEEE transactions on multimedia (IEEE TMM), 2024
Xiaoyan Yu
Neng Dong
Liehuang Zhu
Hao Peng
Dapeng Tao
299
31
0
11 Jan 2024
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
International Journal of Computer Vision (IJCV), 2024
Xingxing Zuo
Pouya Samangouei
Yunwen Zhou
Yan Di
Mingyang Li
3DGS
355
82
0
03 Jan 2024
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
273
26
0
25 Dec 2023
FoodLMM: A Versatile Food Assistant using Large Multi-modal Model
Yuehao Yin
Huiyan Qi
B. Zhu
Yue Yu
Yu-Gang Jiang
Chong-Wah Ngo
264
39
0
22 Dec 2023
Weakly Supervised Semantic Segmentation for Driving Scenes
Dongseob Kim
Seungho Lee
Junsuk Choe
Hyunjung Shim
554
17
0
21 Dec 2023
Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation
Wenhao Xu
Rongtao Xu
Changwei Wang
Shibiao Xu
Li Guo
Man Zhang
Xiaopeng Zhang
VLM
247
18
0
20 Dec 2023
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick
Guangxing Han
Rui Hou
Sayan Nag
Ser-Nam Lim
Nicolas Ballas
Qifan Wang
Rama Chellappa
Amjad Almahairi
VLM
MLLM
386
50
0
19 Dec 2023
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg
ObjD
382
41
0
19 Dec 2023
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Sihan Liu
Yiwei Ma
Xiaoqing Zhang
Haowei Wang
Jiayi Ji
Xiaoshuai Sun
Rongrong Ji
411
86
0
19 Dec 2023
GSVA: Generalized Segmentation via Multimodal Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2023
Zhuofan Xia
Dongchen Han
Yizeng Han
Xuran Pan
Shiji Song
Gao Huang
VLM
596
125
0
15 Dec 2023
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment
M. Lavrenyuk
Shariq Farooq Bhat
Matthias Müller
Peter Wonka
ObjD
MDE
242
13
0
13 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Computer Vision and Pattern Recognition (CVPR), 2023
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
311
33
0
13 Dec 2023
Unveiling Parts Beyond Objects:Towards Finer-Granularity Referring Expression Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Wenxuan Wang
Tongtian Yue
Yisi Zhang
Longteng Guo
Xingjian He
Xinlong Wang
Jing Liu
ObjD
266
24
0
13 Dec 2023
CLIP in Medical Imaging: A Comprehensive Survey
Medical Image Analysis (MIA), 2023
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Hongtu Zhu
CLIP
MedIm
LM&MA
VLM
577
43
0
12 Dec 2023
Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey
Shengchao Chen
Guodong Long
Jing Jiang
Dikai Liu
Chengqi Zhang
SyDa
AI4CE
308
37
0
05 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Computer Vision and Pattern Recognition (CVPR), 2023
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
301
30
0
04 Dec 2023
PixelLM: Pixel Reasoning with Large Multimodal Model
Computer Vision and Pattern Recognition (CVPR), 2023
Zhongwei Ren
Zhicheng Huang
Yunchao Wei
Yao-Min Zhao
Dongmei Fu
Jiashi Feng
Xiaojie Jin
VLM
MLLM
LRM
369
188
0
04 Dec 2023
Towards Generalizable Referring Image Segmentation via Target Prompt and Visual Coherence
International Conference on Information Photonics (ICIP), 2023
Yajie Liu
Pu Ge
Haoxiang Ma
Shichao Fan
Qingjie Liu
Di Huang
Yunhong Wang
198
1
0
01 Dec 2023
Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation
Minhyeok Lee
Dogyoon Lee
Jungho Lee
Suhwan Cho
Heeseung Choi
Ig-Jae Kim
Sangyoun Lee
175
0
0
29 Nov 2023
Explaining CLIP's performance disparities on data from blind/low vision users
Computer Vision and Pattern Recognition (CVPR), 2023
Daniela Massiceti
Camilla Longden
Agnieszka Slowik
Samuel Wills
Martin Grayson
C. Morrison
VLM
327
14
0
29 Nov 2023
RISAM: Referring Image Segmentation via Mutual-Aware Attention Features
Mengxi Zhang
Yiming Liu
Xiangjun Yin
Huanjing Yue
Jingyu Yang
442
1
0
27 Nov 2023
Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Yifei Chen
Dapeng Chen
Ruijin Liu
Sai Zhou
Wenyuan Xue
Wei Peng
279
15
0
27 Nov 2023
Spatially Covariant Image Registration with Text Prompts
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
259
20
0
27 Nov 2023
End-to-End Breast Cancer Radiotherapy Planning via LMMs with Consistency Embedding
Kwanyoung Kim
Y. Oh
S. Park
H. Byun
Joongyo Lee
Jin Sung Kim
Yong Bae Kim
Jong Chul Ye
418
0
0
27 Nov 2023
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Junhao Chen
Peng Rong
Jingbo Sun
Chao Li
Xiang Li
Hongwu Lv
VLM
179
4
0
22 Nov 2023
VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search
IEEE Transactions on Image Processing (IEEE TIP), 2023
Shuting He
Hao Luo
Wei Jiang
Xudong Jiang
Henghui Ding
283
68
0
13 Nov 2023
Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in Clutter
Conference on Robot Learning (CoRL), 2023
Georgios Tziafas
Yucheng Xu
Arushi Goel
Mohammadreza Kasaei
Zhibin Li
Hamidreza Kasaei
240
40
0
09 Nov 2023
NExT-Chat: An LMM for Chat, Detection and Segmentation
Ao Zhang
Yuan Yao
Wei Ji
Zhiyuan Liu
Tat-Seng Chua
MLLM
VLM
355
73
0
08 Nov 2023
GLaMM: Pixel Grounding Large Multimodal Model
Computer Vision and Pattern Recognition (CVPR), 2023
H. Rasheed
Muhammad Maaz
Sahal Shaji Mullappilly
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakkal
Rao M. Anwer
Erix Xing
Ming-Hsuan Yang
Fahad S. Khan
MLLM
VLM
434
396
0
06 Nov 2023
Towards a Unified Transformer-based Framework for Scene Graph Generation and Human-object Interaction Detection
IEEE Transactions on Image Processing (IEEE TIP), 2023
Tao He
Lianli Gao
Jingkuan Song
Yuan-Fang Li
ViT
250
14
0
03 Nov 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Tianrui Hui
Zihan Ding
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Jiao Dai
Jizhong Han
Si Liu
300
7
0
02 Nov 2023
Towards Omni-supervised Referring Expression Segmentation
IEEE International Conference on Multimedia and Expo (ICME), 2023
Minglang Huang
Weihao Ye
Gen Luo
Guannan Jiang
Weilin Zhuang
Xiaoshuai Sun
315
1
0
01 Nov 2023
CHAIN: Exploring Global-Local Spatio-Temporal Information for Improved Self-Supervised Video Hashing
ACM Multimedia (ACM MM), 2023
Rukai Wei
Yu Liu
Jingkuan Song
Heng Cui
Yanzhao Xie
Ke Zhou
139
16
0
29 Oct 2023
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuchen Suo
Linchao Zhu
Yi Yang
289
20
0
27 Oct 2023
Open-NeRF: Towards Open Vocabulary NeRF Decomposition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Hao Zhang
Fang Li
Narendra Ahuja
177
17
0
25 Oct 2023
FD-Align: Feature Discrimination Alignment for Fine-tuning Pre-Trained Models in Few-Shot Learning
Neural Information Processing Systems (NeurIPS), 2023
Kun Song
Huimin Ma
Bochao Zou
Huishuai Zhang
Weiran Huang
308
16
0
23 Oct 2023
Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation
Francisco Eiras
Kemal Oksuz
Adel Bibi
Juil Sock
P. Dokania
304
2
0
20 Oct 2023
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Zuxuan Wu
Zejia Weng
Wujian Peng
Xitong Yang
Ang Li
Larry S. Davis
Yu-Gang Jiang
CLIP
VLM
243
29
0
08 Oct 2023
MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP
Prajwal Ganugula
Y. Kumar
N. Reddy
Prabhath Chellingi
A. Thakur
Neeraj Kasera
C. S. Anand
CLIP
DiffM
165
3
0
24 Sep 2023
Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography
Rabin Adhikari
Manish Dhakal
Safal Thapaliya
K. Poudel
Prasiddha Bhandari
Bishesh Khanal
211
8
0
22 Sep 2023
CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2023
Chen Jiang
Yuchen Yang
Martin Jägersand
186
3
0
17 Sep 2023
Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation
Conference on Multimedia Modeling (MMM), 2023
Yixing Lu
Zhaoxin Fan
Min Xu
173
1
0
12 Sep 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
185
41
0
07 Sep 2023
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Jiajin Tang
Ge Zheng
Jingyi Yu
Sibei Yang
ObjD
215
39
0
03 Sep 2023
Contrastive Grouping with Transformer for Referring Image Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Jiajin Tang
Ge Zheng
Cheng Shi
Sibei Yang
ViT
314
57
0
02 Sep 2023
Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
IEEE International Conference on Computer Vision (ICCV), 2023
Dongwon Kim
Nam-Won Kim
Cuiling Lan
Suha Kwak
VLM
275
26
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
IEEE International Conference on Computer Vision (ICCV), 2023
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
Lulu Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
250
46
0
28 Aug 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Yutao Hu
Qixiong Wang
Wenqi Shao
Enze Xie
Zhenguo Li
Jungong Han
Ping Luo
3DV
228
69
0
26 Aug 2023
CgT-GAN: CLIP-guided Text GAN for Image Captioning
ACM Multimedia (ACM MM), 2023
Jiarui Yu
Haoran Li
Y. Hao
B. Zhu
Tong Xu
Xiangnan He
VLM
CLIP
219
24
0
23 Aug 2023
Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields
IEEE International Conference on Computer Vision (ICCV), 2023
H. Song
Seokhun Choi
Hoseok Do
Chul Lee
Taehyeong Kim
DiffM
246
28
0
23 Aug 2023
Previous
1
2
3
4
5
6
Next