2105.01839
Cited By
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
5 May 2021
Guang Feng
Zhiwei Hu
Lihe Zhang
Huchuan Lu
EgoV
Papers citing
"Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation"
50 / 106 papers shown
Layover or Direct Flight: Rethinking Audio-Guided Image Segmentation
Joel Alberto Santos
Zongwei Wu
Xavier Alameda-Pineda
Radu Timofte
128
0
0
27 Nov 2025
RefAM: Attention Magnets for Zero-Shot Referral Segmentation
Anna Kukleva
Enis Simsar
A. Tonioni
Muhammad Ferjad Naeem
F. Tombari
J. E. Lenssen
Bernt Schiele
DiffM
VLM
709
0
0
26 Sep 2025
Improving Generalized Visual Grounding with Instance-aware Joint Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Ming Dai
Wenxuan Cheng
Jiang-Jiang Liu
Lingfeng Yang
Zhenhua Feng
Wankou Yang
Jingdong Wang
ObjD
ISeg
355
6
0
17 Sep 2025
TFANet: Three-Stage Image-Text Feature Alignment Network for Robust Referring Image Segmentation
Qianqi Lu
Yuxiang Xie
Jing Zhang
Shiwei Zou
Yan Chen
Xidao Luan
226
0
0
16 Sep 2025
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
Jingchao Wang
Zhijian Wu
Dingjiang Huang
Yefeng Zheng
Hong Wang
213
3
0
06 Aug 2025
Referring Remote Sensing Image Segmentation with Cross-view Semantics Interaction Network
Jiaxing Yang
Lihe Zhang
Huchuan Lu
249
1
0
02 Aug 2025
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
521
19
0
01 Aug 2025
Multi-encoder nnU-Net outperforms transformer models with self-supervised pretraining
Seyedeh Sahar Taheri Otaghsara
Reza Rahmanzadeh
ViT
335
0
0
01 Jul 2025
ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections
Ziling Huang
Yidan Zhang
Shin'ichi Satoh
ObjD
222
1
0
18 Jun 2025
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
Jingchao Wang
Hong Wang
Wenlong Zhang
Kunhua Ji
Dingjiang Huang
Yefeng Zheng
ObjD
440
3
0
22 Apr 2025
LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation
Pattern Recognition (Pattern Recogn.), 2025
Jiachen Li
Qing Xie
Xiaohan Yu
Hongyun Wang
Jinyu Xu
Yongjian Liu
ObjD
520
3
0
20 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
International Conference on Learning Representations (ICLR), 2025
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
578
2
0
15 Apr 2025
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Tao Zhang
Xuelong Li
Zilong Huang
Yuchen Ren
Weixian Lei
XueQing Deng
Shihao Chen
Shilin Xu
Jiashi Feng
MLLM
LRM
419
20
0
14 Apr 2025
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities
Jing Liu
Wenxuan Wang
Yisi Zhang
Yepeng Tang
Xingjian He
Longteng Guo
Tongtian Yue
Xinlong Wang
ObjD
346
2
0
02 Apr 2025
BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
Rafi Ibn Sultan
Hui Zhu
Chengyin Li
Dongxiao Zhu
290
1
0
30 Mar 2025
Referring Human Pose and Mask Estimation in the Wild
Neural Information Processing Systems (NeurIPS), 2024
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Lin Wang
298
9
0
27 Oct 2024
LESS: Label-Efficient and Single-Stage Referring 3D Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Xuexun Liu
Xiaoxu Xu
Jinlong Li
Qiudan Zhang
Xu Wang
Andrii Zadaianchuk
Lin Ma
479
4
0
17 Oct 2024
Segment as You Wish -- Free-Form Language-Based Segmentation for Medical Images
Longchao Da
Rui Wang
Xiaojian Xu
Parminder Bhatia
Taha A. Kass-Hout
Hua Wei
Cao Xiao
MedIm
VLM
386
2
0
02 Oct 2024
Fully Aligned Network for Referring Image Segmentation
Visual Communications and Image Processing (VCIP), 2024
Yong-Jin Liu
Ruihao Xu
Yansong Tang
329
0
0
29 Sep 2024
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
V. Bhat
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
553
10
0
16 Sep 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
ACM Multimedia (MM), 2024
Hongyu Li
Tianrui Hui
Zihan Ding
Jing Zhang
Bin Ma
Xiaoming Wei
Jizhong Han
Si Liu
DiffM
274
5
0
12 Sep 2024
Language-guided Scale-aware MedSegmentor for Lesion Segmentation in Medical Imaging
Shuyi Ouyang
Jinyang Zhang
Xiangye Lin
Xilai Wang
Qingqing Chen
Yen-Wei Chen
Lanfen Lin
VLM
451
0
0
30 Aug 2024
Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation
IEEE Transactions on Multimedia (IEEE TMM), 2024
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
332
43
0
14 Aug 2024
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
European Conference on Computer Vision (ECCV), 2024
Wei Chen
Mahdieh Hatamian
Yu Wu
276
25
0
02 Aug 2024
Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
Seonghoon Yu
Paul Hongsuck Seo
Jeany Son
DiffM
479
11
0
10 Jul 2024
SafaRi: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
Sayan Nag
Koustava Goswami
Srikrishna Karanam
337
6
0
02 Jul 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRM
MLLM
VLM
412
149
0
27 Jun 2024
SRC-Net: Bi-Temporal Spatial Relationship Concerned Network for Change Detection
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE JSTARS), 2024
Hongjia Chen
Xin Xu
Fangling Pu
395
19
0
09 Jun 2024
HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation
Zhuoyan Luo
Yinghao Wu
Yong-Jin Liu
Yicheng Xiao
Jinqiang Cui
Yujiu Yang
409
0
0
24 May 2024
Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image Segmentation
Yichen Yan
Xingjian He
Sihan Chen
Shichen Lu
Jing Liu
307
1
0
18 May 2024
Spatial Semantic Recurrent Mining for Referring Image Segmentation
Jiaxing Yang
Lihe Zhang
Jiayu Sun
Huchuan Lu
341
1
0
15 May 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
241
26
0
18 Apr 2024
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Shuting He
Henghui Ding
VOS
324
74
0
04 Apr 2024
Deep Instruction Tuning for Segment Anything Model
Xiaorui Huang
Gen Luo
Chaoyang Zhu
Bo Tong
Weihao Ye
Xiaoshuai Sun
Rongrong Ji
VLM
380
4
0
31 Mar 2024
ReMamber: Referring Image Segmentation with Mamba Twister
Yu-Hao Yang
Chaofan Ma
Jiangchao Yao
Zhun Zhong
Ya Zhang
Yanfeng Wang
Mamba
374
59
0
26 Mar 2024
Empowering Segmentation Ability to Multi-modal Large Language Models
Yuqi Yang
Peng-Tao Jiang
Jing Wang
Hao Zhang
Kai Zhao
Jinwei Chen
Yue Liu
LRM
VLM
322
9
0
21 Mar 2024
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
245
0
0
14 Mar 2024
RESMatch: Referring Expression Segmentation in a Semi-Supervised Manner
Ying Zang
Chenglong Fu
Runlong Cao
Didi Zhu
Min Zhang
Wenjun Hu
Lanyun Zhu
Tianrun Chen
336
13
0
08 Feb 2024
Collaborative Position Reasoning Network for Referring Image Segmentation
Jianjian Cao
Beiya Dai
Yulin Li
Xiameng Qin
Jingdong Wang
361
1
0
22 Jan 2024
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
327
28
0
25 Dec 2023
SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation
Wenxi Yue
Jing Zhang
Kun Hu
Qiuxia Wu
Zongyuan Ge
Yong Xia
Jiebo Luo
Zhiyong Wang
260
7
0
22 Dec 2023
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg
ObjD
471
54
0
19 Dec 2023
Context Disentangling and Prototype Inheriting for Robust Visual Grounding
Wei Tang
Liang Li
Xuejing Liu
Lu Jin
Jinhui Tang
Zechao Li
307
45
0
19 Dec 2023
GSVA: Generalized Segmentation via Multimodal Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2024
Zhuofan Xia
Dongchen Han
Yizeng Han
Xuran Pan
Shiji Song
Gao Huang
VLM
695
155
0
15 Dec 2023
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment
M. Lavrenyuk
Shariq Farooq Bhat
Matthias Müller
Peter Wonka
ObjD
MDE
288
16
0
13 Dec 2023
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Wenxuan Wang
Tongtian Yue
Yisi Zhang
Longteng Guo
Xingjian He
Xinlong Wang
Jing Liu
ObjD
357
27
0
13 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Computer Vision and Pattern Recognition (CVPR), 2024
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
378
35
0
04 Dec 2023
Towards Generalizable Referring Image Segmentation via Target Prompt and Visual Coherence
International Conference on Image Processing (ICIP), 2023
Yajie Liu
Pu Ge
Haoxiang Ma
Shichao Fan
Qingjie Liu
Di Huang
Yunhong Wang
244
2
0
01 Dec 2023
Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in Clutter
Conference on Robot Learning (CoRL), 2023
Georgios Tziafas
Yucheng Xu
Arushi Goel
Mohammadreza Kasaei
Zhibin Li
Hamidreza Kasaei
305
45
0
09 Nov 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Tianrui Hui
Zihan Ding
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Jiao Dai
Jizhong Han
Si Liu
353
8
0
02 Nov 2023