ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.05565
  4. Cited By
Vision-Language Transformer and Query Generation for Referring
  Segmentation

Vision-Language Transformer and Query Generation for Referring Segmentation

12 August 2021
Henghui Ding
Chang-rui Liu
Suchen Wang
Xudong Jiang
ArXivPDFHTML

Papers citing "Vision-Language Transformer and Query Generation for Referring Segmentation"

50 / 173 papers shown
Title
Shatter and Gather: Learning Referring Image Segmentation with Text
  Supervision
Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
Dongwon Kim
Nam-Won Kim
Cuiling Lan
Suha Kwak
VLM
18
19
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
Referring Image Segmentation Using Text Supervision
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
L. Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
27
25
0
28 Aug 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
Beyond One-to-One: Rethinking the Referring Image Segmentation
Yutao Hu
Qixiong Wang
Wenqi Shao
Enze Xie
Zhenguo Li
Jungong Han
Ping Luo
3DV
6
37
0
26 Aug 2023
EAVL: Explicitly Align Vision and Language for Referring Image
  Segmentation
EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Yimin Yan
Xingjian He
Wenxuan Wang
Sihan Chen
J. Liu
ObjD
VLM
16
2
0
18 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion
  Expressions
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
28
101
0
16 Aug 2023
LISA: Reasoning Segmentation via Large Language Model
LISA: Reasoning Segmentation via Large Language Model
Xin Lai
Zhuotao Tian
Yukang Chen
Yanwei Li
Yuhui Yuan
Shu Liu
Jiaya Jia
LM&Ro
VLM
MLLM
LRM
27
385
0
01 Aug 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring
  Video Object Segmentation
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
16
16
0
03 Jul 2023
Hierarchical Open-vocabulary Universal Image Segmentation
Hierarchical Open-vocabulary Universal Image Segmentation
Xudong Wang
Shufang Li
Konstantinos Kallidromitis
Yu Kato
Kazuki Kozuka
Trevor Darrell
VLM
OCL
30
36
0
03 Jul 2023
Towards Open Vocabulary Learning: A Survey
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Bernard Ghanem
Dacheng Tao
ObjD
VLM
25
134
0
28 Jun 2023
Mutual Query Network for Multi-Modal Product Image Segmentation
Mutual Query Network for Multi-Modal Product Image Segmentation
Y. Guo
Wei Feng
Zheng Zhang
Xiancong Ren
Yaoyu Li
Jing Lv
Xinshuai Zhu
Zhangang Lin
Jingping Shao
8
0
0
26 Jun 2023
Online Unsupervised Video Object Segmentation via Contrastive Motion
  Clustering
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering
Lin Xi
Weihai Chen
Xingming Wu
Zhong Liu
Zhengguo Li
VOS
13
9
0
21 Jun 2023
Primitive Generation and Semantic-related Alignment for Universal
  Zero-Shot Segmentation
Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation
Shuting He
Henghui Ding
Wei Jiang
VLM
70
34
0
19 Jun 2023
WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image
  Segmentation
WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation
Ze-Long Cheng
Peng Jin
Hao Li
Kehan Li
Siheng Li
Xiang Ji
Chang-rui Liu
Jie Chen
19
5
0
19 Jun 2023
Text Promptable Surgical Instrument Segmentation with Vision-Language
  Models
Text Promptable Surgical Instrument Segmentation with Vision-Language Models
Zijian Zhou
Oluwatosin O. Alabi
Meng Wei
Tom Kamiel Magda Vercauteren
Miaojing Shi
MedIm
10
22
0
15 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
10
10
0
14 Jun 2023
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot
  Vision-Language Tasks
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks
Yanan Sun
Zi-Qi Zhong
Qi Fan
Chi-Keung Tang
Yu-Wing Tai
VLM
8
4
0
07 Jun 2023
Language Adaptive Weight Generation for Multi-task Visual Grounding
Language Adaptive Weight Generation for Multi-task Visual Grounding
Wei Su
Peihan Miao
Huanzhang Dou
Gaoang Wang
Liang Qiao
Zheyang Li
Xi Li
ObjD
22
32
0
06 Jun 2023
LRVS-Fashion: Extending Visual Search with Referring Instructions
LRVS-Fashion: Extending Visual Search with Referring Instructions
Simon Lepage
Jérémie Mary
David Picard
18
1
0
05 Jun 2023
GRES: Generalized Referring Expression Segmentation
GRES: Generalized Referring Expression Segmentation
Chang Liu
Henghui Ding
Xudong Jiang
25
139
0
01 Jun 2023
RaSP: Relation-aware Semantic Prior for Weakly Supervised Incremental
  Segmentation
RaSP: Relation-aware Semantic Prior for Weakly Supervised Incremental Segmentation
Subhankar Roy
Riccardo Volpi
G. Csurka
Diane Larlus
CLL
22
4
0
31 May 2023
Contextual Object Detection with Multimodal Large Language Models
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
14
77
0
29 May 2023
SOC: Semantic-Assisted Object Cluster for Referring Video Object
  Segmentation
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
Zhuoyan Luo
Yicheng Xiao
Yong-Jin Liu
Shuyan Li
Yitong Wang
Yansong Tang
Xiu Li
Yujiu Yang
VOS
14
32
0
26 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
11
29
0
25 May 2023
Multi-Modal Mutual Attention and Iterative Interaction for Referring
  Image Segmentation
Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
Chang Liu
Henghui Ding
Yulun Zhang
Xudong Jiang
19
47
0
24 May 2023
MMNet: Multi-Mask Network for Referring Image Segmentation
MMNet: Multi-Mask Network for Referring Image Segmentation
Yimin Yan
Xingjian He
Wenxuan Wan
J. Liu
EgoV
14
1
0
24 May 2023
Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud
  Semantic Segmentation
Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation
Shuting He
Xudong Jiang
Wei Jiang
Henghui Ding
3DPC
19
32
0
23 May 2023
Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot
  Instance Segmentation
Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
Shuting He
Henghui Ding
Wei Jiang
ISeg
24
21
0
22 May 2023
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image
  Segmentation
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Wenxuan Wang
Jing Liu
Xingjian He
Yisi Zhang
Cheng Chen
Jiachen Shen
Yan Zhang
Jiangyun Li
14
11
0
19 May 2023
Transformer-Based Visual Segmentation: A Survey
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
32
132
0
19 Apr 2023
Meta Compositional Referring Expression Segmentation
Meta Compositional Referring Expression Segmentation
Li Xu
Mark He Huang
Xindi Shang
Zehuan Yuan
Ying Sun
Jun Liu
28
22
0
10 Apr 2023
Probabilistic Prompt Learning for Dense Prediction
Probabilistic Prompt Learning for Dense Prediction
Hyeongjun Kwon
Taeyong Song
Somi Jeong
Jin-Hwa Kim
Jinhyun Jang
K. Sohn
VLM
14
8
0
03 Apr 2023
Zero-shot Referring Image Segmentation with Global-Local Context
  Features
Zero-shot Referring Image Segmentation with Global-Local Context Features
S. Yu
Paul Hongsuck Seo
Jeany Son
6
49
0
31 Mar 2023
Parallel Vertex Diffusion for Unified Visual Grounding
Parallel Vertex Diffusion for Unified Visual Grounding
Ze-Long Cheng
Kehan Li
Peng Jin
Xiang Ji
Li-ming Yuan
Chang-rui Liu
Jie Chen
DiffM
24
25
0
13 Mar 2023
SelfPromer: Self-Prompt Dehazing Transformers with Depth-Consistency
SelfPromer: Self-Prompt Dehazing Transformers with Depth-Consistency
Cong Wang
Jin-shan Pan
Wanyu Lin
Jiangxin Dong
Xiaomei Wu
VLM
MDE
26
39
0
13 Mar 2023
Universal Instance Perception as Object Discovery and Retrieval
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
24
161
0
12 Mar 2023
Semantics-Aware Dynamic Localization and Refinement for Referring Image
  Segmentation
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip H. S. Torr
31
23
0
11 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
156
213
0
03 Mar 2023
PolyFormer: Referring Image Segmentation as Sequential Polygon
  Generation
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
Jiang Liu
Hui Ding
Zhaowei Cai
Yuting Zhang
R. Satzoda
Vijay Mahadevan
R. Manmatha
ObjD
15
120
0
14 Feb 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip H. S. Torr
S. Bai
VOS
25
131
0
03 Feb 2023
Linguistic Query-Guided Mask Generation for Referring Image Segmentation
Linguistic Query-Guided Mask Generation for Referring Image Segmentation
Zhichao Wei
Xiaohao Chen
Mingqiang Chen
Siyu Zhu
VLM
12
1
0
16 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open
  Vocabulary Instance Segmentation
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
77
31
0
02 Jan 2023
Position-Aware Contrastive Alignment for Referring Image Segmentation
Position-Aware Contrastive Alignment for Referring Image Segmentation
Bo Chen
Zhiwei Hu
Zhilong Ji
Jinfeng Bai
W. Zuo
12
8
0
27 Dec 2022
Fully and Weakly Supervised Referring Expression Segmentation with
  End-to-End Learning
Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning
Hui Li
Mingjie Sun
Jimin Xiao
Eng Gee Lim
Yao-Min Zhao
13
19
0
17 Dec 2022
CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for
  Referring Image Segmentation
CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation
Zicheng Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
Wei Ke
17
29
0
04 Dec 2022
Feature Aggregation and Propagation Network for Camouflaged Object
  Detection
Feature Aggregation and Propagation Network for Camouflaged Object Detection
Tao Zhou
Yi Zhou
Chen Gong
Jian Yang
Yu Zhang
25
130
0
02 Dec 2022
A Unified Mutual Supervision Framework for Referring Expression
  Segmentation and Generation
A Unified Mutual Supervision Framework for Referring Expression Segmentation and Generation
Shijia Huang
Feng Li
Hao Zhang
Siyi Liu
Lei Zhang
Liwei Wang
19
5
0
15 Nov 2022
YORO -- Lightweight End to End Visual Grounding
YORO -- Lightweight End to End Visual Grounding
Chih-Hui Ho
Srikar Appalaraju
Bhavan A. Jasani
R. Manmatha
Nuno Vasconcelos
ObjD
21
21
0
15 Nov 2022
Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation
Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation
Henghui Ding
Hui Zhang
Xudong Jiang
54
59
0
30 Oct 2022
VLT: Vision-Language Transformer and Query Generation for Referring
  Segmentation
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
Henghui Ding
Chang Liu
Suchen Wang
Xudong Jiang
63
115
0
28 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without
  Fine-tuning
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
12
25
0
03 Oct 2022
Previous
1234
Next