ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.14757
  4. Cited By
A Simple Baseline for Open-Vocabulary Semantic Segmentation with
  Pre-trained Vision-language Model

A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model

29 December 2021
Mengde Xu
Zheng-Wei Zhang
Fangyun Wei
Yutong Lin
Yue Cao
Han Hu
Xiang Bai
    VLM
ArXivPDFHTML

Papers citing "A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model"

38 / 38 papers shown
Title
Split Matching for Inductive Zero-shot Semantic Segmentation
Split Matching for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Xu Zheng
Dongyue Li
Chong Yi
Seigo Ito
D. Paudel
Luc Van Gool
Hiroshi Murase
Daisuke Deguchi
VLM
54
0
0
08 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Y. Chen
Zhuotao Tian
VLM
38
0
0
07 May 2025
Handling Imbalanced Pseudolabels for Vision-Language Models with Concept Alignment and Confusion-Aware Calibrated Margin
Handling Imbalanced Pseudolabels for Vision-Language Models with Concept Alignment and Confusion-Aware Calibrated Margin
Yuchen Wang
X. Bai
X. Li
Weili Guan
Liqiang Nie
Xinyang Chen
VLM
44
0
0
04 May 2025
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM
VLM
3DV
85
1
0
29 Nov 2024
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Sule Bai
Yong-Jin Liu
Yifei Han
Haoji Zhang
Yansong Tang
VLM
79
3
0
24 Nov 2024
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
M. Arda Aydın
Efe Mert Çırpar
Elvin Abdinli
Gözde B. Ünal
Y. Sahin
VLM
71
0
0
18 Nov 2024
Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery
Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery
Chuyu Zhang
Peiyan Gu
Xueyang Yu
Xuming He
28
0
0
17 Oct 2024
Locality Alignment Improves Vision-Language Models
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Y. Zou
Tatsunori Hashimoto
VLM
67
3
0
14 Oct 2024
OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and
  Open-Vocabulary Semantic Scene Graphs
OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and Open-Vocabulary Semantic Scene Graphs
Venkata Naren Devarakonda
Raktim Gautam Goswami
Ali Umut Kaypak
Naman Patel
Rooholla Khorrambakht
P. Krishnamurthy
Farshad Khorrami
LM&Ro
37
3
0
08 Oct 2024
Rethinking Prompting Strategies for Multi-Label Recognition with Partial
  Annotations
Rethinking Prompting Strategies for Multi-Label Recognition with Partial Annotations
Samyak Rawlekar
Shubhang Bhatnagar
Narendra Ahuja
VLM
31
1
0
12 Sep 2024
DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge
  Transfer
DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer
Runjia Li
Junlin Han
Luke Melas-Kyriazi
Chunyi Sun
Zhaochong An
Zhongrui Gui
Shuyang Sun
Philip Torr
Tomas Jakab
36
1
0
12 Sep 2024
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary
  Segmentation
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Xi Chen
Haosen Yang
Sheng Jin
Xiatian Zhu
H. Yao
VLM
29
3
0
05 Sep 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
46
3
0
31 Jul 2024
3D Weakly Supervised Semantic Segmentation with 2D Vision-Language
  Guidance
3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance
Xiaoxu Xu
Yitian Yuan
Jinlong Li
Qiudan Zhang
Zequn Jie
Lin Ma
Hao Tang
N. Sebe
Xu Wang
38
2
0
13 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic
  Segmentation
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
36
15
0
11 Jul 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
42
0
0
17 Jun 2024
Improving Multi-label Recognition using Class Co-Occurrence
  Probabilities
Improving Multi-label Recognition using Class Co-Occurrence Probabilities
Samyak Rawlekar
Shubhang Bhatnagar
Vishnuvardhan Pogunulu Srinivasulu
Narendra Ahuja
VLM
37
5
0
24 Apr 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic
  Segmentation
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
41
22
0
12 Apr 2024
Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large
  Multi-Modal Models
Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models
David Kurzendörfer
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
VLM
CLIP
28
2
0
09 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic
  Segmentation
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Pandeng Li
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
23
3
0
08 Apr 2024
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Elham Amin Mansour
Ozan Unal
Suman Saha
Benjamin Bejar
Luc Van Gool
42
1
0
04 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
44
2
0
31 Mar 2024
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion
  Models
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models
Barbara Toniella Corradini
Mustafa Shukor
Paul Couairon
Guillaume Couairon
Franco Scarselli
Matthieu Cord
DiffM
VLM
45
4
0
29 Mar 2024
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin
Chien-Yi Wang
Yu-Chiang Frank Wang
Min-Hung Chen
VLM
23
0
0
22 Jan 2024
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and
  Self-Prompting
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting
Yankai Jiang
Zhongzhen Huang
Rongzhao Zhang
Xiaofan Zhang
Shaoting Zhang
VLM
34
10
0
07 Dec 2023
Auto-Vocabulary Semantic Segmentation
Auto-Vocabulary Semantic Segmentation
Osman Ülger
Maksymilian Kulicki
Yuki M. Asano
Martin R. Oswald
VLM
45
2
0
07 Dec 2023
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf
  Vision-Language Models
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
Jiayun Luo
Siddhesh Khandelwal
Leonid Sigal
Boyang Albert Li
MLLM
VLM
29
7
0
28 Nov 2023
SILC: Improving Vision Language Pretraining with Self-Distillation
SILC: Improving Vision Language Pretraining with Self-Distillation
Muhammad Ferjad Naeem
Yongqin Xian
Xiaohua Zhai
Lukas Hoyer
Luc Van Gool
F. Tombari
VLM
25
33
0
20 Oct 2023
Open-Vocabulary Affordance Detection using Knowledge Distillation and
  Text-Point Correlation
Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation
Tuan V. Vo
Minh Nhat Vu
Baoru Huang
Toan Tien Nguyen
Ngan Le
T. Vo
Anh Nguyen
3DPC
19
10
0
19 Sep 2023
Hierarchical Open-vocabulary Universal Image Segmentation
Hierarchical Open-vocabulary Universal Image Segmentation
Xudong Wang
Shufang Li
Konstantinos Kallidromitis
Yu Kato
Kazuki Kozuka
Trevor Darrell
VLM
OCL
40
36
0
03 Jul 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
33
7
0
14 Jun 2023
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
Dingkang Liang
Jiahao Xie
Zhikang Zou
Xiaoqing Ye
Wei Xu
Xiang Bai
SSL
CLIP
VLM
26
52
0
09 Apr 2023
Open-Vocabulary Affordance Detection in 3D Point Clouds
Open-Vocabulary Affordance Detection in 3D Point Clouds
Toan Ngyen
Minh Nhat Vu
Annalies Vuong
Dzung Nguyen
T. Vo
Ngan Le
A. Nguyen
3DPC
14
32
0
04 Mar 2023
SimCon Loss with Multiple Views for Text Supervised Semantic
  Segmentation
SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Yash J. Patel
Yusheng Xie
Yi Zhu
Srikar Appalaraju
R. Manmatha
29
4
0
07 Feb 2023
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth
  Pre-training
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training
Tianyu Huang
Bowen Dong
Yunhan Yang
Xiaoshui Huang
Rynson W. H. Lau
Wanli Ouyang
W. Zuo
VLM
3DPC
CLIP
42
144
0
03 Oct 2022
Learning to Prompt for Vision-Language Models
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
327
2,267
0
02 Sep 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
298
3,700
0
11 Feb 2021
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image
  Segmentation
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Vijay Badrinarayanan
Alex Kendall
R. Cipolla
SSeg
446
15,637
0
02 Nov 2015
1