arXiv: 2210.04150
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP

9 October 2022
Feng Liang
Bichen Wu
Xiaoliang Dai
Kunpeng Li
Yinan Zhao
Hang Zhang
Peizhao Zhang
Peter Vajda
Diana Marculescu
    CLIP
    VLM

Papers citing "Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP"

50 / 331 papers shown
Title
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering
Simon Boeder
Fabian Gigengack
Benjamin Risse
31
7
0
24 Jul 2024
OpenSU3D: Open World 3D Scene Understanding using Foundation Models
Rafay Mohiuddin
Sai Manoj Prakhya
Fiona Collins
Ziyuan Liu
André Borrmann
31
2
0
19 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
34
4
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
23
2
0
18 Jul 2024
OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping
Meng Li
Qi Zhao
Shuchang Lyu
Chunlei Wang
Yujing Ma
Guangliang Cheng
Chenguang Yang
22
4
0
18 Jul 2024
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Mengcheng Lan
Chaofeng Chen
Yiping Ke
Xinjiang Wang
Litong Feng
Wayne Zhang
VLM
26
23
0
17 Jul 2024
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
Zijian Zhou
Zheng Zhu
Holger Caesar
Miaojing Shi
VLM
24
2
0
15 Jul 2024
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion
Philipp Allgeuer
Kyra Ahrens
Stefan Wermter
CLIP
VLM
27
0
0
15 Jul 2024
3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance
Xiaoxu Xu
Yitian Yuan
Jinlong Li
Qiudan Zhang
Zequn Jie
Lin Ma
Hao Tang
N. Sebe
Xu Wang
38
2
0
13 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
29
5
0
13 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
32
0
0
12 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
26
14
0
11 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
44
1
0
09 Jul 2024
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
Jiedong Zhuang
Jiaqi Hu
Lianrui Mu
Rui Hu
Xiaoyu Liang
Jiangnan Ye
Haoji Hu
CLIP
VLM
26
2
0
08 Jul 2024
Open Panoramic Segmentation
Junwei Zheng
Ruiping Liu
Yufan Chen
Kunyu Peng
Chengzhi Wu
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
VLM
26
7
0
02 Jul 2024
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Zihan Gao
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Yuwei Guo
Shuyuan Yang
23
0
0
01 Jul 2024
InterCLIP-MEP: Interactive CLIP and Memory-Enhanced Predictor for Multi-modal Sarcasm Detection
Junjie Chen
Hang Yu
Weidong Liu
Subin Huang
Sanmin Liu
21
2
0
24 Jun 2024
High-resolution open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Francesco Giuliari
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
28
1
0
24 Jun 2024
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
Thomas Stegmüller
Tim Lebailly
Nikola Dukic
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
VLM
25
1
0
23 Jun 2024
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
Rushikesh Zawar
Shaurya Dewan
Andrew F. Luo
Margaret M. Henderson
Michael J. Tarr
Leila Wehbe
VGen
CoGe
28
1
0
19 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
33
0
0
17 Jun 2024
SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations
Sri Harsha Dumpala
Aman Jaiswal
Chandramouli Shama Sastry
E. Milios
Sageev Oore
Hassan Sajjad
CoGe
30
8
0
17 Jun 2024
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
Xiangheng Shan
Dongyue Wu
Guilin Zhu
Yuanjie Shao
Nong Sang
Changxin Gao
VLM
21
15
0
14 Jun 2024
Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation
Diwei Sheng
Giles Hamilton-Fletcher
Mahya Beheshti
Chen Feng
John-Ross Rizzo
16
2
0
11 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
59
4
0
11 Jun 2024
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
S. Linok
T. Zemskova
Svetlana Ladanova
Roman Titkov
Dmitry A. Yudin
Maxim Monastyrny
Aleksei Valenkov
LM&Ro
43
0
0
11 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
64
12
0
09 Jun 2024
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation
Xiaoqi Wang
Wenbin He
Xiwei Xuan
Clint Sebastian
Jorge Henrique Piazentin Ono
...
Sima Behpour
T. Doan
Liang Gou
Han-Wei Shen
Liu Ren
VLM
19
5
0
07 Jun 2024
SMART: Scene-motion-aware human action recognition framework for mental disorder group
Zengyuan Lai
Jiarui Yang
Songpengcheng Xia
Qi Wu
Zhen Sun
Wenxian Yu
Ling Pei
35
2
0
07 Jun 2024
FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping
Yuzhou Ji
He Zhu
Junshu Tang
Wuyi Liu
Zhizhong Zhang
Yuan Xie
Xin Tan
23
8
0
04 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
F. Khan
VLM
ISeg
61
6
0
04 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
38
2
0
03 Jun 2024
Proxy Denoising for Source-Free Domain Adaptation
Song Tang
Wenxin Su
Mao Ye
Jianwei Zhang
Xiatian Zhu
53
1
0
03 Jun 2024
Sparsity- and Hybridity-Inspired Visual Parameter-Efficient Fine-Tuning for Medical Diagnosis
Mingyuan Liu
Lu Xu
Shengnan Liu
Jicong Zhang
28
1
0
28 May 2024
Composed Image Retrieval for Remote Sensing
Bill Psomas
Ioannis Kakogeorgiou
Nikos Efthymiadis
Giorgos Tolias
Ondřej Chum
Yannis Avrithis
Konstantinos Karantzalos
37
4
0
24 May 2024
Tuning-free Universally-Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
37
1
0
23 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
40
21
0
19 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
29
11
0
16 May 2024
Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?
Hari Chandana Kuchibhotla
Sai Srinivas Kancheti
Abbavaram Gowtham Reddy
Vineeth N. Balasubramanian
VLM
21
0
0
13 May 2024
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
Lingdong Kong
You-Chen Liu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
VLM
29
12
0
08 May 2024
A Novel Wide-Area Multiobject Detection System with High-Probability Region Searching
Xianlei Long
Hui Zhao
Chao Chen
Fuqiang Gu
Qingyi Gu
19
3
0
07 May 2024
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
Ye Mao
Junpeng Jing
K. Mikolajczyk
VLM
21
0
0
25 Apr 2024
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations
Sri Harsha Dumpala
Aman Jaiswal
Chandramouli Shama Sastry
E. Milios
Sageev Oore
Hassan Sajjad
VLM
CoGe
35
0
0
25 Apr 2024
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding
Guibiao Liao
Jiankun Li
Zhenyu Bao
Xiaoqing Ye
Jingdong Wang
Qing Li
Kanglin Liu
3DGS
25
13
0
22 Apr 2024
Vocabulary-free Image Classification and Semantic Segmentation
Alessandro Conti
Enrico Fini
Massimiliano Mancini
Paolo Rota
Yiming Wang
Elisa Ricci
VLM
27
2
0
16 Apr 2024
Zero-shot detection of buildings in mobile LiDAR using Language Vision Model
June Moh Goo
Zichao Zeng
Jan Boehm
19
2
0
15 Apr 2024
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Zhongrui Gui
Shuyang Sun
Runjia Li
Jianhao Yuan
Zhaochong An
Karsten Roth
Ameya Prabhu
Philip H. S. Torr
VLM
CLL
24
6
0
15 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLM
LRM
VLM
36
18
0
12 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
34
14
0
12 Apr 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
31
22
0
12 Apr 2024