ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.03546
  4. Cited By
Language-driven Semantic Segmentation

Language-driven Semantic Segmentation

10 January 2022
Boyi Li
Kilian Q. Weinberger
Serge J. Belongie
V. Koltun
René Ranftl
    VLM
ArXivPDFHTML

Papers citing "Language-driven Semantic Segmentation"

50 / 85 papers shown
Title
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models
F. Khan
Jun Chen
Youssef Mohamed
Chun-Mei Feng
Mohamed Elhoseiny
VLM
21
0
0
08 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Y. Chen
Zhuotao Tian
VLM
38
0
0
07 May 2025
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Feng Xue
Wenzhuang Xu
Guofeng Zhong
Anlong Minga
N. Sebe
65
0
0
01 May 2025
ForesightNav: Learning Scene Imagination for Efficient Exploration
ForesightNav: Learning Scene Imagination for Efficient Exploration
Hardik Shah
Jiaxu Xing
Nico Messikommer
Boyang Sun
Marc Pollefeys
Davide Scaramuzza
67
0
0
22 Apr 2025
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment
Sebastián Barbas Laina
Simon Boche
Sotiris Papatheodorou
Simon Schaefer
Jaehyung Jung
Stefan Leutenegger
44
0
0
11 Apr 2025
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
J. Huang
Baoxiong Jia
Y. Wang
Ziyu Zhu
Xiongkun Linghu
Qing Li
Song-Chun Zhu
Siyuan Huang
75
3
0
28 Mar 2025
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Shijie Zhou
Hui Ren
Yijia Weng
Shuwang Zhang
Zhen Wang
...
Zhiwen Fan
Suya You
Z. Wang
Leonidas J. Guibas
A. Kadambi
VGen
3DGS
83
0
0
26 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
109
0
0
20 Mar 2025
SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints
SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints
Weiwen Hu
Niccolò Parodi
Marcus Zepp
I. Feldmann
O. Schreer
Peter Eisert
VLM
90
0
0
19 Mar 2025
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Wanhua Li
Renping Zhou
Jiawei Zhou
Yingwei Song
Johannes Herter
Minghan Qin
Gao Huang
Hanspeter Pfister
3DGS
VLM
66
0
0
13 Mar 2025
Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations
Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations
Meng Wang
Fan Wu
Yunchuan Qin
Ruihui Li
Zhuo Tang
KenLi Li
3DPC
91
0
0
08 Mar 2025
Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting
Dominic Maggio
Luca Carlone
94
0
0
07 Mar 2025
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
Yijie Tang
Jiazhao Zhang
Yuqing Lan
Yulan Guo
Dezun Dong
Chenyang Zhu
K. Xu
118
0
0
03 Mar 2025
QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries
QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries
N. H. Chapman
Feras Dayoub
Will N. Browne
Christopher F. Lehnert
VLM
64
0
0
26 Feb 2025
SMITE: Segment Me In TimE
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
90
2
0
20 Feb 2025
TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation
TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation
Navid Rajabi
Jana Kosecka
LM&Ro
3DV
53
0
0
11 Feb 2025
Can video generation replace cinematographers? Research on the cinematic language of generated video
Can video generation replace cinematographers? Research on the cinematic language of generated video
X. Li
Kai WU
Siyi Yang
YiZhan Qu
Guohua. Zhang
...
Mingliang Xiong
Hao Deng
Qingwen Liu
Gang Li
Bin He
VGen
DiffM
90
1
0
16 Dec 2024
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
M. Arda Aydın
Efe Mert Çırpar
Elvin Abdinli
Gözde B. Ünal
Y. Sahin
VLM
66
0
0
18 Nov 2024
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
Zhaochong An
Guolei Sun
Yun Liu
Runjia Li
Min Wu
Ming-Ming Cheng
Ender Konukoglu
Serge J. Belongie
64
4
0
29 Oct 2024
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes
Juliette Marrie
Romain Menegaux
Michael Arbel
Diane Larlus
Julien Mairal
3DGS
39
1
0
18 Oct 2024
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
Joey Wilson
Ruihan Xu
Yile Sun
Parker Ewen
Minghan Zhu
Kira Barton
Maani Ghaffari
36
0
0
15 Oct 2024
3D Vision-Language Gaussian Splatting
3D Vision-Language Gaussian Splatting
Qucheng Peng
Benjamin Planche
Zhongpai Gao
Meng Zheng
Anwesa Choudhuri
Terrence Chen
C. L. P. Chen
Ziyan Wu
3DGS
35
4
0
10 Oct 2024
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
Ayesha Ishaq
Mohamed El Amine Boudjoghra
Jean Lahoud
F. Khan
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
64
1
0
02 Oct 2024
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Ayca Takmaz
Alexandros Delitzas
R. Sumner
Francis Engelmann
Johanna Wald
Federico Tombari
63
11
0
27 Sep 2024
Embodiment-Agnostic Action Planning via Object-Part Scene Flow
Embodiment-Agnostic Action Planning via Object-Part Scene Flow
Weiliang Tang
Jia-Hui Pan
Wei Zhan
Jianshu Zhou
Huaxiu Yao
Yun-Hui Liu
M. Tomizuka
Mingyu Ding
Chi-Wing Fu
41
0
0
16 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free
  Segmentation
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
35
1
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-xiong Wang
70
15
0
05 Sep 2024
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
ISeg
VLM
59
3
0
20 Aug 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
34
1
0
14 Jul 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
33
0
0
17 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
F. Khan
VLM
ISeg
73
6
0
04 Jun 2024
DGD: Dynamic 3D Gaussians Distillation
DGD: Dynamic 3D Gaussians Distillation
Isaac Labe
Noam Issachar
Itai Lang
Sagie Benaim
36
4
0
29 May 2024
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
Lingdong Kong
You-Chen Liu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
VLM
29
12
0
08 May 2024
${M^2D}$NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields
M2D{M^2D}M2DNeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields
N. Wang
Lefei Zhang
Angel X Chang
42
0
0
08 May 2024
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
Minghao Chen
Iro Laina
Andrea Vedaldi
3DGS
40
23
0
29 Apr 2024
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and
  View-consistent 3D Semantic Understanding
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding
Guibiao Liao
Jiankun Li
Zhenyu Bao
Xiaoqing Ye
Jingdong Wang
Qing Li
Kanglin Liu
3DGS
38
14
0
22 Apr 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
36
32
0
21 Apr 2024
Unifying Global and Local Scene Entities Modelling for Precise Action
  Spotting
Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
Kim Hoang Tran
Phuc Vuong Do
Ngoc Quoc Ly
Ngan Le
34
4
0
15 Apr 2024
Physical Property Understanding from Language-Embedded Feature Fields
Physical Property Understanding from Language-Embedded Feature Fields
Albert J. Zhai
Yuan Shen
Emily Y. Chen
Gloria X. Wang
Xinlei Wang
Sheng Wang
Kaiyu Guan
Shenlong Wang
33
13
0
05 Apr 2024
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Ji-Jia Wu
Andy Chia-Hao Chang
Chieh-Yu Chuang
Chun-Pei Chen
Yu-Lun Liu
Min-Hung Chen
Hou-Ning Hu
Yung-Yu Chuang
Yen-Yu Lin
VLM
38
9
0
05 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
35
2
0
31 Mar 2024
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin
Chien-Yi Wang
Yu-Chiang Frank Wang
Min-Hung Chen
VLM
21
0
0
22 Jan 2024
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Antonín Vobecký
Oriane Siméoni
David Hurych
Spyros Gidaris
Andrei Bursuc
Patrick Pérez
Josef Sivic
29
33
0
17 Jan 2024
Auto-Vocabulary Semantic Segmentation
Auto-Vocabulary Semantic Segmentation
Osman Ülger
Maksymilian Kulicki
Yuki M. Asano
Martin R. Oswald
VLM
42
2
0
07 Dec 2023
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Jin-Chuan Shi
Miao Wang
Hao-Bin Duan
Shao-Hua Guan
3DGS
25
84
0
30 Nov 2023
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Zhihao Yuan
Jinke Ren
Chun-Mei Feng
Hengshuang Zhao
Shuguang Cui
Zhen Li
24
26
0
26 Nov 2023
DAE-Net: Deforming Auto-Encoder for fine-grained shape co-segmentation
DAE-Net: Deforming Auto-Encoder for fine-grained shape co-segmentation
Zhiqin Chen
Qimin Chen
Hang Zhou
Hao Zhang
3DPC
3DV
14
2
0
22 Nov 2023
SILC: Improving Vision Language Pretraining with Self-Distillation
SILC: Improving Vision Language Pretraining with Self-Distillation
Muhammad Ferjad Naeem
Yongqin Xian
Xiaohua Zhai
Lukas Hoyer
Luc Van Gool
F. Tombari
VLM
17
32
0
20 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Yikang Shen
Zhenfang Chen
Mingyu Ding
Chuang Gan
40
15
0
10 Oct 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
  Planning
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Yuanyi Zhong
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
27
166
0
28 Sep 2023
12
Next