ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.15654
  4. Cited By
OpenScene: 3D Scene Understanding with Open Vocabularies
v1v2 (latest)

OpenScene: 3D Scene Understanding with Open Vocabularies

Computer Vision and Pattern Recognition (CVPR), 2022
28 November 2022
Songyou Peng
Kyle Genova
ChiyuMaxJiang
Andrea Tagliasacchi
Marc Pollefeys
Thomas Funkhouser
    3DPCVLM
ArXiv (abs)PDFHTML

Papers citing "OpenScene: 3D Scene Understanding with Open Vocabularies"

50 / 362 papers shown
Title
DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving
DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving
Liuhan Yin
Runkun Ju
Guodong Guo
Erkang Cheng
100
0
0
21 Nov 2025
SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors
SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors
Kunyi Li
Michael Niemeyer
Sen Wang
Stefano Gasperini
Nassir Navab
Federico Tombari
3DGS3DV
144
0
0
21 Nov 2025
Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift
Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift
Björn Michele
Alexandre Boulch
Gilles Puy
Tuan-Hung Vu
Renaud Marlet
Nicolas Courty
40
0
0
21 Nov 2025
Enhancing XR Auditory Realism via Multimodal Scene-Aware Acoustic Rendering
Enhancing XR Auditory Realism via Multimodal Scene-Aware Acoustic RenderingACM Symposium on User Interface Software and Technology (UIST), 2025
Tianyu Xu
Jihan Li
Penghe Zu
Pranav Sahay
Maruchi Kim
...
Xun Qian
Katrina Passarella
Mahitha Rachumalla
Rajeev Nongpiur
D Shin
72
2
0
14 Nov 2025
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Large Language Models and 3D Vision for Intelligent Robotic Perception and AutonomyItalian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
312
0
0
14 Nov 2025
DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance Segmentation
DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance Segmentation
Xuexun Liu
Xiaoxu Xu
Qiudan Zhang
Lin Ma
Xu Wang
160
0
0
13 Nov 2025
Sparse3DPR: Training-Free 3D Hierarchical Scene Parsing and Task-Adaptive Subgraph Reasoning from Sparse RGB Views
Sparse3DPR: Training-Free 3D Hierarchical Scene Parsing and Task-Adaptive Subgraph Reasoning from Sparse RGB Views
Haida Feng
Hao Wei
Zewen Xu
Haolin Wang
Chade Li
Yihong Wu
44
0
0
11 Nov 2025
Accelerating Physical Property Reasoning for Augmented Visual Cognition
Accelerating Physical Property Reasoning for Augmented Visual Cognition
Hongbo Lan
Zhenlin An
Haoyu Li
Vaibhav Singh
Longfei Shangguan
60
0
0
05 Nov 2025
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations
Yujia Zhang
Xiaoyang Wu
Yixing Lao
Chengyao Wang
Zhuotao Tian
Naiyan Wang
Hengshuang Zhao
3DPC
117
1
0
27 Oct 2025
OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields
OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields
Lisa Weijler
Sebastian Koch
Fabio Poiesi
Timo Ropinski
Pedro Hermosilla
AI4CE
77
0
0
24 Oct 2025
Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study
Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study
Guanlin Wu
Boyan Su
Yang Zhao
Pu Wang
Yichen Lin
Hao Frank Yang
84
0
0
24 Oct 2025
COS3D: Collaborative Open-Vocabulary 3D Segmentation
COS3D: Collaborative Open-Vocabulary 3D Segmentation
Runsong Zhu
Ka-Hei Hui
Zhengzhe Liu
Qianyi Wu
Weiliang Tang
Shi Qiu
Pheng-Ann Heng
Chi-Wing Fu
3DGS
109
0
0
23 Oct 2025
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
Zhida Zhao
Talas Fu
Yifan Wang
Lijun Wang
Huchuan Lu
VGen
106
0
0
22 Oct 2025
Where, Not What: Compelling Video LLMs to Learn Geometric Causality for 3D-Grounding
Where, Not What: Compelling Video LLMs to Learn Geometric Causality for 3D-Grounding
Yutong Zhong
VGen
52
0
0
19 Oct 2025
3D Weakly Supervised Semantic Segmentation via Class-Aware and Geometry-Guided Pseudo-Label Refinement
3D Weakly Supervised Semantic Segmentation via Class-Aware and Geometry-Guided Pseudo-Label Refinement
Xiaoxu Xu
Xuexun Liu
Jinlong Li
Yitian Yuan
Qiudan Zhang
Lin Ma
Nicu Sebe
Xu Wang
98
1
0
17 Oct 2025
QuASH: Using Natural-Language Heuristics to Query Visual-Language Robotic Maps
QuASH: Using Natural-Language Heuristics to Query Visual-Language Robotic Maps
Matti Pekkanen
Francesco Verdoja
Ville Kyrki
68
0
0
16 Oct 2025
FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation
FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation
Hongrui Wu
Zhicheng Gao
Jin Cao
Kelu Yao
Wen Shen
Zhihua Wei
VLM
48
0
0
09 Oct 2025
UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene
UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene
Christian Maurer
Snehal Jauhri
Sophie Lueth
Georgia Chalvatzaki
64
0
0
08 Oct 2025
Are Heterogeneous Graph Neural Networks Truly Effective? A Causal Perspective
Are Heterogeneous Graph Neural Networks Truly Effective? A Causal Perspective
Xiao Yang
Xuejiao Zhao
Zhiqi Shen
CML
103
0
0
07 Oct 2025
Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Kun Xiang
Terry Jingchen Zhang
Yinya Huang
Jixi He
Zirong Liu
...
J. N. Han
Hang Xu
Han Li
Bin Dong
Xiaodan Liang
PINNAI4CE
264
1
0
06 Oct 2025
GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Weijia Dou
X. Zhang
Yi Bin
Jian Liu
Bo Peng
Guoqing Wang
Yang Yang
H. Shen
92
0
0
02 Oct 2025
Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps
Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps
Kyoungjun Park
Yifan Yang
Changhan Ge
Lili Qiu
Shiqi Jiang
154
0
0
02 Oct 2025
PinPoint3D: Fine-Grained 3D Part Segmentation from a Few Clicks
PinPoint3D: Fine-Grained 3D Part Segmentation from a Few Clicks
Bojun Zhang
Hangjian Ye
Hao Zheng
Jianzheng Huang
Zhengyu Lin
Zhenhong Guo
Feng Zheng
116
0
0
30 Sep 2025
Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy
Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy
Haijier Chen
Bo Xu
Shoujian Zhang
Haoze Liu
Jiaxuan Lin
Jingrong Wang
LRM
98
1
0
29 Sep 2025
OVSeg3R: Learn Open-vocabulary Instance Segmentation from 2D via 3D Reconstruction
OVSeg3R: Learn Open-vocabulary Instance Segmentation from 2D via 3D Reconstruction
Hongyang Li
Jinyuan Qu
Lei Zhang
3DV
116
0
0
28 Sep 2025
HELIOS: Hierarchical Exploration for Language-grounded Interaction in Open Scenes
HELIOS: Hierarchical Exploration for Language-grounded Interaction in Open Scenes
Katrina Ashton
Chahyon Ku
Shrey Shah
W. Jiang
Kostas Daniilidis
Bernadette Bucher
LM&Ro
59
0
0
26 Sep 2025
PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data
PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data
Zhe Zhu
Le Wan
Rui-Xue Xu
Y. Zhang
Honghua Chen
Zhiyang Dou
Cheng Lin
Yuan Liu
Mingqiang Wei
VLM
139
1
0
26 Sep 2025
Human-like Navigation in a World Built for Humans
Human-like Navigation in a World Built for Humans
Bhargav Chandaka
Gloria X. Wang
Haozhe Chen
Henry Che
Albert Zhai
Shenlong Wang
80
0
0
25 Sep 2025
Meta-Memory: Retrieving and Integrating Semantic-Spatial Memories for Robot Spatial Reasoning
Meta-Memory: Retrieving and Integrating Semantic-Spatial Memories for Robot Spatial Reasoning
Yufan Mao
Hanjing Ye
Wenlong Dong
Chengjie Zhang
Hong Zhang
LM&Ro
44
0
0
25 Sep 2025
Embodied AI: From LLMs to World Models
Embodied AI: From LLMs to World Models
Tongtong Feng
Xin Wang
Yu Jiang
Wenwu Zhu
LM&Ro
289
7
0
24 Sep 2025
Agentic Scene Policies: Unifying Space, Semantics, and Affordances for Robot Action
Agentic Scene Policies: Unifying Space, Semantics, and Affordances for Robot Action
Sacha Morin
Kumaraditya Gupta
Mahtab Sandhu
Charlie Gauthier
F. Argenziano
Kirsty Ellis
Liam Paull
LM&Ro
89
0
0
23 Sep 2025
DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving
DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving
Shuyao Shang
Yuntao Chen
Yuqi Wang
Yingyan Li
Zhaoxiang Zhang
125
4
0
22 Sep 2025
RangeSAM: On the Potential of Visual Foundation Models for Range-View represented LiDAR segmentation
RangeSAM: On the Potential of Visual Foundation Models for Range-View represented LiDAR segmentation
Paul Julius Kühn
Duc Anh Nguyen
Arjan Kuijper
Holger Graf
Dieter W. Fellner
3DPC
197
0
0
19 Sep 2025
SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features
SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features
Jinyuan Qu
Hongyang Li
Xingyu Chen
Shilong Liu
Yukai Shi
Tianhe Ren
Ruitao Jing
Lei Zhang
3DPC
177
1
0
19 Sep 2025
FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph
FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph
Xiaolin Zhou
Tingyang Xiao
Liu Liu
Yucheng Wang
Maiyue Chen
Xinrui Meng
Xinjie Wang
Wei Feng
Wei Sui
Zhizhong Su
LRM
101
0
0
17 Sep 2025
OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds
OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds
Chongyu Wang
Kunlei Jing
J. Zhu
Di Wang
3DPC
121
0
0
13 Sep 2025
Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Zhuoxu Huang
Mingqi Gao
Jungong Han
64
1
0
09 Sep 2025
P3-SAM: Native 3D Part Segmentation
P3-SAM: Native 3D Part Segmentation
Changfeng Ma
Yang Li
X. Yan
Jiachen Xu
Yunhan Yang
Chunshi Wang
Zibo Zhao
Yanwen Guo
Zhuo Chen
Chunchao Guo
231
7
0
08 Sep 2025
SGS-3D: High-Fidelity 3D Instance Segmentation via Reliable Semantic Mask Splitting and Growing
SGS-3D: High-Fidelity 3D Instance Segmentation via Reliable Semantic Mask Splitting and Growing
Chaolei Wang
Yang Luo
Jing Du
Siyu Chen
Yiping Chen
Ting Han
ISeg3DGS
164
0
0
05 Sep 2025
Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting
Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting
Sen Wang
Kunyi Li
Siyun Liang
Elena Alegret
Jing Ma
Nassir Navab
Stefano Gasperini
3DGS
92
0
0
05 Sep 2025
OpenMulti: Open-Vocabulary Instance-Level Multi-Agent Distributed Implicit Mapping
OpenMulti: Open-Vocabulary Instance-Level Multi-Agent Distributed Implicit MappingIEEE Robotics and Automation Letters (IEEE RA-L), 2025
Jianyu Dou
Yinan Deng
Jiahui Wang
Xingsi Tang
Yi Yang
Yufeng Yue
48
2
0
01 Sep 2025
Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation
Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2025
Jialiang Kang
Jiawen Wang
Dingsheng Luo
3DPC
100
0
0
30 Aug 2025
SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding
SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding
Jiawen Lin
Shiran Bian
Yihang Zhu
Wenbin Tan
Yachao Zhang
Yuan Xie
Yanyun Qu
84
0
0
28 Aug 2025
M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments
M3DMap: Object-aware Multimodal 3D Mapping for Dynamic EnvironmentsOptical Memory and Neural Networks (OMNN), 2025
Dmitry Yudin
3DPC
96
1
0
23 Aug 2025
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Xinhao Xiang
Kuan-Chuan Peng
Suhas Lohit
Michael Jeffrey Jones
Jiawei Zhang
3DPC
110
0
0
22 Aug 2025
GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation
GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation
Ken Deng
Yunhan Yang
Jingxiang Sun
Xihui Liu
Yebin Liu
Ding Liang
Yan-Pei Cao
3DGS3DV
147
1
0
19 Aug 2025
GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting
GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting
Elena Alegret
Kunyi Li
Sen Wang
Siyun Liang
Michael Niemeyer
Stefano Gasperini
Nassir Navab
Federico Tombari
3DGS
103
2
0
19 Aug 2025
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
Junjie Wang
Keyu Chen
Yulin Li
Bin Chen
Hengshuang Zhao
Xiaojuan Qi
Zhuotao Tian
CLIPVLM
82
1
0
15 Aug 2025
Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting
Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting
Simona Kocour
Assia Benbihi
Torsten Sattler
3DPC
88
0
0
15 Aug 2025
CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios
CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios
Jialei Xu
Zizhuang Wei
Weikang You
Linyun Li
Weijian Sun
3DPC
124
1
0
13 Aug 2025
12345678
Next