1709.06158
Matterport3D: Learning from RGB-D Data in Indoor Environments
18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
Papers citing
"Matterport3D: Learning from RGB-D Data in Indoor Environments"
50 / 1,165 papers shown
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation
Zihan Wang
Seungjun Lee
Gim Hee Lee
VGen
17
0
0
16 May 2025
Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities
Zachary Ravichandran
Fernando Cladera
Jason Hughes
Varun Murali
M. Hsieh
George J. Pappas
Camillo J Taylor
Vijay Kumar
LM&Ro
42
0
0
14 May 2025
NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance
Wenzhe Cai
Jiaqi Peng
Yuqiang Yang
Yujian Zhang
Meng Wei
Hanqing Wang
Yilun Chen
Tai Wang
Jiangmiao Pang
23
0
0
13 May 2025
VISTA: Generative Visual Imagination for Vision-and-Language Navigation
Yanjia Huang
Mingyang Wu
Renjie Li
Zhengzhong Tu
LM&Ro
41
0
0
09 May 2025
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory
Weichen Zhang
Chen Gao
Shiquan Yu
Ruiying Peng
Baining Zhao
Qian Zhang
Jinqiang Cui
Xinlei Chen
Yong Li
LLMAG
LM&Ro
49
0
0
08 May 2025
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding
Henry Zheng
Hao Shi
Qihang Peng
Yong Xien Chng
Rui Huang
Yepeng Weng
Zhongchao Shi
Gao Huang
77
1
0
08 May 2025
LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Xinyuan Zhang
Yonglin Tian
Fei Lin
Yue Liu
Jing Ma
Kornélia Sára Szatmáry
Fei Wang
53
0
0
06 May 2025
Estimating Commonsense Scene Composition on Belief Scene Graphs
Mario A. V. Saucedo
Vignesh Kottayam Viswanathan
Christoforos Kanellakis
G. Nikolakopoulos
3DV
26
0
0
05 May 2025
PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications
Trisanth Srinivasan
Santosh Patapati
41
0
0
03 May 2025
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
63
0
0
01 May 2025
A Survey of Interactive Generative Video
Jiwen Yu
Yiran Qin
Haoxuan Che
Quande Liu
Xinyu Wang
Pengfei Wan
Di Zhang
Kun Gai
Hao Chen
Xihui Liu
VGen
67
0
0
30 Apr 2025
CasaGPT: Cuboid Arrangement and Scene Assembly for Interior Design
Weitao Feng
Hang Zhou
Jing Liao
Li Cheng
Wenbo Zhou
3DV
58
0
0
28 Apr 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Haoran Geng
Feishi Wang
Songlin Wei
Yuchen Li
Bangjun Wang
...
Hao Dong
Siyuan Huang
Yue Wang
Jitendra Malik
Pieter Abbeel
85
4
0
26 Apr 2025
SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
Nader Zantout
Haochen Zhang
Pujith Kachana
J. Qiu
Ji Zhang
Wenshan Wang
LM&Ro
LRM
231
0
0
25 Apr 2025
Dynamic Camera Poses and Where to Find Them
C. Rockwell
Joseph Tung
Nayeon Lee
Xuan Li
David Fouhey
Chen-Hsuan Lin
46
0
0
24 Apr 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
33
0
0
22 Apr 2025
ApexNav: An Adaptive Exploration Strategy for Zero-Shot Object Navigation with Target-centric Semantic Fusion
Mingjie Zhang
Yuheng Du
Chengkai Wu
Jinni Zhou
Zhenchao Qi
Jun Ma
Boyu Zhou
38
0
0
20 Apr 2025
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding
Yuchen Rao
Stefan Ainetter
Sinisa Stekovic
Vincent Lepetit
Friedrich Fraundorfer
3DPC
3DV
254
0
0
18 Apr 2025
RefComp: A Reference-guided Unified Framework for Unpaired Point Cloud Completion
Yixuan Yang
Jinyu Yang
Zixiang Zhao
Victor Sanchez
Feng Zheng
34
0
0
18 Apr 2025
Digital Twin Generation from Visual Data: A Survey
Andrew Melnik
Benjamin Alt
Giang Hoang Nguyen
Artur Wilkowski
Maciej Stefańczyk
Qirui Wu
Sinan Harms
Helge Rhodin
Manolis Savva
Michael Beetz
3DGS
VGen
56
0
0
17 Apr 2025
Real-World Depth Recovery via Structure Uncertainty Modeling and Inaccurate GT Depth Fitting
Delong Suzhang
Meng Yang
32
0
0
16 Apr 2025
CL-CoTNav: Closed-Loop Hierarchical Chain-of-Thought for Zero-Shot Object-Goal Navigation with Vision-Language Models
Yuxin Cai
Xiangkun He
Maonan Wang
Hongliang Guo
W. Yau
Chen Lv
LM&Ro
LRM
42
0
0
11 Apr 2025
Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation
Luo Ling
Bai Qianqian
LM&Ro
44
0
0
09 Apr 2025
SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding
Mingfei Chen
I. D. Gebru
Ishwarya Ananthabhotla
Christian Richardt
Dejan Marković
Jake Sandakly
Steven Krenn
Todd Keebler
Eli Shlizerman
Alexander Richard
29
0
0
08 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
74
1
0
03 Apr 2025
LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images
M. Yang
Yu-Xiao Guo
Yang Liu
Bin Zhou
Xin Tong
3DV
43
0
0
03 Apr 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
Haoyi Duan
Hong-Xing Yu
Sirui Chen
L. Fei-Fei
Jiajun Wu
VGen
72
3
0
01 Apr 2025
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Zike Yan
Qi Wu
Zhihua Wei
Qingbin Liu
59
0
0
31 Mar 2025
Empowering Large Language Models with 3D Situation Awareness
Zhihao Yuan
Yibo Peng
Jinke Ren
Yinghong Liao
Yatong Han
Chun-Mei Feng
Hengshuang Zhao
G. Li
Shuguang Cui
Zhen Li
51
0
0
29 Mar 2025
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments
Yifan Xu
V. Kamat
Carol Menassa
51
0
0
29 Mar 2025
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang
Yurui Chen
Yanpeng Zhou
Yueming Xu
Ze Huang
...
Xinyue Cai
G. Huang
Xingyue Quan
Hang Xu
Li Zhang
LRM
100
0
0
29 Mar 2025
uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
Jonathan Lee
Bolivar Solarte
Chin-Hsuan Wu
Jin-Cheng Jhang
Fu-En Wang
Yi-Hsuan Tsai
Min Sun
54
0
0
27 Mar 2025
Scene-agnostic Pose Regression for Visual Localization
Junwei Zheng
Ruiping Liu
Yuxiao Chen
Zhenfang Chen
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
54
0
0
25 Mar 2025
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
Chenyangguang Zhang
Alexandros Delitzas
Fangjinhua Wang
Ruida Zhang
Xiangyang Ji
Marc Pollefeys
Francis Engelmann
3DV
3DPC
52
4
0
24 Mar 2025
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
Ziming Wei
Bingqian Lin
Yunshuang Nie
Jiaqi Chen
Shikui Ma
Hang Xu
Xiaodan Liang
56
0
0
23 Mar 2025
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
Yue Li
Qi Ma
Runyi Yang
Huapeng Li
Mengjiao Ma
...
E. Konukoglu
Theo Gevers
Luc Van Gool
Martin R. Oswald
Danda Pani Paudel
3DGS
VLM
94
0
0
23 Mar 2025
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Yingping Liang
Yutao Hu
Wenqi Shao
Ying Fu
MDE
49
1
0
21 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
249
0
0
20 Mar 2025
UniK3D: Universal Camera Monocular 3D Estimation
Luigi Piccinelli
Daniel Gehrig
Mattia Segu
Yifan Yang
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
47
0
0
20 Mar 2025
OffsetOPT: Explicit Surface Reconstruction without Normals
Huan Lei
3DPC
72
0
0
20 Mar 2025
IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D Scenes
Haochen Zhang
Nader Zantout
Pujith Kachana
Ji Zhang
Wenshan Wang
VGen
56
0
0
20 Mar 2025
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Akhil Perincherry
Jacob Krantz
Stefan Lee
LM&Ro
41
1
0
20 Mar 2025
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
Weixiao Gao
Liangliang Nan
H. Ledoux
3DV
3DPC
43
0
0
19 Mar 2025
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
Zhixuan Liu
H. Zhu
R. Chen
Jonathan M Francis
Soonmin Hwang
Jiangning Zhang
Jean Oh
VGen
250
0
0
18 Mar 2025
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
Qingbin Liu
LM&Ro
81
1
0
18 Mar 2025
SatDepth: A Novel Dataset for Satellite Image Matching
Rahul P. Deshmukh
A. Kak
MDE
67
0
0
17 Mar 2025
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
Erik Daxberger
Nina Wenzel
David Griffiths
Haiming Gang
Justin Lazarow
...
Kai Kang
Marcin Eichner
Yue Yang
Afshin Dehghan
Peter Grasch
77
3
0
17 Mar 2025
3D Human Interaction Generation: A Survey
Siyuan Fan
Wenke Huang
Xiantao Cai
Bo Du
VGen
60
0
0
17 Mar 2025
Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments
Yuhang Peng
Sidong Wang
Jihaoyu Yang
Shilong Li
Han Wang
Jiangtao Gong
60
0
0
15 Mar 2025
CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts
Chong Su
Yingbin Fu
Zheyuan Hu
Jing Yang
Param Hanji
Shaojun Wang
Xuan Zhao
Cengiz Öztireli
Fangcheng Zhong
3DV
56
0
0
15 Mar 2025