Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.06158
Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments
18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Matterport3D: Learning from RGB-D Data in Indoor Environments"
50 / 1,167 papers shown
Title
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
Wenting Xu
Viorela Ila
Luping Zhou
Craig T. Jin
82
0
0
07 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG
LM&Ro
MLLM
LRM
83
1
0
05 Dec 2024
Multi-view Image Diffusion via Coordinate Noise and Fourier Attention
Justin D. Theiss
Norman Müller
Daeil Kim
Aayush Prakash
76
0
0
04 Dec 2024
Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks
Zijiao Yang
Xiangxi Shi
Eric Slyman
Stefan Lee
AAML
84
1
0
03 Dec 2024
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
Dillon Loh
Tomasz Bednarz
Xinxing Xia
Frank Guan
83
0
0
27 Nov 2024
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Mehdi Zayene
Jannik Endres
Albias Havolli
Charles Corbière
Salim Cherkaoui
Alexandre Kontouli
Alexandre Alahi
MDE
153
1
0
27 Nov 2024
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Xinhao Liu
Jiajian Li
Yichen Jiang
Niranjan Sujay
Zheng Yang
Juexiao Zhang
John Abanes
Jing Zhang
Chen Feng
116
2
0
26 Nov 2024
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Stuti Pathak
Prashant Kumar
Nicholus Mboga
Gunther Steenackers
R. Penne
Rudi Penne
269
0
0
26 Nov 2024
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Chan Hee Song
Valts Blukis
Jonathan Tremblay
Stephen Tyree
Yu-Chuan Su
Stan Birchfield
101
8
0
25 Nov 2024
TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation
Linqing Zhong
Chen Gao
Zihan Ding
Yue Liao
Si Liu
Shifeng Zhang
Xu Zhou
Si Liu
LRM
98
4
0
25 Nov 2024
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation
Umamaheswaran Raman Kumar
A. Fayjie
Jurgen Hannaert
Patrick Vandewalle
3DV
3DPC
85
1
0
20 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
81
3
0
18 Nov 2024
The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods
Yifu Tao
Miguel Ángel Muñoz-Bañón
Lintong Zhang
Jiahao Wang
L. Fu
Maurice F. Fallon
44
5
0
15 Nov 2024
VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes
A. Sethuraman
Onur Bagoren
Harikrishnan Seetharaman
Dalton Richardson
Joseph Taylor
Katherine A. Skinner
3DV
45
0
0
07 Nov 2024
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
Xi Yang
Xu Gu
Xingyilang Yin
Xinbo Gao
47
0
0
06 Nov 2024
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
Haochen Zhang
Nader Zantout
Pujith Kachana
Zongyuan Wu
Ji Zhang
Wenshan Wang
3DV
LM&Ro
49
5
0
05 Nov 2024
Deep Learning on 3D Semantic Segmentation: A Detailed Review
Thodoris Betsas
Andreas Georgopoulos
Anastasios Doulamis
Pierre Grussenmeyer
3DV
3DPC
49
1
0
04 Nov 2024
Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images
Kun Huang
Fang-Lue Zhang
Fangfang Zhang
Yu-Kun Lai
Paul L. Rosin
N. Dodgson
42
0
0
04 Nov 2024
CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality
Yiqin Zhao
Mallesham Dasari
Tian Guo
58
0
0
04 Nov 2024
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
Wang Zhao
Jiachen Liu
Sheng Zhang
Heng Chang
Sili Chen
S. X. Huang
Yong-Jin Liu
Hengkai Guo
44
0
0
02 Nov 2024
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Weicai Ye
Chenhao Ji
Zheng Chen
Junyao Gao
Xiaoshui Huang
Song-Hai Zhang
Wanli Ouyang
Tong He
Cairong Zhao
Guofeng Zhang
41
8
0
31 Oct 2024
Deep Learning for 3D Point Cloud Enhancement: A Survey
Siwen Quan
Junhao Yu
Ziming Nie
Muze Wang
Sijia Feng
Pei An
Jiaqi Yang
3DPC
55
3
0
30 Oct 2024
SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark
Hyunjun Jung
Weihang Li
Shun-cheng Wu
William Bittner
Nikolas Brasch
...
Eduardo Pérez-Pellitero
Zhensong Zhang
Arthur Moreau
Nassir Navab
Benjamin Busam
58
1
0
30 Oct 2024
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents
Junting Chen
Checheng Yu
Xunzhe Zhou
Tianqi Xu
Yao Mu
Mengkang Hu
Wenqi Shao
Yuran Wang
Ge Li
Lin Shao
76
4
0
30 Oct 2024
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting
Yongqian Li
Zijia Kuang
Ting Li
Guyue Zhou
Shaohui Zhang
Zike Yan
3DGS
38
5
0
29 Oct 2024
ANAVI: Audio Noise Awareness using Visuals of Indoor environments for NAVIgation
Vidhi Jain
Rishi Veerapaneni
Yonatan Bisk
46
0
0
24 Oct 2024
Scale Propagation Network for Generalizable Depth Completion
Haotian Wang
Meng Yang
Xinhu Zheng
Gang Hua
31
2
0
24 Oct 2024
PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model
Zhongchen Deng
Zhechen Yang
Chi Chen
Cheng Zeng
Yan Meng
Bisheng Yang
22
1
0
21 Oct 2024
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image
Yu Zhao
Hao Fei
Xiangtai Li
L. Qin
Jiayi Ji
Erik Cambria
Meishan Zhang
Hao Fei
Jianguo Wei
DiffM
34
1
0
20 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Yue Yang
45
3
0
18 Oct 2024
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
69
4
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
52
0
0
16 Oct 2024
3D Gaussian Splatting in Robotics: A Survey
Siting Zhu
Guangming Wang
Dezhi Kong
Hesheng Wang
3DGS
57
7
0
16 Oct 2024
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
Joey Wilson
Ruihan Xu
Yile Sun
Parker Ewen
Minghan Zhu
Kira Barton
Maani Ghaffari
43
0
0
15 Oct 2024
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination
Xinxin Zhao
Wenzhe Cai
Likun Tang
Teng Wang
LM&Ro
45
3
0
13 Oct 2024
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Hang Yin
Xiuwei Xu
Zhenyu Wu
Jie Zhou
Jiwen Lu
48
14
0
10 Oct 2024
Automated Creation of Digital Cousins for Robust Policy Learning
Tianyuan Dai
Josiah Wong
Yunfan Jiang
Chen Wang
Cem Gokmen
Ruohan Zhang
Jiajun Wu
Li Fei-Fei
36
22
0
09 Oct 2024
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology
Xinyu Wang
Donglin Yang
Ziqin Wang
Hohin Kwan
Jinyu Chen
Wenjun Wu
Hongsheng Li
Yue Liao
Si Liu
29
14
0
09 Oct 2024
3D Representation Methods: A Survey
Zhengren Wang
3DGS
32
3
0
09 Oct 2024
CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality
Wenjie Chang
Hao Ai
Tianzhu Zhang
Lin Wang
MDE
26
0
0
08 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Xue Liu
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
72
4
0
07 Oct 2024
LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation
Jianhao Jiao
Jinhao He
Changkun Liu
Sebastian Aegidius
Xiangcheng Hu
Tristan Braud
Dimitrios Kanoulas
59
4
0
06 Oct 2024
Semantic Environment Atlas for Object-Goal Navigation
Nuri Kim
Jeongho Park
Mineui Hong
Songhwai Oh
33
0
0
05 Oct 2024
The Wallpaper is Ugly: Indoor Localization using Vision and Language
Seth Pate
Lawson L. S. Wong
41
0
0
04 Oct 2024
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Ye Tian
Yue Yang
Kaixin Ma
Xiaoman Pan
Yangqiu Song
Dong Yu
LM&Ro
55
3
0
03 Oct 2024
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Mohan Xu
Kai Li
Guo Chen
Xiaolin Hu
51
0
0
02 Oct 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
44
2
0
02 Oct 2024
Find Everything: A General Vision Language Model Approach to Multi-Object Search
Daniel Choi
Angus Fung
Haitong Wang
Aaron Hao Tan
65
3
0
01 Oct 2024
Active Neural Mapping at Scale
Zijia Kuang
Zike Yan
Hao Zhao
Guyue Zhou
Hongbin Zha
30
3
0
30 Sep 2024
Grounding 3D Scene Affordance From Egocentric Interactions
Cuiyu Liu
Wei Zhai
Yuhang Yang
Hongchen Luo
Sen Liang
Yang Cao
Zheng-Jun Zha
42
1
0
29 Sep 2024
Previous
1
2
3
4
5
6
...
22
23
24
Next