Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017

Matthias Nießner

Shuran Song

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,327 papers shown

SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control

Eduardo Pérez-Pellitero

Gerard Pons-Moll

DiffM VGen

308

20 Dec 2024

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation

...

557

18 Dec 2024

iKap: Kinematics-aware Planning with Imperative LearningIEEE International Conference on Robotics and Automation (ICRA), 2024

542

12 Dec 2024

NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis MethodsIEEE Transactions on Visualization and Computer Graphics (TVCG), 2024

307

11 Dec 2024

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 SecondsComputer Vision and Pattern Recognition (CVPR), 2024

263

09 Dec 2024

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

164

07 Dec 2024

TB-HSU: Hierarchical 3D Scene Understanding with Contextual AffordancesAAAI Conference on Artificial Intelligence (AAAI), 2024

397

07 Dec 2024

TANGO: Training-free Embodied AI Agents for Open-world TasksComputer Vision and Pattern Recognition (CVPR), 2024

331

05 Dec 2024

Multi-view Image Diffusion via Coordinate Noise and Fourier AttentionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

236

04 Dec 2024

Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental AttacksIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

322

03 Dec 2024

AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans

360

27 Nov 2024

Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth EstimationComputer Vision and Pattern Recognition (CVPR), 2024

625

27 Nov 2024

g3D-LF: Generalizable 3D-Language Feature Fields for Embodied TasksComputer Vision and Pattern Recognition (CVPR), 2024

Zihan Wang

Gim Hee Lee

257

26 Nov 2024

CityWalker: Learning Embodied Urban Navigation from Web-Scale VideosComputer Vision and Pattern Recognition (CVPR), 2024

537

26 Nov 2024

Revisiting Point Cloud Completion: Are We Ready For The Real-World?

Rudi Penne

1.2K

26 Nov 2024

DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design GenerationPLoS ONE (PLoS ONE), 2024

Yuxuan Yang

Wenwen Qiang

DiffM

598

25 Nov 2024

TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation

Shifeng Zhang

Xu Zhou

Si Liu

LRM

1.1K

25 Nov 2024

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for RoboticsComputer Vision and Pattern Recognition (CVPR), 2024

841

25 Nov 2024

Understanding World or Predicting Future? A Comprehensive Survey of World ModelsACM Computing Surveys (ACM CSUR), 2024

...

Chen Gao

Fengli Xu

Yong Li

VGen SyDa

517

21 Nov 2024

BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation

Umamaheswaran Raman Kumar

338

20 Nov 2024

VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation

385

18 Nov 2024

The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field MethodsThe international journal of robotics research (IJRR), 2024

Yifu Tao

Miguel Ángel Muñoz-Bañón

246

15 Nov 2024

VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor ScenesIEEE International Conference on Robotics and Automation (ICRA), 2024

A. Sethuraman

Onur Bagoren

Harikrishnan Seetharaman

248

07 Nov 2024

SA3DIP: Segment Any 3D Instance with Potential 3D PriorsNeural Information Processing Systems (NeurIPS), 2024

271

06 Nov 2024

VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation

317

05 Nov 2024

Deep Learning on 3D Semantic Segmentation: A Detailed ReviewRemote Sensing (Remote Sens.), 2024

342

04 Nov 2024

Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images

232

04 Nov 2024

CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented RealityProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2024

Yiqin Zhao

Mallesham Dasari

Tian Guo

403

04 Nov 2024

MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane ReconstructionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

192

02 Nov 2024

DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware DiffusionNeural Information Processing Systems (NeurIPS), 2024

Guofeng Zhang

266

31 Oct 2024

Deep Learning for 3D Point Cloud Enhancement: A Survey

229

30 Oct 2024

SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a BenchmarkNeural Information Processing Systems (NeurIPS), 2024

...

Eduardo Pérez-Pellitero

289

30 Oct 2024

EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM AgentsInternational Conference on Learning Representations (ICLR), 2024

369

30 Oct 2024

ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian SplattingIEEE Robotics and Automation Letters (RA-L), 2024

422

29 Oct 2024

ANAVI: Audio Noise Awareness using Visuals of Indoor environments for NAVIgation

Vidhi Jain

Rishi Veerapaneni

Yonatan Bisk

157

24 Oct 2024

Scale Propagation Network for Generalizable Depth CompletionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

Haotian Wang

Meng Yang

Xinhu Zheng

Gang Hua

274

24 Oct 2024

PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model

216

21 Oct 2024

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-ImageNeural Information Processing Systems (NeurIPS), 2024

263

20 Oct 2024

Vision-Language Navigation with Energy-Based PolicyNeural Information Processing Systems (NeurIPS), 2024

Rui Liu

Wenguan Wang

Yue Yang

229

18 Oct 2024

ARKit LabelMaker: A New Scale for Indoor 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2024

Marc Pollefeys

431

17 Oct 2024

Configurable Embodied Data Generation for Class-Agnostic RGB-D Video SegmentationIEEE Robotics and Automation Letters (RA-L), 2024

Anthony Opipari

Aravindhan K. Krishnan

Odest Chadwicke Jenkins

VOS

255

16 Oct 2024

3D Gaussian Splatting in Robotics: A Survey

Hesheng Wang

273

16 Oct 2024

LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable UncertaintyIEEE Robotics and Automation Letters (RA-L), 2024

370

15 Oct 2024

ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene ImaginationInternational Conference on Learning Representations (ICLR), 2024

238

13 Oct 2024

SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object NavigationNeural Information Processing Systems (NeurIPS), 2024

228

10 Oct 2024

Automated Creation of Digital Cousins for Robust Policy LearningConference on Robot Learning (CoRL), 2024

Tianyuan Dai

Josiah Wong

Yunfan Jiang

Chen Wang

Cem Gokmen

Ruohan Zhang

Jiajun Wu

Li Fei-Fei

271

09 Oct 2024

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology

Ziqin Wang

Hongsheng Li

Si Liu

247

09 Oct 2024

3D Representation Methods: A Survey

Zhengren Wang

3DGS

194

09 Oct 2024

CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality

208

08 Oct 2024

Diffusion Models in 3D Vision: A Survey

Zhen Wang

Dongyuan Li

Xue Liu

Tianyu He

Jiang Bian

Renhe Jiang

MedIm

754

07 Oct 2024