Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017

Matthias Nießner

Shuran Song

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,327 papers shown

MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments

1.2K

18 Mar 2025

FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation TasksIEEE transactions on multimedia (TMM), 2025

360

18 Mar 2025

3D Human Interaction Generation: A Survey

349

17 Mar 2025

SatDepth: A Novel Dataset for Satellite Image Matching

Rahul P. Deshmukh

A. Kak

MDE

257

17 Mar 2025

MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs

...

290

17 Mar 2025

Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments

248

15 Mar 2025

CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts

291

15 Mar 2025

Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation

489

14 Mar 2025

MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation

237

14 Mar 2025

UniGoal: Towards Universal Zero-shot Goal-oriented NavigationComputer Vision and Pattern Recognition (CVPR), 2025

479

13 Mar 2025

PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language NavigationNeural Networks (NN), 2025

330

13 Mar 2025

SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation

511

13 Mar 2025

HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots

...

341

12 Mar 2025

Embodied Crowd Counting

329

11 Mar 2025

SAS: Segment Any 3D Scene with Integrated 2D Priors

301

11 Mar 2025

Self-Supervised Large Scale Point Cloud Completion for Archaeological Site RestorationComputer Vision and Pattern Recognition (CVPR), 2025

Aocheng Li

James Zimmer-Dauphinee

Rajesh Kalyanam

Ian Lindsay

Parker VanValkenburgh

Steven A. Wernke

Daniel G. Aliaga

3DPC

315

06 Mar 2025

Out-of-Distribution Radar Detection in Compound Clutter and Thermal Noise through Variational AutoencodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

340

06 Mar 2025

WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation

685

04 Mar 2025

Pretrained Embeddings as a Behavior Specification Mechanism

247

03 Mar 2025

Semi-Supervised 360 Layout Estimation with Panoramic Collaborative Perturbations

234

03 Mar 2025

AirRoom: Objects Matter in Room ReidentificationComputer Vision and Pattern Recognition (CVPR), 2025

352

03 Mar 2025

EDM: Equirectangular Projection-Oriented Dense Kernelized Feature MatchingComputer Vision and Pattern Recognition (CVPR), 2025

239

28 Feb 2025

UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

529

27 Feb 2025

On Adversarial Attacks In Acoustic Drone Localization

215

27 Feb 2025

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2025

380

26 Feb 2025

Ground-level Viewpoint Vision-and-Language Navigation in Continuous EnvironmentsIEEE International Conference on Robotics and Automation (ICRA), 2025

301

26 Feb 2025

OpenFly: A Comprehensive Platform for Aerial Vision-Language Navigation

...

483

25 Feb 2025

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

357

24 Feb 2025

Personalized Instance-based Navigation Toward User-Specific Objects in Realistic EnvironmentsNeural Information Processing Systems (NeurIPS), 2024

497

20 Feb 2025

Spherical Dense Text-to-Image Synthesis

485

18 Feb 2025

IM360: Large-scale Indoor Mapping with 360 Cameras

431

18 Feb 2025

REGNav: Room Expert Guided Image-Goal NavigationAAAI Conference on Artificial Intelligence (AAAI), 2025

365

15 Feb 2025

TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation

Navid Rajabi

Jana Kosecka

LM&Ro 3DV

461

11 Feb 2025

SphereFusion: Efficient Panorama Depth Estimation via Gated FusionInternational Conference on 3D Vision (3DV), 2025

227

09 Feb 2025

NextBestPath: Efficient 3D Mapping of Unseen EnvironmentsInternational Conference on Learning Representations (ICLR), 2025

255

07 Feb 2025

3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model

232

29 Jan 2025

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric EnhancementIEEE Robotics and Automation Letters (IEEE RA-L), 2025

420

28 Jan 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice RepresentationIEEE International Conference on Robotics and Automation (ICRA), 2025

375

28 Jan 2025

Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot AutonomyThe international journal of robotics research (IJRR), 2024

...

582

28 Jan 2025

Enhancing Monocular Depth Estimation with Multi-Source Auxiliary TasksIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

282

22 Jan 2025

Beyond Uncertainty: Risk-Aware Active View Acquisition for Safe Robot Navigation and 3D Scene Understanding with FisherRF

419

20 Jan 2025

ActiveGAMER: Active GAussian Mapping through Efficient RenderingComputer Vision and Pattern Recognition (CVPR), 2025

509

12 Jan 2025

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering

Walterio W. Mayol-Cuevas

Yunze Liu

Junxiao Shen

3DV

408

12 Jan 2025

Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models

351

07 Jan 2025

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

739

02 Jan 2025

SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic CameraIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

443

31 Dec 2024

Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models

379

31 Dec 2024

"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

...

213

26 Dec 2024

Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few ExamplesAAAI Conference on Artificial Intelligence (AAAI), 2024

210

23 Dec 2024

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense LabelingIEEE Robotics and Automation Letters (RA-L), 2024

Daichi Yashima

Ryosuke Korekata

Komei Sugiura

445

21 Dec 2024