Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017

Matthias Nießner

Shuran Song

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,327 papers shown

LiteVLoc: Map-Lite Visual Localization for Image Goal NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024

267

06 Oct 2024

Semantic Environment Atlas for Object-Goal NavigationKnowledge-Based Systems (KBS), 2024

233

05 Oct 2024

The Wallpaper is Ugly: Indoor Localization using Vision and LanguageIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023

Seth Pate

Lawson L. S. Wong

215

04 Oct 2024

DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes

416

03 Oct 2024

SonicSim: A customizable simulation platform for speech processing in moving sound source scenariosInternational Conference on Learning Representations (ICLR), 2024

Kai Li

301

02 Oct 2024

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech SeparationInternational Conference on Learning Representations (ICLR), 2024

Mohan Xu

Kai Li

Guo Chen

Xiaolin Hu

217

02 Oct 2024

Find Everything: A General Vision Language Model Approach to Multi-Object Search

471

01 Oct 2024

Active Neural Mapping at ScaleIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

Zijia Kuang

Zike Yan

Hao Zhao

Guyue Zhou

Hongbin Zha

193

30 Sep 2024

Grounding 3D Scene Affordance From Egocentric Interactions

Cuiyu Liu

Wei Zhai

Yuhang Yang

Hongchen Luo

Sen Liang

Yang Cao

Zheng-Jun Zha

386

29 Sep 2024

Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMsIEEE International Conference on Robotics and Automation (ICRA), 2024

Yuan Zhang

Qi Wu

339

27 Sep 2024

HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian SplattingIEEE International Conference on Robotics and Automation (ICRA), 2024

Zijun Xu

Rui Jin

Ke Wu

Yi Zhao

Zhiwei Zhang

Jieru Zhao

Fei Gao

Zhongxue Gan

Wenchao Ding

235

26 Sep 2024

RT-GuIDE: Real-Time Gaussian Splatting for Information-Driven ExplorationIEEE Robotics and Automation Letters (RA-L), 2024

435

26 Sep 2024

Navigating the Nuances: A Fine-grained Evaluation of Vision-Language NavigationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zehao Wang

Yixin Cao

173

25 Sep 2024

Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language ModelsConference on Robot Learning (CoRL), 2024

Marco Hutter

309

23 Sep 2024

Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras

510

23 Sep 2024

HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal NavigationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

Sehoon Ha

243

22 Sep 2024

From Cognition to Precognition: A Future-Aware Framework for Social NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024

835

20 Sep 2024

Navigation with VLM framework: Towards Going to Any Language

432

18 Sep 2024

Online Diffusion-Based 3D Occupancy Prediction at the Frontier with Probabilistic Map ReconciliationIEEE International Conference on Robotics and Automation (ICRA), 2024

272

16 Sep 2024

Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024

Vineet Kamat

166

16 Sep 2024

Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future ProspectsIEEE Access (IEEE Access), 2024

273

14 Sep 2024

Spatially-Aware Speaker for Vision-and-Language Navigation Instruction GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Muraleekrishna Gopinathan

213

09 Sep 2024

Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective

325

06 Sep 2024

Estimating Indoor Scene Depth Maps from Ultrasonic EchoesInternational Conference on Information Photonics (ICIP), 2024

226

05 Sep 2024

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene UnderstandingNeural Information Processing Systems (NeurIPS), 2024

532

05 Sep 2024

Active Semantic Mapping and Pose Graph Spectral Analysis for Robot ExplorationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

Rongge Zhang

Haechan Mark Bong

Giovanni Beltrame

369

27 Aug 2024

InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular DepthBritish Machine Vision Conference (BMVC), 2024

347

25 Aug 2024

OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding

424

20 Aug 2024

Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration

Hao Ai

Lin Wang

177

18 Aug 2024

VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability MapsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

Senthil Hariharan Arul

Xuewei

Dinesh Manocha

15 Aug 2024

DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions

Komei Sugiura

258

15 Aug 2024

Structure-preserving Planar Simplification for Indoor Environments

211

13 Aug 2024

Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces

260

12 Aug 2024

UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation

265

08 Aug 2024

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

...

Jifeng Dai

257

05 Aug 2024

NOLO: Navigate Only Look Once

323

02 Aug 2024

Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments

339

31 Jul 2024

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

270

26 Jul 2024

Deep Spherical Superpixels

Rémi Giraud

Michael Clement

MDE

403

24 Jul 2024

Navigation Instruction Generation with BEV Perception and Large Language Models

263

21 Jul 2024

Self-training Room Layout Estimation via Geometry-aware Ray-casting

130

21 Jul 2024

VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight

521

20 Jul 2024

MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References

Lukas Bosiger

Mihai Dusmanu

Marc Pollefeys

Z. Bauer

190

18 Jul 2024

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

354

18 Jul 2024

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

Pengfei Wang

Yuxi Wang

Shuai Li

Zhaoxiang Zhang

Zhen Lei

Lei Zhang

239

18 Jul 2024

GenRC: Generative 3D Room Completion from Sparse Image Collections

337

17 Jul 2024

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Qi Wu

312

17 Jul 2024

GRUtopia: Dream General Robots in a City at Scale

...

Yu Qiao

Dahua Lin

Jiangmiao Pang

LM&Ro VGen

335

15 Jul 2024

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Lei Zhang

244

13 Jul 2024

Semantic UV mapping to improve texture inpainting for indoor scenes

J. Vermandere

M. Bassier

M. Vergauwen

217

12 Jul 2024