Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1709.06158
Cited By

Matterport3D: Learning from RGB-D Data in Indoor Environments

Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017

Thomas Funkhouser

Matthias Nießner

Shuran Song

ArXiv (abs)PDF HTML

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,327 papers shown

SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)

SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)Computers & graphics (Comput. Graph.), 2025

Viet-Tham Huynh

Quang-Thuc Nguyen

...

Trung-Truc Huynh-Le

Minh-Triet Tran

108

1

0

12 Aug 2025

ASAudio: A Survey of Advanced Spatial Audio Research

ASAudio: A Survey of Advanced Spatial Audio Research

199

3

0

08 Aug 2025

Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces

Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces

Vebjørn Haug Kåsene

107

0

0

04 Aug 2025

How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes

Mahnoor Fatima Saad

90

1

0

04 Aug 2025

Glass Surface Segmentation with an RGB-D Camera via Weighted Feature Fusion for Service Robots

Glass Surface Segmentation with an RGB-D Camera via Weighted Feature Fusion for Service Robots

Anastasia Ioannou

138

2

0

03 Aug 2025

VPN: Visual Prompt Navigation

VPN: Visual Prompt Navigation

Shuaiqiang Wang

258

0

0

03 Aug 2025

ContestTrade: A Multi-Agent Trading System Based on Internal Contest Mechanism

ContestTrade: A Multi-Agent Trading System Based on Internal Contest Mechanism

219

1

0

01 Aug 2025

Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion

Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion

230

2

0

31 Jul 2025

Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques

Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques

226

3

0

30 Jul 2025

A Two-Stage Lightweight Framework for Efficient Land-Air Bimodal Robot Autonomous Navigation

A Two-Stage Lightweight Framework for Efficient Land-Air Bimodal Robot Autonomous Navigation

146

0

0

30 Jul 2025

Recursive Visual Imagination and Adaptive Linguistic Grounding for Vision Language Navigation

Recursive Visual Imagination and Adaptive Linguistic Grounding for Vision Language Navigation

110

0

0

29 Jul 2025

LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments

LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments

124

0

0

29 Jul 2025

DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments

DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments

...

157

10

0

29 Jul 2025

Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View

Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View

Suranjan Gautam

205

0

0

28 Jul 2025

Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting

Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting

202

4

0

24 Jul 2025

Sparse-View 3D Reconstruction: Recent Advances and Open Challenges

Sparse-View 3D Reconstruction: Recent Advances and Open Challenges

201

1

0

22 Jul 2025

Scanning Bot: Efficient Scan Planning using Panoramic Cameras

Scanning Bot: Efficient Scan Planning using Panoramic Cameras

203

1

0

22 Jul 2025

Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey

Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey

...

Hanspeter Pfister

Fangneng Zhan

641

8

0

19 Jul 2025

X-Nav: Learning End-to-End Cross-Embodiment Navigation for Mobile Robots

X-Nav: Learning End-to-End Cross-Embodiment Navigation for Mobile RobotsIEEE Robotics and Automation Letters (IEEE RA-L), 2025

232

4

0

19 Jul 2025

Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities

Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities

216

3

0

17 Jul 2025

Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation

Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation

...

181

3

0

15 Jul 2025

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

213

6

0

10 Jul 2025

NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments

NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments

241

14

0

30 Jun 2025

General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting

General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting

Shehryar Khattak

Mykel J. Kochenderfer

Georgios Georgakis

167

1

0

20 Jun 2025

VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning

VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning

Hengshuang Zhao

380

28

0

20 Jun 2025

Efficient and Generalizable Environmental Understanding for Visual Navigation

Efficient and Generalizable Environmental Understanding for Visual Navigation

247

0

0

18 Jun 2025

Uncertainty-Informed Active Perception for Open Vocabulary Object Goal Navigation

Uncertainty-Informed Active Perception for Open Vocabulary Object Goal NavigationEuropean Conference on Mobile Robots (ECMR), 2025

Cyrill Stachniss

Marija Popović

272

0

0

16 Jun 2025

LEO-VL: Efficient Scene Representation for Scalable 3D Vision-Language Learning

LEO-VL: Efficient Scene Representation for Scalable 3D Vision-Language Learning

Xiongkun Linghu

...

288

2

0

11 Jun 2025

The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge

313

3

0

11 Jun 2025

A Navigation Framework Utilizing Vision-Language Models

A Navigation Framework Utilizing Vision-Language Models

120

0

0

11 Jun 2025

Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

175

2

0

10 Jun 2025

SpatialLM: Training Large Language Models for Structured Indoor Modeling

SpatialLM: Training Large Language Models for Structured Indoor Modeling

235

19

0

09 Jun 2025

Grounding Beyond Detection: Enhancing Contextual Understanding in Embodied 3D Grounding

Grounding Beyond Detection: Enhancing Contextual Understanding in Embodied 3D Grounding

387

3

0

05 Jun 2025

Defurnishing with X-Ray Vision: Joint Removal of Furniture from Panoramas and Mesh

Defurnishing with X-Ray Vision: Joint Removal of Furniture from Panoramas and Mesh

368

0

0

05 Jun 2025

RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models

RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

278

2

0

03 Jun 2025

Towards In-the-wild 3D Plane Reconstruction from a Single Image

Towards In-the-wild 3D Plane Reconstruction from a Single ImageComputer Vision and Pattern Recognition (CVPR), 2025

Sharon X. Huang

216

5

0

03 Jun 2025

R2SM: Referring and Reasoning for Selective Masks

R2SM: Referring and Reasoning for Selective Masks

Hwann-Tzong Chen

346

0

0

02 Jun 2025

NavBench: Probing Multimodal Large Language Models for Embodied Navigation

NavBench: Probing Multimodal Large Language Models for Embodied Navigation

250

5

0

01 Jun 2025

GRAM: Spatial general-purpose audio representation models for real-world applications

GRAM: Spatial general-purpose audio representation models for real-world applications

Goksenin Yuksel

Marcel van Gerven

Kiki van der Heijden

301

1

0

01 Jun 2025

Understanding while Exploring: Semantics-driven Active Mapping

Understanding while Exploring: Semantics-driven Active Mapping

Philippos Mordohai

280

0

0

30 May 2025

Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts

Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts

Aruni RoyChowdhury

315

2

0

29 May 2025

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

...

262

42

0

29 May 2025

Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration

Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration

481

0

0

29 May 2025

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Chang-Bin Zhang

185

6

0

28 May 2025

3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model

3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model

280

10

0

28 May 2025

DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation

DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation

518

1

0

28 May 2025

OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender

OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender

Toshiki Watanabe

Hwann-Tzong Chen

254

0

0

26 May 2025

GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes

GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes

361

5

0

26 May 2025

SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes

SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes

178

0

0

24 May 2025

Is Single-View Mesh Reconstruction Ready for Robotics?

Is Single-View Mesh Reconstruction Ready for Robotics?

Bernhard Schölkopf

453

2

0

23 May 2025

1 2 3 4 5 6...25 26 27