ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06158
  4. Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments

Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
    3DV3DPC
ArXiv (abs)PDFHTML

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,327 papers shown
LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation
LiteVLoc: Map-Lite Visual Localization for Image Goal NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024
Jianhao Jiao
Jinhao He
Changkun Liu
Sebastian Aegidius
Xiangcheng Hu
Tristan Braud
Dimitrios Kanoulas
267
5
0
06 Oct 2024
Semantic Environment Atlas for Object-Goal Navigation
Semantic Environment Atlas for Object-Goal NavigationKnowledge-Based Systems (KBS), 2024
Nuri Kim
Jeongho Park
Mineui Hong
Songhwai Oh
233
2
0
05 Oct 2024
The Wallpaper is Ugly: Indoor Localization using Vision and Language
The Wallpaper is Ugly: Indoor Localization using Vision and LanguageIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Seth Pate
Lawson L. S. Wong
215
4
0
04 Oct 2024
DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes
DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Ye Tian
Yue Yang
Kaixin Ma
Xiaoman Pan
Yangqiu Song
Dong Yu
LM&Ro
416
4
0
03 Oct 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
SonicSim: A customizable simulation platform for speech processing in moving sound source scenariosInternational Conference on Learning Representations (ICLR), 2024
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
301
8
0
02 Oct 2024
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech SeparationInternational Conference on Learning Representations (ICLR), 2024
Mohan Xu
Kai Li
Guo Chen
Xiaolin Hu
217
9
0
02 Oct 2024
Find Everything: A General Vision Language Model Approach to Multi-Object Search
Find Everything: A General Vision Language Model Approach to Multi-Object Search
Daniel Choi
Angus Fung
Haitong Wang
Aaron Hao Tan
471
6
0
01 Oct 2024
Active Neural Mapping at Scale
Active Neural Mapping at ScaleIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Zijia Kuang
Zike Yan
Hao Zhao
Guyue Zhou
Hongbin Zha
193
6
0
30 Sep 2024
Grounding 3D Scene Affordance From Egocentric Interactions
Grounding 3D Scene Affordance From Egocentric Interactions
Cuiyu Liu
Wei Zhai
Yuhang Yang
Hongchen Luo
Sen Liang
Yang Cao
Zheng-Jun Zha
386
3
0
29 Sep 2024
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMsIEEE International Conference on Robotics and Automation (ICRA), 2024
Yanyuan Qiao
Wenqi Lyu
Hui Wang
Zixu Wang
Zerui Li
Yuan Zhang
Zhuliang Yu
Qi Wu
LRM
339
31
0
27 Sep 2024
HGS-Planner: Hierarchical Planning Framework for Active Scene
  Reconstruction Using 3D Gaussian Splatting
HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian SplattingIEEE International Conference on Robotics and Automation (ICRA), 2024
Zijun Xu
Rui Jin
Ke Wu
Yi Zhao
Zhiwei Zhang
Jieru Zhao
Fei Gao
Zhongxue Gan
Wenchao Ding
235
14
0
26 Sep 2024
RT-GuIDE: Real-Time Gaussian Splatting for Information-Driven Exploration
RT-GuIDE: Real-Time Gaussian Splatting for Information-Driven ExplorationIEEE Robotics and Automation Letters (RA-L), 2024
Yuezhan Tao
Dexter Ong
Varun Murali
Igor Spasojevic
Pratik Chaudhari
Vijay Kumar
3DGS
435
12
0
26 Sep 2024
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language
  Navigation
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language NavigationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zehao Wang
Minye Wu
Yixin Cao
Yubo Ma
Meiqi Chen
Tinne Tuytelaars
173
5
0
25 Sep 2024
Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with
  Large Language Models
Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language ModelsConference on Robot Learning (CoRL), 2024
Mike Zhang
Kaixian Qu
Vaishakh Patil
Cesar Cadena
Marco Hutter
LM&Ro3DV
309
9
0
23 Sep 2024
Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras
Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras
Ming Li
Xueqian Jin
Xuejiao Hu
Jinghao Cao
S. Du
Yang Li
MDE
510
0
0
23 Sep 2024
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal
  Navigation
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal NavigationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Naoki Yokoyama
Ram Ramrakhya
Abhishek Das
Dhruv Batra
Sehoon Ha
243
42
0
22 Sep 2024
From Cognition to Precognition: A Future-Aware Framework for Social Navigation
From Cognition to Precognition: A Future-Aware Framework for Social NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024
Zeying Gong
Tianshuai Hu
Ronghe Qiu
Junwei Liang
835
9
0
20 Sep 2024
Navigation with VLM framework: Towards Going to Any Language
Navigation with VLM framework: Towards Going to Any Language
Zecheng Yin
Chonghao Cheng
Lizhen
Zhen Li
LM&Ro
432
3
0
18 Sep 2024
Online Diffusion-Based 3D Occupancy Prediction at the Frontier with
  Probabilistic Map Reconciliation
Online Diffusion-Based 3D Occupancy Prediction at the Frontier with Probabilistic Map ReconciliationIEEE International Conference on Robotics and Automation (ICRA), 2024
Alec Reed
Lorin Achey
Brendan Crowe
Bradley Hayes
Christoffer Heckman
272
3
0
16 Sep 2024
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene
  Graph for Robot Navigation
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024
Yifan Xu
Ziming Luo
Qianwei Wang
Vineet Kamat
Carol Menassa
3DV3DPC
166
6
0
16 Sep 2024
Automatic Scene Generation: State-of-the-Art Techniques, Models,
  Datasets, Challenges, and Future Prospects
Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future ProspectsIEEE Access (IEEE Access), 2024
Awal Ahmed Fime
Saifuddin Mahmud
Arpita Das
Md. Sunzidul Islam
Hong-Hoon Kim
VGen3DV
273
2
0
14 Sep 2024
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction
  Generation
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Muraleekrishna Gopinathan
Martin Masek
Jumana Abu-Khalaf
David Suter
LM&Ro
213
3
0
09 Sep 2024
Introducing a Class-Aware Metric for Monocular Depth Estimation: An
  Automotive Perspective
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader
Leon Eisemann
Adrian Pogorzelski
Namrata Jangid
Attila B. Kis
325
0
0
06 Sep 2024
Estimating Indoor Scene Depth Maps from Ultrasonic Echoes
Estimating Indoor Scene Depth Maps from Ultrasonic EchoesInternational Conference on Information Photonics (ICIP), 2024
Junpei Honma
Akisato Kimura
Go Irie
MDE
226
1
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene UnderstandingNeural Information Processing Systems (NeurIPS), 2024
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-Xiong Wang
532
32
0
05 Sep 2024
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot
  Exploration
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot ExplorationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Rongge Zhang
Haechan Mark Bong
Giovanni Beltrame
369
4
0
27 Aug 2024
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type
  Performance in Indoor Monocular Depth
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular DepthBritish Machine Vision Conference (BMVC), 2024
Cho-Ying Wu
Quankai Gao
Chin-Cheng Hsu
Te-Lin Wu
Jing-Wen Chen
Ulrich Neumann
MDE
347
0
0
25 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
424
4
0
20 Aug 2024
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion
  and Cross-task Collaboration
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration
Hao Ai
Lin Wang
177
2
0
18 Aug 2024
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object
  Localization Probability Maps
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability MapsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Senthil Hariharan Arul
Dhruva Kumar
Vivek Sugirtharaj
Richard Kim
Xuewei
Qi
R. Madhivanan
Arnie Sen
Dinesh Manocha
89
2
0
15 Aug 2024
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles
  Based on Open-Vocabulary Instructions
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions
Ryosuke Korekata
Kanta Kaneda
Shunya Nagashima
Yuto Imai
Komei Sugiura
ObjDLM&Ro
258
3
0
15 Aug 2024
Structure-preserving Planar Simplification for Indoor Environments
Structure-preserving Planar Simplification for Indoor Environments
Bishwash Khanal
Sanjay Rijal
Manish Awale
V. Ojha
3DPC
211
0
0
13 Aug 2024
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Junrui Zhang
Jiaqi Li
Yachuan Huang
Yiran Wang
Jinghong Zheng
Liao Shen
Z. Cao
MDE
260
5
0
12 Aug 2024
UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation
UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
265
2
0
08 Aug 2024
MMIU: Multimodal Multi-image Understanding for Evaluating Large
  Vision-Language Models
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng
Jun Wang
Chuanhao Li
Quanfeng Lu
Hao Tian
...
Jifeng Dai
Ping Luo
Ping Luo
Kaipeng Zhang
Wenqi Shao
VLM
257
47
0
05 Aug 2024
NOLO: Navigate Only Look Once
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
323
2
0
02 Aug 2024
Navigating Beyond Instructions: Vision-and-Language Navigation in
  Obstructed Environments
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
339
7
0
31 Jul 2024
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic
  Environments
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments
Taewoong Kim
Cheolhong Min
Byeonghwi Kim
Jinyeon Kim
Wonje Jeung
Jonghyun Choi
LM&Ro
270
13
0
26 Jul 2024
Deep Spherical Superpixels
Deep Spherical Superpixels
Rémi Giraud
Michael Clement
MDE
403
0
0
24 Jul 2024
Navigation Instruction Generation with BEV Perception and Large Language
  Models
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
263
20
0
21 Jul 2024
Self-training Room Layout Estimation via Geometry-aware Ray-casting
Self-training Room Layout Estimation via Geometry-aware Ray-casting
Bolivar Solarte
Chin-Hsuan Wu
Jin-Cheng Jhang
Jonathan Lee
Yi-Hsuan Tsai
Min Sun
SSL
130
4
0
21 Jul 2024
VisFly: An Efficient and Versatile Simulator for Training Vision-based
  Flight
VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
521
6
0
20 Jul 2024
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby
  References
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
Lukas Bosiger
Mihai Dusmanu
Marc Pollefeys
Z. Bauer
190
1
0
18 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion
  Models
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
354
11
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided
  Self-Distillation
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
239
10
0
18 Jul 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-feng Li
Yueh-Feng Ku
Hong-Xuan Yen
Chi Liu
Yu-Lun Liu
Albert Y. C. Chen
Cheng-Hao Kuo
Min Sun
3DVVGen
337
9
0
17 Jul 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large
  Vision-Language Models
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
312
74
0
17 Jul 2024
GRUtopia: Dream General Robots in a City at Scale
GRUtopia: Dream General Robots in a City at Scale
Hanqing Wang
Jiahe Chen
Wensi Huang
Qingwei Ben
Tai Wang
...
Ying Zhao
Zhongying Tu
Yu Qiao
Dahua Lin
Jiangmiao Pang
LM&RoVGen
335
43
0
15 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DVVLM
244
11
0
13 Jul 2024
Semantic UV mapping to improve texture inpainting for indoor scenes
Semantic UV mapping to improve texture inpainting for indoor scenes
J. Vermandere
M. Bassier
M. Vergauwen
217
2
0
12 Jul 2024
Previous
123...678...252627
Next