ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06158
  4. Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments

Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
    3DV
    3DPC
ArXivPDFHTML

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,167 papers shown
Title
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Yanyuan Qiao
Wenqi Lyu
Hui Wang
Zixu Wang
Zerui Li
Yuan Zhang
Mingkui Tan
Qi Wu
LRM
43
4
0
27 Sep 2024
HGS-Planner: Hierarchical Planning Framework for Active Scene
  Reconstruction Using 3D Gaussian Splatting
HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting
Zijun Xu
Rui Jin
Ke Wu
Yi Zhao
Zhiwei Zhang
Jieru Zhao
Fei Gao
Zhongxue Gan
Wenchao Ding
50
4
0
26 Sep 2024
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language
  Navigation
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation
Zehao Wang
Minye Wu
Yixin Cao
Yubo Ma
Meiqi Chen
Tinne Tuytelaars
43
1
0
25 Sep 2024
Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with
  Large Language Models
Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models
Mike Zhang
Kaixian Qu
Vaishakh Patil
Cesar Cadena
Marco Hutter
LM&Ro
3DV
41
4
0
23 Sep 2024
Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras
Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras
Ming Li
Xueqian Jin
Xuejiao Hu
Jinghao Cao
S. Du
Yang Li
MDE
48
0
0
23 Sep 2024
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal
  Navigation
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation
Naoki Yokoyama
Ram Ramrakhya
Abhishek Das
Dhruv Batra
Sehoon Ha
38
10
0
22 Sep 2024
From Cognition to Precognition: A Future-Aware Framework for Social Navigation
From Cognition to Precognition: A Future-Aware Framework for Social Navigation
Zeying Gong
Tianshuai Hu
Ronghe Qiu
Junwei Liang
167
0
0
20 Sep 2024
Navigation with VLM framework: Go to Any Language
Navigation with VLM framework: Go to Any Language
Zecheng Yin
Chonghao Cheng
Lizhen
LM&Ro
32
0
0
18 Sep 2024
Online Diffusion-Based 3D Occupancy Prediction at the Frontier with
  Probabilistic Map Reconciliation
Online Diffusion-Based 3D Occupancy Prediction at the Frontier with Probabilistic Map Reconciliation
Alec Reed
Lorin Achey
Brendan Crowe
Bradley Hayes
Christoffer Heckman
35
0
0
16 Sep 2024
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene
  Graph for Robot Navigation
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Yifan Xu
Ziming Luo
Qianwei Wang
Vineet Kamat
Carol Menassa
3DV
3DPC
38
0
0
16 Sep 2024
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction
  Generation
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation
Muraleekrishna Gopinathan
Martin Masek
Jumana Abu-Khalaf
David Suter
LM&Ro
41
1
0
09 Sep 2024
Introducing a Class-Aware Metric for Monocular Depth Estimation: An
  Automotive Perspective
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader
Leon Eisemann
Adrian Pogorzelski
Namrata Jangid
Attila B. Kis
51
0
0
06 Sep 2024
Estimating Indoor Scene Depth Maps from Ultrasonic Echoes
Estimating Indoor Scene Depth Maps from Ultrasonic Echoes
Junpei Honma
Akisato Kimura
Go Irie
MDE
43
0
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-xiong Wang
78
15
0
05 Sep 2024
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot
  Exploration
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration
Rongge Zhang
Haechan Mark Bong
Giovanni Beltrame
57
1
0
27 Aug 2024
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type
  Performance in Indoor Monocular Depth
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth
Cho-Ying Wu
Quankai Gao
Chin-Cheng Hsu
Te-Lin Wu
Jing-Wen Chen
Ulrich Neumann
MDE
37
0
0
25 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
64
1
0
20 Aug 2024
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion
  and Cross-task Collaboration
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration
Hao Ai
Lin Wang
40
0
0
18 Aug 2024
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object
  Localization Probability Maps
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps
Senthil Hariharan Arul
Dhruva Kumar
Vivek Sugirtharaj
Richard Kim
Xuewei
Qi
R. Madhivanan
Arnie Sen
Dinesh Manocha
23
1
0
15 Aug 2024
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles
  Based on Open-Vocabulary Instructions
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions
Ryosuke Korekata
Kanta Kaneda
Shunya Nagashima
Yuto Imai
Komei Sugiura
ObjD
LM&Ro
53
2
0
15 Aug 2024
Structure-preserving Planar Simplification for Indoor Environments
Structure-preserving Planar Simplification for Indoor Environments
Bishwash Khanal
Sanjay Rijal
Manish Awale
V. Ojha
3DPC
35
0
0
13 Aug 2024
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Junrui Zhang
Jiaqi Li
Yachuan Huang
Yiran Wang
Jinghong Zheng
Liao Shen
Z. Cao
MDE
39
3
0
12 Aug 2024
UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation
UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
37
1
0
08 Aug 2024
MMIU: Multimodal Multi-image Understanding for Evaluating Large
  Vision-Language Models
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng
Jun Wang
Chuanhao Li
Quanfeng Lu
Hao Tian
...
Jifeng Dai
Ping Luo
Ping Luo
Kaipeng Zhang
Wenqi Shao
VLM
60
18
0
05 Aug 2024
NOLO: Navigate Only Look Once
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
61
1
0
02 Aug 2024
Navigating Beyond Instructions: Vision-and-Language Navigation in
  Obstructed Environments
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
51
2
0
31 Jul 2024
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic
  Environments
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments
Taewoong Kim
Cheolhong Min
Byeonghwi Kim
Jinyeon Kim
Wonje Jeung
Jonghyun Choi
LM&Ro
44
5
0
26 Jul 2024
Deep Spherical Superpixels
Deep Spherical Superpixels
Rémi Giraud
Michael Clement
MDE
61
0
0
24 Jul 2024
Navigation Instruction Generation with BEV Perception and Large Language
  Models
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
50
5
0
21 Jul 2024
Self-training Room Layout Estimation via Geometry-aware Ray-casting
Self-training Room Layout Estimation via Geometry-aware Ray-casting
Bolivar Solarte
Chin-Hsuan Wu
Jin-Cheng Jhang
Jonathan Lee
Yi-Hsuan Tsai
Min Sun
SSL
34
2
0
21 Jul 2024
VisFly: An Efficient and Versatile Simulator for Training Vision-based
  Flight
VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
53
3
0
20 Jul 2024
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby
  References
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
Lukas Bosiger
Mihai Dusmanu
Marc Pollefeys
Z. Bauer
59
0
0
18 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion
  Models
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
67
4
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided
  Self-Distillation
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
52
3
0
18 Jul 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-feng Li
Yueh-Feng Ku
Hong-Xuan Yen
Chi Liu
Yu-Lun Liu
Albert Y. C. Chen
Cheng-Hao Kuo
Min Sun
3DV
VGen
62
4
0
17 Jul 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large
  Vision-Language Models
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
50
19
0
17 Jul 2024
GRUtopia: Dream General Robots in a City at Scale
GRUtopia: Dream General Robots in a City at Scale
Hanqing Wang
Jiahe Chen
Wensi Huang
Qingwei Ben
Tai Wang
...
Ying Zhao
Zhongying Tu
Yu Qiao
Dahua Lin
Jiangmiao Pang
LM&Ro
VGen
60
16
0
15 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
47
6
0
13 Jul 2024
Semantic UV mapping to improve texture inpainting for indoor scenes
Semantic UV mapping to improve texture inpainting for indoor scenes
J. Vermandere
M. Bassier
M. Vergauwen
59
1
0
12 Jul 2024
UNRealNet: Learning Uncertainty-Aware Navigation Features from
  High-Fidelity Scans of Real Environments
UNRealNet: Learning Uncertainty-Aware Navigation Features from High-Fidelity Scans of Real Environments
S. Triest
David D. Fan
Sebastian Scherer
Ali-Akbar Agha-Mohammadi
45
4
0
11 Jul 2024
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints
Rui Yin
Yulun Zhang
Zherong Pan
Jianjun Zhu
Cheng Wang
Biao Jia
44
1
0
11 Jul 2024
Fusion of Short-term and Long-term Attention for Video Mirror Detection
Fusion of Short-term and Long-term Attention for Video Mirror Detection
Mingchen Xu
Jing Wu
Yukun Lai
Ze Ji
37
1
0
10 Jul 2024
Controlling Space and Time with Diffusion Models
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
73
27
0
10 Jul 2024
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion
  Model
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
Xiaoding Yuan
Shitao Tang
Kejie Li
Alan Yuille
Peng Wang
DiffM
39
3
0
09 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on
  Embodied AI
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Yang Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
56
52
0
09 Jul 2024
Affordances-Oriented Planning using Foundation Models for Continuous
  Vision-Language Navigation
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
57
10
0
08 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene
  Synthesis
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
46
6
0
07 Jul 2024
Open Panoramic Segmentation
Open Panoramic Segmentation
Junwei Zheng
Ruiping Liu
Yufan Chen
Kunyu Peng
Chengzhi Wu
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
VLM
47
8
0
02 Jul 2024
Object Segmentation from Open-Vocabulary Manipulation Instructions Based
  on Optimal Transport Polygon Matching with Multimodal Foundation Models
Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Takayuki Nishimura
Katsuyuki Kuyo
Motonari Kambara
Komei Sugiura
DiffM
35
0
0
01 Jul 2024
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
Hieu T. Nguyen
Yiwen Chen
Vikram S. Voleti
Varun Jampani
Huaizu Jiang
61
0
0
28 Jun 2024
Previous
12345...222324
Next