ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06158
  4. Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments

Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
    3DV
    3DPC
ArXivPDFHTML

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,167 papers shown
Title
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
Wenting Xu
Viorela Ila
Luping Zhou
Craig T. Jin
82
0
0
07 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG
LM&Ro
MLLM
LRM
83
1
0
05 Dec 2024
Multi-view Image Diffusion via Coordinate Noise and Fourier Attention
Multi-view Image Diffusion via Coordinate Noise and Fourier Attention
Justin D. Theiss
Norman Müller
Daeil Kim
Aayush Prakash
76
0
0
04 Dec 2024
Hijacking Vision-and-Language Navigation Agents with Adversarial
  Environmental Attacks
Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks
Zijiao Yang
Xiangxi Shi
Eric Slyman
Stefan Lee
AAML
84
1
0
03 Dec 2024
AdaVLN: Towards Visual Language Navigation in Continuous Indoor
  Environments with Moving Humans
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
Dillon Loh
Tomasz Bednarz
Xinxing Xia
Frank Guan
83
0
0
27 Nov 2024
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Mehdi Zayene
Jannik Endres
Albias Havolli
Charles Corbière
Salim Cherkaoui
Alexandre Kontouli
Alexandre Alahi
MDE
153
1
0
27 Nov 2024
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Xinhao Liu
Jiajian Li
Yichen Jiang
Niranjan Sujay
Zheng Yang
Juexiao Zhang
John Abanes
Jing Zhang
Chen Feng
116
2
0
26 Nov 2024
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Stuti Pathak
Prashant Kumar
Nicholus Mboga
Gunther Steenackers
R. Penne
Rudi Penne
269
0
0
26 Nov 2024
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Chan Hee Song
Valts Blukis
Jonathan Tremblay
Stephen Tyree
Yu-Chuan Su
Stan Birchfield
101
8
0
25 Nov 2024
TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation
TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation
Linqing Zhong
Chen Gao
Zihan Ding
Yue Liao
Si Liu
Shifeng Zhang
Xu Zhou
Si Liu
LRM
98
4
0
25 Nov 2024
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D
  Point Cloud Semantic Segmentation
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation
Umamaheswaran Raman Kumar
A. Fayjie
Jurgen Hannaert
Patrick Vandewalle
3DV
3DPC
85
1
0
20 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
81
3
0
18 Nov 2024
The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual
  Localisation, Reconstruction and Radiance Field Methods
The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods
Yifu Tao
Miguel Ángel Muñoz-Bañón
Lintong Zhang
Jiahao Wang
L. Fu
Maurice F. Fallon
44
5
0
15 Nov 2024
VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal
  Transparent Surface Reconstruction in Indoor Scenes
VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes
A. Sethuraman
Onur Bagoren
Harikrishnan Seetharaman
Dalton Richardson
Joseph Taylor
Katherine A. Skinner
3DV
45
0
0
07 Nov 2024
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
Xi Yang
Xu Gu
Xingyilang Yin
Xinbo Gao
47
0
0
06 Nov 2024
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
Haochen Zhang
Nader Zantout
Pujith Kachana
Zongyuan Wu
Ji Zhang
Wenshan Wang
3DV
LM&Ro
49
5
0
05 Nov 2024
Deep Learning on 3D Semantic Segmentation: A Detailed Review
Deep Learning on 3D Semantic Segmentation: A Detailed Review
Thodoris Betsas
Andreas Georgopoulos
Anastasios Doulamis
Pierre Grussenmeyer
3DV
3DPC
49
1
0
04 Nov 2024
Multi-task Geometric Estimation of Depth and Surface Normal from
  Monocular 360° Images
Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images
Kun Huang
Fang-Lue Zhang
Fangfang Zhang
Yu-Kun Lai
Paul L. Rosin
N. Dodgson
42
0
0
04 Nov 2024
CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality
CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality
Yiqin Zhao
Mallesham Dasari
Tian Guo
58
0
0
04 Nov 2024
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D
  Plane Reconstruction
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
Wang Zhao
Jiachen Liu
Sheng Zhang
Heng Chang
Sili Chen
S. X. Huang
Yong-Jin Liu
Hengkai Guo
44
0
0
02 Nov 2024
DiffPano: Scalable and Consistent Text to Panorama Generation with
  Spherical Epipolar-Aware Diffusion
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Weicai Ye
Chenhao Ji
Zheng Chen
Junyao Gao
Xiaoshui Huang
Song-Hai Zhang
Wanli Ouyang
Tong He
Cairong Zhao
Guofeng Zhang
41
8
0
31 Oct 2024
Deep Learning for 3D Point Cloud Enhancement: A Survey
Deep Learning for 3D Point Cloud Enhancement: A Survey
Siwen Quan
Junhao Yu
Ziming Nie
Muze Wang
Sijia Feng
Pei An
Jiaqi Yang
3DPC
55
3
0
30 Oct 2024
SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark
SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark
Hyunjun Jung
Weihang Li
Shun-cheng Wu
William Bittner
Nikolas Brasch
...
Eduardo Pérez-Pellitero
Zhensong Zhang
Arthur Moreau
Nassir Navab
Benjamin Busam
58
1
0
30 Oct 2024
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents
Junting Chen
Checheng Yu
Xunzhe Zhou
Tianqi Xu
Yao Mu
Mengkang Hu
Wenqi Shao
Yuran Wang
Ge Li
Lin Shao
76
4
0
30 Oct 2024
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian
  Splatting
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting
Yongqian Li
Zijia Kuang
Ting Li
Guyue Zhou
Shaohui Zhang
Zike Yan
3DGS
38
5
0
29 Oct 2024
ANAVI: Audio Noise Awareness using Visuals of Indoor environments for
  NAVIgation
ANAVI: Audio Noise Awareness using Visuals of Indoor environments for NAVIgation
Vidhi Jain
Rishi Veerapaneni
Yonatan Bisk
46
0
0
24 Oct 2024
Scale Propagation Network for Generalizable Depth Completion
Scale Propagation Network for Generalizable Depth Completion
Haotian Wang
Meng Yang
Xinhu Zheng
Gang Hua
31
2
0
24 Oct 2024
PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment
  Anything Model
PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model
Zhongchen Deng
Zhechen Yang
Chi Chen
Cheng Zeng
Yan Meng
Bisheng Yang
22
1
0
21 Oct 2024
Synergistic Dual Spatial-aware Generation of Image-to-Text and
  Text-to-Image
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image
Yu Zhao
Hao Fei
Xiangtai Li
L. Qin
Jiayi Ji
Erik Cambria
Meishan Zhang
Hao Fei
Jianguo Wei
DiffM
34
1
0
20 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Yue Yang
45
3
0
18 Oct 2024
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
69
4
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video
  Segmentation
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
52
0
0
16 Oct 2024
3D Gaussian Splatting in Robotics: A Survey
3D Gaussian Splatting in Robotics: A Survey
Siting Zhu
Guangming Wang
Dezhi Kong
Hesheng Wang
3DGS
57
7
0
16 Oct 2024
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
Joey Wilson
Ruihan Xu
Yile Sun
Parker Ewen
Minghan Zhu
Kira Barton
Maani Ghaffari
43
0
0
15 Oct 2024
ImagineNav: Prompting Vision-Language Models as Embodied Navigator
  through Scene Imagination
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination
Xinxin Zhao
Wenzhe Cai
Likun Tang
Teng Wang
LM&Ro
45
3
0
13 Oct 2024
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object
  Navigation
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Hang Yin
Xiuwei Xu
Zhenyu Wu
Jie Zhou
Jiwen Lu
48
14
0
10 Oct 2024
Automated Creation of Digital Cousins for Robust Policy Learning
Automated Creation of Digital Cousins for Robust Policy Learning
Tianyuan Dai
Josiah Wong
Yunfan Jiang
Chen Wang
Cem Gokmen
Ruohan Zhang
Jiajun Wu
Li Fei-Fei
36
22
0
09 Oct 2024
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark,
  and Methodology
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology
Xinyu Wang
Donglin Yang
Ziqin Wang
Hohin Kwan
Jinyu Chen
Wenjun Wu
Hongsheng Li
Yue Liao
Si Liu
29
14
0
09 Oct 2024
3D Representation Methods: A Survey
3D Representation Methods: A Survey
Zhengren Wang
3DGS
32
3
0
09 Oct 2024
CUBE360: Learning Cubic Field Representation for Monocular 360 Depth
  Estimation for Virtual Reality
CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality
Wenjie Chang
Hao Ai
Tianzhu Zhang
Lin Wang
MDE
26
0
0
08 Oct 2024
Diffusion Models in 3D Vision: A Survey
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Xue Liu
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
72
4
0
07 Oct 2024
LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation
LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation
Jianhao Jiao
Jinhao He
Changkun Liu
Sebastian Aegidius
Xiangcheng Hu
Tristan Braud
Dimitrios Kanoulas
59
4
0
06 Oct 2024
Semantic Environment Atlas for Object-Goal Navigation
Semantic Environment Atlas for Object-Goal Navigation
Nuri Kim
Jeongho Park
Mineui Hong
Songhwai Oh
33
0
0
05 Oct 2024
The Wallpaper is Ugly: Indoor Localization using Vision and Language
The Wallpaper is Ugly: Indoor Localization using Vision and Language
Seth Pate
Lawson L. S. Wong
41
0
0
04 Oct 2024
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes
  and Objects
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Ye Tian
Yue Yang
Kaixin Ma
Xiaoman Pan
Yangqiu Song
Dong Yu
LM&Ro
55
3
0
03 Oct 2024
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Mohan Xu
Kai Li
Guo Chen
Xiaolin Hu
51
0
0
02 Oct 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
44
2
0
02 Oct 2024
Find Everything: A General Vision Language Model Approach to Multi-Object Search
Find Everything: A General Vision Language Model Approach to Multi-Object Search
Daniel Choi
Angus Fung
Haitong Wang
Aaron Hao Tan
65
3
0
01 Oct 2024
Active Neural Mapping at Scale
Active Neural Mapping at Scale
Zijia Kuang
Zike Yan
Hao Zhao
Guyue Zhou
Hongbin Zha
30
3
0
30 Sep 2024
Grounding 3D Scene Affordance From Egocentric Interactions
Grounding 3D Scene Affordance From Egocentric Interactions
Cuiyu Liu
Wei Zhai
Yuhang Yang
Hongchen Luo
Sen Liang
Yang Cao
Zheng-Jun Zha
42
1
0
29 Sep 2024
Previous
123456...222324
Next