ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06158
  4. Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments

Matterport3D: Learning from RGB-D Data in Indoor Environments

18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
    3DV3DPC
ArXiv (abs)PDFHTML

Papers citing "Matterport3D: Learning from RGB-D Data in Indoor Environments"

50 / 1,327 papers shown
UNRealNet: Learning Uncertainty-Aware Navigation Features from
  High-Fidelity Scans of Real Environments
UNRealNet: Learning Uncertainty-Aware Navigation Features from High-Fidelity Scans of Real Environments
S. Triest
David D. Fan
Sebastian Scherer
Ali-Akbar Agha-Mohammadi
200
7
0
11 Jul 2024
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints
Rui Yin
Yulun Zhang
Zherong Pan
Jianjun Zhu
Cheng Wang
Biao Jia
302
3
0
11 Jul 2024
Fusion of Short-term and Long-term Attention for Video Mirror Detection
Fusion of Short-term and Long-term Attention for Video Mirror Detection
Mingchen Xu
Jing Wu
Yukun Lai
Ze Ji
158
1
0
10 Jul 2024
Controlling Space and Time with Diffusion Models
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
458
54
0
10 Jul 2024
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion
  Model
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
Xiaoding Yuan
Shitao Tang
Kejie Li
Alan Yuille
Peng Wang
DiffM
221
5
0
09 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on
  Embodied AI
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Zehua Wang
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&RoSyDaAI4CE
619
185
0
09 Jul 2024
Affordances-Oriented Planning using Foundation Models for Continuous
  Vision-Language Navigation
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
380
44
0
08 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene
  Synthesis
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC3DV
268
12
0
07 Jul 2024
Open Panoramic Segmentation
Open Panoramic Segmentation
Junwei Zheng
Ruiping Liu
Yufan Chen
Kunyu Peng
Chengzhi Wu
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
VLM
276
14
0
02 Jul 2024
Object Segmentation from Open-Vocabulary Manipulation Instructions Based
  on Optimal Transport Polygon Matching with Multimodal Foundation Models
Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Takayuki Nishimura
Katsuyuki Kuyo
Motonari Kambara
Komei Sugiura
DiffM
261
1
0
01 Jul 2024
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene
  Understanding
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding
Yifan Tang
Cong Tai
Fangxing Chen
Wanting Zhang
Tao Zhang
Xueping Liu
Yongjin Liu
Long Zeng
250
9
0
28 Jun 2024
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
Hieu T. Nguyen
Yiwen Chen
Vikram S. Voleti
Varun Jampani
Huaizu Jiang
250
6
0
28 Jun 2024
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from
  Sparse Panoramas
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas
John Lambert
Yuguang Li
Ivaylo Boyadzhiev
L. Wixson
Manjunath Narayana
Will Hutchcroft
James Hays
F. Dellaert
S. B. Kang
SLR
156
8
0
27 Jun 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to
  Reality with Dynamic Human Interactions
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
Heng Li
Heng Li
Zhi-Qi Cheng
Yifei Dong
Yuxuan Zhou
Jun-Yan He
Jingdong Sun
Teruko Mitamura
Alexander G. Hauptmann
LM&Ro
268
21
0
27 Jun 2024
360 in the Wild: Dataset for Depth Prediction and View Synthesis
360 in the Wild: Dataset for Depth Prediction and View Synthesis
Kibaek Park
François Rameau
Jaesik Park
In So Kweon
MDE
225
1
0
27 Jun 2024
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for
  Effective-and-Efficient Vision-and-Language Navigation
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Mengjiao Shen
Jingwei Yang
Chengju Liu
Qijun Chen
VLM
330
3
0
25 Jun 2024
Smart Feature is What You Need
Smart Feature is What You Need
Zhaoxin Hu
Keyan Ren
298
0
0
22 Jun 2024
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
325
23
0
20 Jun 2024
Estimating Map Completeness in Robot Exploration
Estimating Map Completeness in Robot Exploration
Matteo Luperto
Marco Maria Ferrara
Giacomo Boracchi
Francesco Amigoni
194
3
0
19 Jun 2024
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective
  Distillation and Unlabeled Data Augmentation
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data AugmentationNeural Information Processing Systems (NeurIPS), 2024
Ning-Hsu Wang
Yu-Lun Liu
MDE
256
27
0
18 Jun 2024
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural
  Generation
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation
Alexander Raistrick
Lingjie Mei
Karhan Kayan
David Yan
Yiming Zuo
...
Meenal Parakh
Stamatis Alexandropoulos
Lahav Lipson
Zeyu Ma
Gaowen Liu
VGenAI4CE
385
79
0
17 Jun 2024
Solving Vision Tasks with Simple Photoreceptors Instead of Cameras
Solving Vision Tasks with Simple Photoreceptors Instead of Cameras
Andrei Atanov
Jiawei Fu
Rishubh Singh
Isabella Yu
Andrew Spielberg
Amir Zamir
167
1
0
17 Jun 2024
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language
  Navigation
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language NavigationConference on Robot Learning (CoRL), 2024
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
LM&Ro
318
28
0
14 Jun 2024
A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion
A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion
Kailai Sun
Zhou Yang
Qianchuan Zhao
3DVViT3DPCMDE
140
2
0
14 Jun 2024
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
Ruiyuan Lyu
Tai Wang
Jingli Lin
Shuai Yang
Xiaohan Mao
...
Runsen Xu
Haifeng Huang
Chenming Zhu
Dahua Lin
Jiangmiao Pang
3DV
350
34
0
13 Jun 2024
Pandora: Towards General World Model with Natural Language Actions and
  Video States
Pandora: Towards General World Model with Natural Language Actions and Video States
Jiannan Xiang
Guangyi Liu
Yi Gu
Qiyue Gao
Yuting Ning
...
Shibo Hao
Yemin Shi
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
VGen
302
67
0
12 Jun 2024
Can Large Language Models Understand Spatial Audio?
Can Large Language Models Understand Spatial Audio?
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
...
Jun Zhang
Lu Lu
Zejun Ma
Yuxuan Wang
Chao Zhang
350
18
0
12 Jun 2024
Hearing Anything Anywhere
Hearing Anything Anywhere
Mason Wang
Ryosuke Sawata
Samuel Clarke
Ruohan Gao
Shangzhe Wu
Jiajun Wu
248
13
0
11 Jun 2024
Demonstrating HumanTHOR: A Simulation Platform and Benchmark for
  Human-Robot Collaboration in a Shared Workspace
Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace
Chenxu Wang
Boyuan Du
Jiaxin Xu
Peiyan Li
Di Guo
Huaping Liu
268
5
0
10 Jun 2024
Multimodal Contextualized Semantic Parsing from Speech
Multimodal Contextualized Semantic Parsing from Speech
Jordan Voas
Raymond Mooney
David Harwath
182
1
0
10 Jun 2024
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks
  with Large Vision-Language Models
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Mengfei Du
Binhao Wu
Zejun Li
Xuanjing Huang
Zhongyu Wei
276
46
0
09 Jun 2024
Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure
Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure
Bowen Dang
Xi Zhao
3DH
211
1
0
09 Jun 2024
I2EDL: Interactive Instruction Error Detection and Localization
I2EDL: Interactive Instruction Error Detection and Localization
Francesco Taioli
Stefano Rosa
A. Castellini
Lorenzo Natale
Alessio Del Bue
Alessandro Farinelli
Marco Cristani
Yiming Wang
282
3
0
07 Jun 2024
Omni6DPose: A Benchmark and Model for Universal 6D Object Pose
  Estimation and Tracking
Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and TrackingEuropean Conference on Computer Vision (ECCV), 2024
Jiyao Zhang
Weiyao Huang
Bo Peng
Mingdong Wu
Fei Hu
Zijian Chen
Bo Zhao
Hao Dong
300
35
0
06 Jun 2024
SelfReDepth: Self-Supervised Real-Time Depth Restoration for
  Consumer-Grade Sensors
SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors
Alexandre Duarte
Francisco Fernandes
João M. Pereira
Catarina Moreira
Jacinto C. Nascimento
Joaquim A. Jorge
MDE
235
3
0
05 Jun 2024
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Dmytro Kuzmenko
N. Shvai
LM&Ro
251
0
0
05 Jun 2024
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Chengzu Li
Caiqi Zhang
Han Zhou
Nigel Collier
Anna Korhonen
Ivan Vulić
LRM
303
46
0
04 Jun 2024
CoNav: A Benchmark for Human-Centered Collaborative Navigation
CoNav: A Benchmark for Human-Centered Collaborative Navigation
Changhao Li
Xinyu Sun
Peihao Chen
Jugang Fan
Zixu Wang
Yanxia Liu
Jinhui Zhu
Chuang Gan
Zhuliang Yu
294
2
0
04 Jun 2024
Why Only Text: Empowering Vision-and-Language Navigation with
  Multi-modal Prompts
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
247
7
0
04 Jun 2024
Teledrive: An Embodied AI based Telepresence System
Teledrive: An Embodied AI based Telepresence System
Snehasis Banerjee
Sayan Paul
R. Roychoudhury
Abhijan Bhattacharya
Chayan Sarkar
A. Sau
Pradip Pramanick
Brojeshwar Bhowmick
332
3
0
01 Jun 2024
PanoNormal: Monocular Indoor 360° Surface Normal Estimation
PanoNormal: Monocular Indoor 360° Surface Normal Estimation
Kun Huang
Fanglue Zhang
N. Dodgson
MDE
228
1
0
29 May 2024
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative
  Warping
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
Junyoung Seo
Kazumi Fukuda
Takashi Shibuya
Takuya Narihira
Naoki Murata
Shoukang Hu
Chieh-Hsin Lai
Seungryong Kim
Yuki Mitsufuji
283
42
0
27 May 2024
Benchmarking General-Purpose In-Context Learning
Benchmarking General-Purpose In-Context Learning
Fan Wang
Chuan Lin
Yang Cao
Yu Kang
526
5
0
27 May 2024
Vision-and-Language Navigation Generative Pretrained Transformer
Vision-and-Language Navigation Generative Pretrained Transformer
Hanlin Wen
LM&Ro
258
0
0
27 May 2024
Estimating Depth of Monocular Panoramic Image with Teacher-Student Model
  Fusing Equirectangular and Spherical Representations
Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations
Jingguo Liu
Yijun Xu
Shigang Li
Jianfeng Li
MDE
234
6
0
27 May 2024
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Kuan-Chih Huang
Xiangtai Li
Lu Qi
Shuicheng Yan
Ming-Hsuan Yang
LRM
376
22
0
27 May 2024
Map-based Modular Approach for Zero-shot Embodied Question Answering
Map-based Modular Approach for Zero-shot Embodied Question Answering
Koya Sakamoto
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
300
6
0
26 May 2024
MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling
MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling
Diwei Huang
Kun-Li Channing Lin
Peihao Chen
Qing Du
Zhuliang Yu
VGen
152
0
0
22 May 2024
CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected
  CRFs
CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs
Zidong Cao
Lin Wang
MDE
219
0
0
19 May 2024
Grounded 3D-LLM with Referent Tokens
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
336
77
0
16 May 2024
Previous
123...789...252627
Next