Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.06158
Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments
18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Matterport3D: Learning from RGB-D Data in Indoor Environments"
50 / 1,167 papers shown
Title
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding
Yifan Tang
Cong Tai
Fangxing Chen
Wanting Zhang
Tao Zhang
Xueping Liu
Yongjin Liu
Long Zeng
29
5
0
28 Jun 2024
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas
John Lambert
Yuguang Li
Ivaylo Boyadzhiev
L. Wixson
Manjunath Narayana
Will Hutchcroft
James Hays
F. Dellaert
S. B. Kang
SLR
32
6
0
27 Jun 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
Minghan Li
Heng Li
Zhi-Qi Cheng
Yifei Dong
Yuxuan Zhou
Jun-Yan He
Qi Dai
Teruko Mitamura
Alexander G. Hauptmann
LM&Ro
45
4
0
27 Jun 2024
360 in the Wild: Dataset for Depth Prediction and View Synthesis
Kibaek Park
François Rameau
Jaesik Park
In So Kweon
MDE
47
0
0
27 Jun 2024
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Mengjiao Shen
Jingwei Yang
Chengju Liu
Qijun Chen
VLM
41
2
0
25 Jun 2024
Smart Feature is What You Need
Zhaoxin Hu
Keyan Ren
40
0
0
22 Jun 2024
CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
47
15
0
20 Jun 2024
Estimating Map Completeness in Robot Exploration
Matteo Luperto
Marco Maria Ferrara
Giacomo Boracchi
Francesco Amigoni
49
1
0
19 Jun 2024
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation
Ning-Hsu Wang
Yu-Lun Liu
MDE
37
4
0
18 Jun 2024
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation
Alexander Raistrick
Lingjie Mei
Karhan Kayan
David Yan
Yiming Zuo
...
Meenal Parakh
Stamatis Alexandropoulos
Lahav Lipson
Zeyu Ma
Jia Deng
VGen
AI4CE
44
21
0
17 Jun 2024
Solving Vision Tasks with Simple Photoreceptors Instead of Cameras
Andrei Atanov
Jiawei Fu
Rishubh Singh
Isabella Yu
Andrew Spielberg
Amir Zamir
54
1
0
17 Jun 2024
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
LM&Ro
45
7
0
14 Jun 2024
A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion
Kailai Sun
Zhou Yang
Qianchuan Zhao
3DV
ViT
3DPC
MDE
32
0
0
14 Jun 2024
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
Ruiyuan Lyu
Tai Wang
Jingli Lin
Shuai Yang
Xiaohan Mao
...
Runsen Xu
Haifeng Huang
Chenming Zhu
Dahua Lin
Jiangmiao Pang
3DV
49
11
0
13 Jun 2024
Pandora: Towards General World Model with Natural Language Actions and Video States
Jiannan Xiang
Guangyi Liu
Yi Gu
Qiyue Gao
Yuting Ning
...
Shibo Hao
Yemin Shi
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
VGen
62
36
0
12 Jun 2024
Can Large Language Models Understand Spatial Audio?
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
...
Jun Zhang
Lu Lu
Zejun Ma
Yuxuan Wang
Chao Zhang
54
4
0
12 Jun 2024
Hearing Anything Anywhere
Mason Wang
Ryosuke Sawata
Samuel Clarke
Ruohan Gao
Shangzhe Wu
Jiajun Wu
41
5
0
11 Jun 2024
Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace
Chenxu Wang
Boyuan Du
Jiaxin Xu
Peiyan Li
Di Guo
Huaping Liu
59
2
0
10 Jun 2024
Multimodal Contextualized Semantic Parsing from Speech
Jordan Voas
Raymond Mooney
David Harwath
56
0
0
10 Jun 2024
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models
Mengfei Du
Binhao Wu
Zejun Li
Xuanjing Huang
Zhongyu Wei
42
11
0
09 Jun 2024
Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure
Bowen Dang
Xi Zhao
3DH
51
0
0
09 Jun 2024
I2EDL: Interactive Instruction Error Detection and Localization
Francesco Taioli
Stefano Rosa
A. Castellini
Lorenzo Natale
Alessio Del Bue
Alessandro Farinelli
Marco Cristani
Yiming Wang
45
2
0
07 Jun 2024
Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking
Jiyao Zhang
Weiyao Huang
Bo Peng
Mingdong Wu
Fei Hu
Zijian Chen
Bo Zhao
Hao Dong
70
13
0
06 Jun 2024
SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors
Alexandre Duarte
Francisco Fernandes
João M. Pereira
Catarina Moreira
Jacinto C. Nascimento
Joaquim A. Jorge
MDE
37
0
0
05 Jun 2024
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Dmytro Kuzmenko
N. Shvai
LM&Ro
38
0
0
05 Jun 2024
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Chengzu Li
Caiqi Zhang
Han Zhou
Nigel Collier
Anna Korhonen
Ivan Vulić
LRM
51
16
0
04 Jun 2024
CoNav: A Benchmark for Human-Centered Collaborative Navigation
Changhao Li
Xinyu Sun
Peihao Chen
Jugang Fan
Zixu Wang
Yanxia Liu
Jinhui Zhu
Chuang Gan
Mingkui Tan
58
1
0
04 Jun 2024
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
43
3
0
04 Jun 2024
Teledrive: An Embodied AI based Telepresence System
Snehasis Banerjee
Sayan Paul
R. Roychoudhury
Abhijan Bhattacharya
Chayan Sarkar
A. Sau
Pradip Pramanick
Brojeshwar Bhowmick
35
2
0
01 Jun 2024
PanoNormal: Monocular Indoor 360° Surface Normal Estimation
Kun Huang
Fanglue Zhang
N. Dodgson
MDE
39
0
0
29 May 2024
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
Junyoung Seo
Kazumi Fukuda
Takashi Shibuya
Takuya Narihira
Naoki Murata
Shoukang Hu
Chieh-Hsin Lai
Seungryong Kim
Yuki Mitsufuji
60
17
0
27 May 2024
Benchmarking General-Purpose In-Context Learning
Fan Wang
Chuan Lin
Yang Cao
Yu Kang
43
1
0
27 May 2024
Vision-and-Language Navigation Generative Pretrained Transformer
Hanlin Wen
LM&Ro
40
0
0
27 May 2024
Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations
Jingguo Liu
Yijun Xu
Shigang Li
Jianfeng Li
MDE
53
3
0
27 May 2024
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Kuan-Chih Huang
Xiangtai Li
Lu Qi
Shuicheng Yan
Ming-Hsuan Yang
LRM
76
10
0
27 May 2024
Map-based Modular Approach for Zero-shot Embodied Question Answering
Koya Sakamoto
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
32
3
0
26 May 2024
MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling
Diwei Huang
Kun-Li Channing Lin
Peihao Chen
Qing Du
Mingkui Tan
VGen
42
0
0
22 May 2024
CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs
Zidong Cao
Lin Wang
MDE
47
0
0
19 May 2024
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
57
24
0
16 May 2024
4D Panoptic Scene Graph Generation
Jingkang Yang
Jun Cen
Wenxuan Peng
Shuai Liu
Fangzhou Hong
Xiangtai Li
Kaiyang Zhou
Qifeng Chen
Ziwei Liu
45
13
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
37
13
0
16 May 2024
SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks
M. Ginting
Sung-Kyun Kim
David D. Fan
M. Palieri
Mykel J. Kochenderfer
Ali-akbar Agha-Mohammadi
51
7
0
16 May 2024
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
Yunhao Ge
Yihe Tang
Lyne Tchapmi
Cem Gokmen
Chengshu Li
...
Miao Liu
Pengchuan Zhang
Ruohan Zhang
Fei-Fei Li
Jiajun Wu
VGen
55
6
0
15 May 2024
Memory-Maze: Scenario Driven Benchmark and Visual Language Navigation Model for Guiding Blind People
Masaki Kuribayashi
Kohei Uehara
Allan Wang
Daisuke Sato
Simon Chu
Shigeo Morishima
42
1
0
11 May 2024
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun
Hongyu Zang
Xin-hui Li
Riashat Islam
41
5
0
10 May 2024
FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting
Yikun Ma
Dandan Zhan
Zhi Jin
3DGS
35
9
0
09 May 2024
Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview
Yuhang Ming
Xingrui Yang
Weihan Wang
Zheng Chen
Jinglun Feng
Yifan Xing
Guofeng Zhang
45
12
0
09 May 2024
A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
Huaiyuan Xu
Junliang Chen
Shiyu Meng
Yi Wang
Lap-Pui Chau
3DPC
50
18
0
08 May 2024
An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas
Mira Slavcheva
Dave Gausebeck
Kevin Chen
David Buchhofer
Azwad Sabik
Chen Ma
Sachal Dhillon
Olaf Brandt
Alan Dolhasz
34
5
0
06 May 2024
Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review
Anurag Dalal
Daniel Hagen
K. Robbersmyr
Kristian Muri Knausgård
GP
3DV
3DGS
59
21
0
06 May 2024
Previous
1
2
3
4
5
6
...
22
23
24
Next