Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.06158
Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments
18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Matterport3D: Learning from RGB-D Data in Indoor Environments"
50 / 1,167 papers shown
Title
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Yanyuan Qiao
Wenqi Lyu
Hui Wang
Zixu Wang
Zerui Li
Yuan Zhang
Mingkui Tan
Qi Wu
LRM
43
4
0
27 Sep 2024
HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting
Zijun Xu
Rui Jin
Ke Wu
Yi Zhao
Zhiwei Zhang
Jieru Zhao
Fei Gao
Zhongxue Gan
Wenchao Ding
50
4
0
26 Sep 2024
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation
Zehao Wang
Minye Wu
Yixin Cao
Yubo Ma
Meiqi Chen
Tinne Tuytelaars
43
1
0
25 Sep 2024
Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models
Mike Zhang
Kaixian Qu
Vaishakh Patil
Cesar Cadena
Marco Hutter
LM&Ro
3DV
41
4
0
23 Sep 2024
Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras
Ming Li
Xueqian Jin
Xuejiao Hu
Jinghao Cao
S. Du
Yang Li
MDE
48
0
0
23 Sep 2024
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation
Naoki Yokoyama
Ram Ramrakhya
Abhishek Das
Dhruv Batra
Sehoon Ha
38
10
0
22 Sep 2024
From Cognition to Precognition: A Future-Aware Framework for Social Navigation
Zeying Gong
Tianshuai Hu
Ronghe Qiu
Junwei Liang
167
0
0
20 Sep 2024
Navigation with VLM framework: Go to Any Language
Zecheng Yin
Chonghao Cheng
Lizhen
LM&Ro
32
0
0
18 Sep 2024
Online Diffusion-Based 3D Occupancy Prediction at the Frontier with Probabilistic Map Reconciliation
Alec Reed
Lorin Achey
Brendan Crowe
Bradley Hayes
Christoffer Heckman
35
0
0
16 Sep 2024
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Yifan Xu
Ziming Luo
Qianwei Wang
Vineet Kamat
Carol Menassa
3DV
3DPC
38
0
0
16 Sep 2024
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation
Muraleekrishna Gopinathan
Martin Masek
Jumana Abu-Khalaf
David Suter
LM&Ro
41
1
0
09 Sep 2024
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader
Leon Eisemann
Adrian Pogorzelski
Namrata Jangid
Attila B. Kis
51
0
0
06 Sep 2024
Estimating Indoor Scene Depth Maps from Ultrasonic Echoes
Junpei Honma
Akisato Kimura
Go Irie
MDE
43
0
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-xiong Wang
78
15
0
05 Sep 2024
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration
Rongge Zhang
Haechan Mark Bong
Giovanni Beltrame
57
1
0
27 Aug 2024
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth
Cho-Ying Wu
Quankai Gao
Chin-Cheng Hsu
Te-Lin Wu
Jing-Wen Chen
Ulrich Neumann
MDE
37
0
0
25 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
64
1
0
20 Aug 2024
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration
Hao Ai
Lin Wang
40
0
0
18 Aug 2024
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps
Senthil Hariharan Arul
Dhruva Kumar
Vivek Sugirtharaj
Richard Kim
Xuewei
Qi
R. Madhivanan
Arnie Sen
Dinesh Manocha
23
1
0
15 Aug 2024
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions
Ryosuke Korekata
Kanta Kaneda
Shunya Nagashima
Yuto Imai
Komei Sugiura
ObjD
LM&Ro
53
2
0
15 Aug 2024
Structure-preserving Planar Simplification for Indoor Environments
Bishwash Khanal
Sanjay Rijal
Manish Awale
V. Ojha
3DPC
35
0
0
13 Aug 2024
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Junrui Zhang
Jiaqi Li
Yachuan Huang
Yiran Wang
Jinghong Zheng
Liao Shen
Z. Cao
MDE
39
3
0
12 Aug 2024
UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
37
1
0
08 Aug 2024
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng
Jun Wang
Chuanhao Li
Quanfeng Lu
Hao Tian
...
Jifeng Dai
Ping Luo
Ping Luo
Kaipeng Zhang
Wenqi Shao
VLM
60
18
0
05 Aug 2024
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
61
1
0
02 Aug 2024
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
51
2
0
31 Jul 2024
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments
Taewoong Kim
Cheolhong Min
Byeonghwi Kim
Jinyeon Kim
Wonje Jeung
Jonghyun Choi
LM&Ro
44
5
0
26 Jul 2024
Deep Spherical Superpixels
Rémi Giraud
Michael Clement
MDE
61
0
0
24 Jul 2024
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
50
5
0
21 Jul 2024
Self-training Room Layout Estimation via Geometry-aware Ray-casting
Bolivar Solarte
Chin-Hsuan Wu
Jin-Cheng Jhang
Jonathan Lee
Yi-Hsuan Tsai
Min Sun
SSL
34
2
0
21 Jul 2024
VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
53
3
0
20 Jul 2024
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
Lukas Bosiger
Mihai Dusmanu
Marc Pollefeys
Z. Bauer
59
0
0
18 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
67
4
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
52
3
0
18 Jul 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-feng Li
Yueh-Feng Ku
Hong-Xuan Yen
Chi Liu
Yu-Lun Liu
Albert Y. C. Chen
Cheng-Hao Kuo
Min Sun
3DV
VGen
62
4
0
17 Jul 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
50
19
0
17 Jul 2024
GRUtopia: Dream General Robots in a City at Scale
Hanqing Wang
Jiahe Chen
Wensi Huang
Qingwei Ben
Tai Wang
...
Ying Zhao
Zhongying Tu
Yu Qiao
Dahua Lin
Jiangmiao Pang
LM&Ro
VGen
60
16
0
15 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
47
6
0
13 Jul 2024
Semantic UV mapping to improve texture inpainting for indoor scenes
J. Vermandere
M. Bassier
M. Vergauwen
59
1
0
12 Jul 2024
UNRealNet: Learning Uncertainty-Aware Navigation Features from High-Fidelity Scans of Real Environments
S. Triest
David D. Fan
Sebastian Scherer
Ali-Akbar Agha-Mohammadi
45
4
0
11 Jul 2024
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints
Rui Yin
Yulun Zhang
Zherong Pan
Jianjun Zhu
Cheng Wang
Biao Jia
44
1
0
11 Jul 2024
Fusion of Short-term and Long-term Attention for Video Mirror Detection
Mingchen Xu
Jing Wu
Yukun Lai
Ze Ji
37
1
0
10 Jul 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
73
27
0
10 Jul 2024
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
Xiaoding Yuan
Shitao Tang
Kejie Li
Alan Yuille
Peng Wang
DiffM
39
3
0
09 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Yang Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
56
52
0
09 Jul 2024
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
57
10
0
08 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
46
6
0
07 Jul 2024
Open Panoramic Segmentation
Junwei Zheng
Ruiping Liu
Yufan Chen
Kunyu Peng
Chengzhi Wu
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
VLM
47
8
0
02 Jul 2024
Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Takayuki Nishimura
Katsuyuki Kuyo
Motonari Kambara
Komei Sugiura
DiffM
35
0
0
01 Jul 2024
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
Hieu T. Nguyen
Yiwen Chen
Vikram S. Voleti
Varun Jampani
Huaizu Jiang
61
0
0
28 Jun 2024
Previous
1
2
3
4
5
...
22
23
24
Next