Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1709.06158
Cited By
Matterport3D: Learning from RGB-D Data in Indoor Environments
18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Matterport3D: Learning from RGB-D Data in Indoor Environments"
50 / 1,327 papers shown
SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control
Xiaohan Zhang
Sebastian Starke
Vladimir Guzov
Zhensong Zhang
Eduardo Pérez-Pellitero
Gerard Pons-Moll
DiffM
VGen
308
9
0
20 Dec 2024
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation
Kun Wu
Chengkai Hou
Jiaming Liu
Zhengping Che
Xiaozhu Ju
...
Zhenyu Wang
Pengju An
Siyuan Qian
Shanghang Zhang
Jian Tang
LM&Ro
557
88
0
18 Dec 2024
iKap: Kinematics-aware Planning with Imperative Learning
IEEE International Conference on Robotics and Automation (ICRA), 2024
Qihang Li
Zhuoqun Chen
Haoze Zheng
Haonan He
Shaoshu Su
Shaoshu Su
Junyi Geng
Chen Wang
542
5
0
12 Dec 2024
NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Qiang Qu
Hanxue Liang
Xiaoming Chen
Yuk Ying Chung
Yiran Shen
307
21
0
11 Dec 2024
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds
Computer Vision and Pattern Recognition (CVPR), 2024
Z-H. Tang
Yuchen Fan
Dilin Wang
Hongyu Xu
Rakesh Ranjan
Alex Schwing
Zhicheng Yan
3DGS
VGen
3DV
263
78
0
09 Dec 2024
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
Gengze Zhou
Yicong Hong
Zun Wang
Chongyang Zhao
Joey Tianyi Zhou
Qi Wu
164
5
0
07 Dec 2024
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
AAAI Conference on Artificial Intelligence (AAAI), 2024
Wenting Xu
Viorela Ila
Luping Zhou
Craig T. Jin
397
2
0
07 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Computer Vision and Pattern Recognition (CVPR), 2024
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG
LM&Ro
MLLM
LRM
331
12
0
05 Dec 2024
Multi-view Image Diffusion via Coordinate Noise and Fourier Attention
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Justin D. Theiss
Norman Müller
Daeil Kim
Aayush Prakash
236
0
0
04 Dec 2024
Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zijiao Yang
Xiangxi Shi
Eric Slyman
Stefan Lee
AAML
322
4
0
03 Dec 2024
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
Dillon Loh
Tomasz Bednarz
Xinxing Xia
Frank Guan
360
1
0
27 Nov 2024
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Computer Vision and Pattern Recognition (CVPR), 2024
Mehdi Zayene
Jannik Endres
Albias Havolli
Charles Corbière
Salim Cherkaoui
Alexandre Kontouli
Alexandre Alahi
MDE
625
1
0
27 Nov 2024
g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks
Computer Vision and Pattern Recognition (CVPR), 2024
Zihan Wang
Gim Hee Lee
257
8
0
26 Nov 2024
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Computer Vision and Pattern Recognition (CVPR), 2024
Xinhao Liu
Jiajian Li
Yichen Jiang
Niranjan Sujay
Zhiyong Yang
Juexiao Zhang
John Abanes
Jing Zhang
Chen Feng
537
25
0
26 Nov 2024
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Stuti Pathak
Prashant Kumar
Nicholus Mboga
Gunther Steenackers
R. Penne
Rudi Penne
1.2K
1
0
26 Nov 2024
DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation
PLoS ONE (PLoS ONE), 2024
Yuxuan Yang
Wenwen Qiang
DiffM
598
4
0
25 Nov 2024
TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation
Linqing Zhong
Chen Gao
Zihan Ding
Yue Liao
Si Liu
Shifeng Zhang
Xu Zhou
Si Liu
LRM
1.1K
10
0
25 Nov 2024
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Computer Vision and Pattern Recognition (CVPR), 2024
Chan Hee Song
Valts Blukis
Jonathan Tremblay
Stephen Tyree
Yu-Chuan Su
Stan Birchfield
841
82
0
25 Nov 2024
Understanding World or Predicting Future? A Comprehensive Survey of World Models
ACM Computing Surveys (ACM CSUR), 2024
Jingtao Ding
Yunke Zhang
Yu Shang
Yuheng Zhang
Zefang Zong
...
Fengli Xu
Yong Li
Chen Gao
Fengli Xu
Yong Li
VGen
SyDa
517
17
0
21 Nov 2024
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation
Umamaheswaran Raman Kumar
A. Fayjie
Jurgen Hannaert
Patrick Vandewalle
3DV
3DPC
338
1
0
20 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
385
8
0
18 Nov 2024
The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods
The international journal of robotics research (IJRR), 2024
Yifu Tao
Miguel Ángel Muñoz-Bañón
Lintong Zhang
Jiahao Wang
L. Fu
Maurice F. Fallon
246
23
0
15 Nov 2024
VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes
IEEE International Conference on Robotics and Automation (ICRA), 2024
A. Sethuraman
Onur Bagoren
Harikrishnan Seetharaman
Dalton Richardson
Joseph Taylor
Katherine A. Skinner
3DV
248
0
0
07 Nov 2024
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
Neural Information Processing Systems (NeurIPS), 2024
Xi Yang
Xu Gu
Xingyilang Yin
Xinbo Gao
271
2
0
06 Nov 2024
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
Haochen Zhang
Nader Zantout
Pujith Kachana
Zongyuan Wu
Ji Zhang
Ji Zhang
3DV
LM&Ro
317
14
0
05 Nov 2024
Deep Learning on 3D Semantic Segmentation: A Detailed Review
Remote Sensing (Remote Sens.), 2024
Thodoris Betsas
Andreas Georgopoulos
Anastasios Doulamis
Pierre Grussenmeyer
3DV
3DPC
342
13
0
04 Nov 2024
Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images
Kun Huang
Fang-Lue Zhang
Fangfang Zhang
Yu-Kun Lai
Paul L. Rosin
N. Dodgson
232
1
0
04 Nov 2024
CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2024
Yiqin Zhao
Mallesham Dasari
Tian Guo
403
1
0
04 Nov 2024
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Wang Zhao
Jiachen Liu
Sheng Zhang
Yongbin Li
Sili Chen
S. X. Huang
Wenshu Fan
Hengkai Guo
192
0
0
02 Nov 2024
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Neural Information Processing Systems (NeurIPS), 2024
Weicai Ye
Chenhao Ji
Zheng Chen
Junyao Gao
Xiaoshui Huang
Song-Hai Zhang
Wanli Ouyang
Tong He
Cairong Zhao
Guofeng Zhang
266
29
0
31 Oct 2024
Deep Learning for 3D Point Cloud Enhancement: A Survey
Siwen Quan
Junhao Yu
Ziming Nie
Muze Wang
Sijia Feng
Pei An
Jiaqi Yang
3DPC
229
5
0
30 Oct 2024
SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark
Neural Information Processing Systems (NeurIPS), 2024
Hyunjun Jung
Weihang Li
Shun-cheng Wu
William Bittner
Nikolas Brasch
...
Eduardo Pérez-Pellitero
Zhensong Zhang
Arthur Moreau
Nassir Navab
Benjamin Busam
289
7
0
30 Oct 2024
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents
International Conference on Learning Representations (ICLR), 2024
Junting Chen
Checheng Yu
Xunzhe Zhou
Tianqi Xu
Yao Mu
Mengkang Hu
Wenqi Shao
Yun Wang
Ge Li
Lin Shao
369
14
0
30 Oct 2024
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting
IEEE Robotics and Automation Letters (RA-L), 2024
Yongqian Li
Zijia Kuang
Ting Li
Guyue Zhou
Zike Yan
Guyue Zhou
Shaohui Zhang
3DGS
422
26
0
29 Oct 2024
ANAVI: Audio Noise Awareness using Visuals of Indoor environments for NAVIgation
Vidhi Jain
Rishi Veerapaneni
Yonatan Bisk
157
0
0
24 Oct 2024
Scale Propagation Network for Generalizable Depth Completion
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Haotian Wang
Meng Yang
Xinhu Zheng
Gang Hua
274
5
0
24 Oct 2024
PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model
Zhongchen Deng
Zhechen Yang
Chi Chen
Cheng Zeng
Yan Meng
Bisheng Yang
216
3
0
21 Oct 2024
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image
Neural Information Processing Systems (NeurIPS), 2024
Yu Zhao
Hao Fei
Xiangtai Li
L. Qin
Jiayi Ji
Erik Cambria
Meishan Zhang
Hao Fei
Jianguo Wei
DiffM
263
2
0
20 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Neural Information Processing Systems (NeurIPS), 2024
Rui Liu
Wenguan Wang
Yue Yang
229
18
0
18 Oct 2024
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Computer Vision and Pattern Recognition (CVPR), 2024
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
431
5
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
IEEE Robotics and Automation Letters (RA-L), 2024
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
255
1
0
16 Oct 2024
3D Gaussian Splatting in Robotics: A Survey
Siting Zhu
Guangming Wang
Dezhi Kong
Hesheng Wang
3DGS
273
49
0
16 Oct 2024
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
IEEE Robotics and Automation Letters (RA-L), 2024
Joey Wilson
Ruihan Xu
Yile Sun
Parker Ewen
Minghan Zhu
Kira Barton
Maani Ghaffari
370
2
0
15 Oct 2024
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination
International Conference on Learning Representations (ICLR), 2024
Xinxin Zhao
Wenzhe Cai
Likun Tang
Teng Wang
LM&Ro
238
20
0
13 Oct 2024
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Neural Information Processing Systems (NeurIPS), 2024
Hang Yin
Xiuwei Xu
Zhenyu Wu
Jie Zhou
Jiwen Lu
228
70
0
10 Oct 2024
Automated Creation of Digital Cousins for Robust Policy Learning
Conference on Robot Learning (CoRL), 2024
Tianyuan Dai
Josiah Wong
Yunfan Jiang
Chen Wang
Cem Gokmen
Ruohan Zhang
Jiajun Wu
Li Fei-Fei
271
79
0
09 Oct 2024
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology
Xinyu Wang
Donglin Yang
Ziqin Wang
Hohin Kwan
Jinyu Chen
wenjun wu
Hongsheng Li
Yue Liao
Si Liu
247
50
0
09 Oct 2024
3D Representation Methods: A Survey
Zhengren Wang
3DGS
194
12
0
09 Oct 2024
CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality
Wenjie Chang
Hao Ai
Tianzhu Zhang
Lin Wang
MDE
208
1
0
08 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Xue Liu
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
754
12
0
07 Oct 2024
Previous
1
2
3
...
5
6
7
...
25
26
27
Next