1709.06158
Matterport3D: Learning from RGB-D Data in Indoor Environments
18 September 2017
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
Papers citing
"Matterport3D: Learning from RGB-D Data in Indoor Environments"
50 / 1,324 papers shown
Title
CompassNav: Steering From Path Imitation To Decision Understanding In Navigation
Linfeng Li
Jian Zhao
Yuan Xie
Xin Tan
Xuelong Li
106
2
0
11 Oct 2025
An End-to-End Room Geometry Constrained Depth Estimation Framework for Indoor Panorama Images
Kanglin Ning
Ruzhao Chen
Penghong Wang
X. Wang
Ruiqin Xiong
Xiaopeng Fan
MDE
3DV
185
0
0
09 Oct 2025
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung
Jaehoon Choi
Yonghan Lee
Sungmin Eum
Heesung Kwon
Dinesh Manocha
104
0
0
08 Oct 2025
Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Kun Xiang
Terry Jingchen Zhang
Yinya Huang
Jixi He
Zirong Liu
...
J. N. Han
Hang Xu
Han Li
Bin Dong
Xiaodan Liang
PINN
AI4CE
348
1
0
06 Oct 2025
GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Weijia Dou
X. Zhang
Yi Bin
Jian Liu
Bo Peng
Guoqing Wang
Yang Yang
H. Shen
128
0
0
02 Oct 2025
What Matters in RL-Based Methods for Object-Goal Navigation? An Empirical Study and A Unified Framework
Hongze Wang
Boyang Sun
Jiaxu Xing
Fan Yang
Marco Hutter
Dhruv Shah
Davide Scaramuzza
Marc Pollefeys
84
0
0
02 Oct 2025
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
Jiahao Wang
Luoxin Ye
Taiming Lu
Junfei Xiao
Jiahan Zhang
...
Xijun Liu
Rama Chellappa
Cheng-Fang Peng
Alan Yuille
Jieneng Chen
VGen
121
1
0
01 Oct 2025
Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions
Thanh Nguyen Canh
Haolan Zhang
Xiem HoangVan
N. Chong
145
0
0
01 Oct 2025
OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation
Xinda Xue
Junjun Hu
Minghua Luo
Xie Shichao
Jintao Chen
Zixun Xie
Quan Kuichen
Guo Wei
Mu Xu
Zedong Chu
244
6
0
30 Sep 2025
OceanGym: A Benchmark Environment for Underwater Embodied Agents
Yida Xue
Mingjun Mao
Xiangyuan Ru
Yuqi Zhu
Baochang Ren
...
Shumin Deng
Xinyu An
Ningyu Zhang
Ying Chen
Huajun Chen
158
0
0
30 Sep 2025
DA²: Depth Anything in Any Direction
Haodong Li
Wangguangdong Zheng
Jing He
Yuhao Liu
Xin Lin
Xin Yang
Ying-Cong Chen
Chunchao Guo
MDE
444
4
0
30 Sep 2025
Landmark-Guided Knowledge for Vision-and-Language Navigation
International Conference on Intelligent Computing (ICIC), 2025
Dongsheng Yang
Meiling Zhu
Yinfeng Yu
LM&Ro
111
0
0
30 Sep 2025
OWL: Geometry-Aware Spatial Reasoning for Audio Large Language Models
Subrata Biswas
Mohammad Nur Hossain Khan
Bashima Islam
VLM
LRM
109
1
0
30 Sep 2025
Iterative Residual Cross-Attention Mechanism: An Integrated Approach for Audio-Visual Navigation Tasks
Hailong Zhang
Yinfeng Yu
Liejun Wang
Fuchun Sun
Wendong Zheng
64
0
0
30 Sep 2025
SSR-ZSON: Zero-Shot Object Navigation via Spatial-Semantic Relations within a Hierarchical Exploration Framework
Xiangyi Meng
D. Li
Zihao Mao
Yi Yang
Wenjie Song
96
1
0
29 Sep 2025
AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation
X. Ding
Jianyu Wei
Yifan Yang
Shiqi Jiang
Qianxi Zhang
...
Yuxuan Yan
Weijun Wang
Yunxin Liu
Zhibo Chen
Ting Cao
LRM
117
0
0
29 Sep 2025
LLM-RG: Referential Grounding in Outdoor Scenarios using Large Language Models
Pranav Saxena
A. Bhattacharya
Ji Zhang
Wenshan Wang
151
1
0
29 Sep 2025
DepthLM: Metric Depth From Vision Language Models
Zhipeng Cai
Ching-Feng Yeh
Hu Xu
Zhuang Liu
Gregory Meyer
X. Lei
Changsheng Zhao
Shang-Wen Li
Vikas Chandra
Yangyang Shi
VLM
3DV
242
1
0
29 Sep 2025
M3DLayout: A Multi-Source Dataset of 3D Indoor Layouts and Structured Descriptions for 3D Generation
Yiheng Zhang
Zhuojiang Cai
Mingdao Wang
Meitong Guo
Tianxiao Li
Li Lin
Yuwang Wang
3DV
140
0
0
28 Sep 2025
RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization
Dongki Jung
Jaehoon Choi
Yonghan Lee
Dinesh Manocha
MDE
142
0
0
28 Sep 2025
GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State
Guole Shen
Tianchen Deng
Yanbo Wang
Yongtao Chen
Yilin Shen
Jiuming Liu
Jingchuan Wang
3DV
114
3
0
28 Sep 2025
WavJEPA: Semantic learning unlocks robust audio foundation models for raw waveforms
Goksenin Yuksel
Pierre Guetschel
Michael Tangermann
Marcel van Gerven
Kiki van der Heijden
AI4TS
124
0
0
27 Sep 2025
Robot Learning from Any Images
Siheng Zhao
Jiageng Mao
Wei Chow
Zeyu Shangguan
Tianheng Shi
...
Daniel Seita
Leonidas Guibas
Sergey Zakharov
Vitor Campagnolo Guizilini
Yue Wang
158
4
0
26 Sep 2025
JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Shuang Zeng
Dekang Qi
Xinyuan Chang
Feng Xiong
Shichao Xie
Xiaolong Wu
Shiyi Liang
Mu Xu
Xing Wei
148
13
0
26 Sep 2025
Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections
Jing Wu
Zirui Wang
Iro Laina
V. Prisacariu
92
0
0
24 Sep 2025
Boosting Zero-Shot VLN via Abstract Obstacle Map-Based Waypoint Prediction with TopoGraph-and-VisitInfo-Aware Prompting
Boqi Li
Siyuan Li
Weiyi Wang
Anran Li
Zhong Cao
Henry X. Liu
LM&Ro
104
1
0
24 Sep 2025
PersONAL: Towards a Comprehensive Benchmark for Personalized Embodied Agents
Filippo Ziliotto
Jelin Raphael Akkara
Alessandro Daniele
Lamberto Ballan
Luciano Serafini
Tommaso Campari
LM&Ro
92
0
0
24 Sep 2025
Advancing Audio-Visual Navigation Through Multi-Agent Collaboration in 3D Environments
Hailong Zhang
Yinfeng Yu
Liejun Wang
Fuchun Sun
Wendong Zheng
84
0
0
21 Sep 2025
Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI
Fei Ni
Min Zhang
Pengyi Li
Yifu Yuan
Lingfeng Zhang
...
Yuzheng Zhuang
Yingxue Zhang
Yan Zheng
Hongyao Tang
Jianye Hao
ELM
174
1
0
18 Sep 2025
SPATIALGEN: Layout-guided 3D Indoor Scene Generation
Chuan Fang
Heng Li
Yixun Liang
Jia Zheng
Yongsen Mao
Yuan Liu
Rui Tang
Zihan Zhou
Ping Tan
3DV
320
0
0
18 Sep 2025
MetricNet: Recovering Metric Scale in Generative Navigation Policies
Abhijeet Nayak
Débora N.P. Oliveira
Samiran Gode
Cordelia Schmid
Wolfram Burgard
88
0
0
17 Sep 2025
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
Xu Zheng
Chenfei Liao
Ziqiao Weng
Kaiyu Lei
Zihao Dongfang
...
D. Paudel
Kailun Yang
L. Zhang
Luc Van Gool
Xuming Hu
152
3
0
16 Sep 2025
3D Aware Region Prompted Vision Language Model
A. Cheng
Yang Fu
Yukang Chen
Zhijian Liu
X. Li
...
Jan Kautz
Pavlo Molchanov
Hongxu Yin
Xiaolong Wang
Sifei Liu
119
8
0
16 Sep 2025
ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation
Zekai Zhang
Weiye Zhu
Hewei Pan
Xiangchen Wang
Rongtao Xu
Xing Sun
Feng Zheng
148
2
0
16 Sep 2025
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
Weipeng Zhong
Peizhou Cao
Yichen Jin
Li Ray Luo
Wenzhe Cai
...
Zhaoyang Lyu
Tai Wang
Bo Dai
Xudong Xu
Jiangmiao Pang
3DV
246
1
0
13 Sep 2025
GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation
Hang Yin
Haoyu Wei
Xiuwei Xu
Wenxuan Guo
Jie Zhou
Jiwen Lu
3DV
157
2
0
12 Sep 2025
OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning
Yuecheng Liu
Dafeng Chi
Shiguang Wu
Zhanguang Zhang
Yuzheng Zhuang
...
Pengwei Xie
David Gamaliel Arcos Bravo
Yingxue Zhang
Jianye Hao
Xingyue Quan
LM&Ro
LRM
162
2
0
11 Sep 2025
TopoNav: Topological Graphs as a Key Enabler for Advanced Object Navigation
Peiran Liu
Qiang Zhang
Daojie Peng
Lingfeng Zhang
Yihao Qin
Hang Zhou
Jun Ma
Zhanchen Zhu
Yiding Ji
100
4
0
01 Sep 2025
Look Beyond: Two-Stage Scene View Generation via Panorama and Video Diffusion
Xueyang Kang
Zhengkang Xiang
Zezheng Zhang
Kourosh Khoshelham
DiffM
VGen
101
0
0
31 Aug 2025
Deep Learning for Personalized Binaural Audio Reproduction
Xikun Lu
Yunda Chen
Zehua Chen
Jie Wang
Mingxing Liu
Hongmei Hu
C. Zheng
Stefan Bleeck
Jinqiu Sang
144
2
0
30 Aug 2025
Beyond Pixels: Introducing Geometric-Semantic World Priors for Video-based Embodied Models via Spatio-temporal Alignment
Jinzhou Tang
Jusheng Zhang
Sidi Liu
Waikit Xiu
Qinhan Lv
Xiying Li
66
0
0
29 Aug 2025
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Shouwei Ruan
Liyuan Wang
Caixin Kang
Qihui Zhu
Songming Liu
Xingxing Wei
Hang Su
LM&Ro
135
5
0
24 Aug 2025
Fiducial Marker Splatting for High-Fidelity Robotics Simulations
Diram Tabaa
Gianni Di Caro
3DGS
116
1
0
23 Aug 2025
SIGN: Safety-Aware Image-Goal Navigation for Autonomous Drones via Reinforcement Learning
Zichen Yan
Rui Huang
Lei He
Shao Guo
Tianyuan Chen
120
1
0
17 Aug 2025
Advances in Speech Separation: Techniques, Challenges, and Future Trends
Kai Li
Guo Chen
Wendi Sang
Yi Luo
Zhuo Chen
...
Shulin He
Zhong-Qiu Wang
Andong Li
Z. Wu
Xiaolin Hu
AI4TS
104
4
0
14 Aug 2025
CorrectNav: Self-Correction Flywheel Empowers Vision-Language-Action Navigation Model
Zhuoyuan Yu
Yuxing Long
Zihan Yang
Chengyan Zeng
Hongwei Fan
Jiyao Zhang
Hao Dong
LRM
108
8
0
14 Aug 2025
DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation
Haoxiang Shi
Xiang Deng
Zaijing Li
Gongwei Chen
Yaowei Wang
Liqiang Nie
64
0
0
13 Aug 2025
Distilling LLM Prior to Flow Model for Generalizable Agent's Imagination in Object Goal Navigation
B. Li
Ren-jie Lu
Yu Zhou
Jingke Meng
Wei-Shi Zheng
180
0
0
13 Aug 2025
SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)
Computers & Graphics (Comput. Graph.), 2025
T. Nguyen
Viet-Tham Huynh
Quang-Thuc Nguyen
H. Nguyen
Long Le Bao
...
Dinh-Khoi Vo
Van-Loc Nguyen
Trung-Truc Huynh-Le
Tam V. Nguyen
Minh-Triet Tran
3DV
100
1
0
12 Aug 2025
Harnessing Input-Adaptive Inference for Efficient VLN
Dongwoo Kang
Akhil Perincherry
Zachary Coalson
Aiden Gabriel
Stefan Lee
Sanghyun Hong
LM&Ro
118
0
0
12 Aug 2025