Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2109.08238
Cited By
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
16 September 2021
Santhosh Kumar Ramakrishnan
Aaron Gokaslan
Erik Wijmans
Oleksandr Maksymets
Alexander Clegg
John Turner
Eric Undersander
Wojciech Galuba
Andrew Westbury
Angel X. Chang
Manolis Savva
Yili Zhao
Dhruv Batra
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI"
50 / 373 papers shown
Title
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
Ziyi Chen
Yingnan Guo
Zedong Chu
Minghua Luo
Yanfen Shen
...
Lu Liu
Honglin Han
X. Wu
Mu Xu
Yu Zhang
464
0
0
26 Nov 2025
Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI
Xinhao Liu
Jiaqi Li
Youming Deng
Ruxin Chen
Y. Zhang
Yifei Ma
Li Guo
Yiming Li
Jing Zhang
Chen Feng
VGen
88
0
0
25 Nov 2025
Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution
Dingkang Liang
Cheng Zhang
Xiaopeng Xu
Jianzhong Ju
Zhenbo Luo
Xiang Bai
LM&Ro
112
0
0
24 Nov 2025
ReEXplore: Improving MLLMs for Embodied Exploration with Contextualized Retrospective Experience Replay
Gengyuan Zhang
Mingcong Ding
Jingpei Wu
Ruotong Liao
Volker Tresp
LRM
113
0
0
24 Nov 2025
DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video
Jiawei Hou
Shenghao Zhang
Can Wang
Zheng Gu
Yonggen Ling
Taiping Zeng
Xiangyang Xue
Jingbo Zhang
3DPC
89
0
0
24 Nov 2025
Disc3D: Automatic Curation of High-Quality 3D Dialog Data via Discriminative Object Referring
Siyuan Wei
C. Wang
Xiao Liu
Xiaosheng Yan
Zhishan Zhou
Rui Huang
77
0
0
24 Nov 2025
4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation
Haonan Wang
Hanyu Zhou
Haoyue Liu
Luxin Yan
44
0
0
23 Nov 2025
POMA-3D: The Point Map Way to 3D Scene Understanding
Ye Mao
Weixun Luo
Ranran Huang
Junpeng Jing
K. Mikolajczyk
3DPC
89
0
0
20 Nov 2025
RoboTidy : A 3D Gaussian Splatting Household Tidying Benchmark for Embodied Navigation and Action
Xiaoquan sun
Ruijian Zhang
Kang Pang
Bingchen Miao
Yuxiang Tan
Zhen Yang
Ming Li
Jiayu Chen
LM&Ro
199
0
0
18 Nov 2025
SocialNav-Map: Dynamic Mapping with Human Trajectory Prediction for Zero-Shot Social Navigation
Lingfeng Zhang
Erjia Xiao
Xiaoshuai Hao
Haoxiang Fu
Zeying Gong
L. Chen
Xiaojun Liang
Renjing Xu
Hangjun Ye
Wenbo Ding
78
0
0
15 Nov 2025
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Italian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
312
0
0
14 Nov 2025
PanoNav: Mapless Zero-Shot Object Navigation with Panoramic Scene Parsing and Dynamic Memory
Qunchao Jin
Yilin Wu
Changhao Chen
140
0
0
10 Nov 2025
MacroNav: Multi-Task Context Representation Learning Enables Efficient Navigation in Unknown Environments
Kuankuan Sima
Longbin Tang
Haozhe Ma
Tianyuan Chen
88
0
0
06 Nov 2025
A Step Toward World Models: A Survey on Robotic Manipulation
Peng-Fei Zhang
Ying Cheng
Xiaofan Sun
S. Wang
Lei Zhu
Lei Zhu
Heng Tao Shen
LM&Ro
542
2
0
31 Oct 2025
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Xu Zheng
Zihao Dongfang
Lutao Jiang
Boyuan Zheng
Yulong Guo
...
L. Zhang
Danda Pani Paudel
Nicu Sebe
Luc Van Gool
Xuming Hu
LRM
VLM
540
2
0
29 Oct 2025
Understanding Multi-View Transformers
Michal Stary
Julien Gaubil
A. Tewari
Vincent Sitzmann
ViT
56
0
0
28 Oct 2025
NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation
Mingyu Jeong
Eunsung Kim
Sehun Park
Andrew Jaeyong Choi
64
0
0
28 Oct 2025
HyPerNav: Hybrid Perception for Object-Oriented Navigation in Unknown Environment
Zecheng Yin
H. Vicky Zhao
Zhen Li
101
0
0
27 Oct 2025
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations
Yujia Zhang
Xiaoyang Wu
Yixing Lao
Chengyao Wang
Zhuotao Tian
Naiyan Wang
Hengshuang Zhao
3DPC
117
1
0
27 Oct 2025
Towards Physically Executable 3D Gaussian for Embodied Navigation
Bingchen Miao
Rong Wei
Zhiqi Ge
Xiaoquan sun
Shiqi Gao
...
Renhan Wang
Siliang Tang
Jun Xiao
Rui Tang
Juncheng Billy Li
3DGS
160
1
0
24 Oct 2025
Multi-Step Reasoning for Embodied Question Answering via Tool Augmentation
Mingliang Zhai
Hansheng Liang
Xiaomeng Fan
Zhi Gao
Chuanhao Li
Che Sun
Xu Bin
Yuwei Wu
Yunde Jia
LRM
106
0
0
23 Oct 2025
Kinaema: a recurrent sequence model for memory and pose in motion
Mert Bulent Sariyildiz
Philippe Weinzaepfel
G. Bono
G. Monaci
Christian Wolf
72
0
0
23 Oct 2025
Embodied Navigation with Auxiliary Task of Action Description Prediction
Haru Kondoh
Asako Kanezaki
76
0
0
21 Oct 2025
World-in-World: World Models in a Closed-Loop World
Jiahan Zhang
Muqing Jiang
Nanru Dai
Taiming Lu
Arda Uzunoglu
...
Rama Chellappa
Tianmin Shu
Alan Yuille
Yilun Du
Jieneng Chen
VGen
VLM
156
4
0
20 Oct 2025
A Comprehensive Survey on World Models for Embodied AI
Xinqing Li
Xin He
Le Zhang
Yun-Hai Liu
Xiaoli Li
Yun Liu
VGen
LM&Ro
SyDa
168
2
0
19 Oct 2025
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
98
0
0
18 Oct 2025
GaussGym: An open-source real-to-sim framework for learning locomotion from pixels
Alejandro Escontrela
Justin Kerr
Arthur Allshire
Jonas Frey
Rocky Duan
Carmelo Sferrazza
Pieter Abbeel
3DGS
115
5
0
17 Oct 2025
SNAP: Towards Segmenting Anything in Any Point Cloud
Aniket Gupta
Hanhui Wang
Charles Saunders
Aruni RoyChowdhury
H. Singh
Huaizu Jiang
3DPC
VLM
98
0
0
13 Oct 2025
Into the Unknown: Towards using Generative Models for Sampling Priors of Environment Uncertainty for Planning in Configuration Spaces
Subhransu S. Bhattacharjee
Hao Lu
Dylan Campbell
Rahul Shome
3DPC
76
0
0
13 Oct 2025
CompassNav: Steering From Path Imitation To Decision Understanding In Navigation
Linfeng Li
Jian Zhao
Yuan Xie
Xin Tan
Xuelong Li
85
2
0
11 Oct 2025
NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
Haolin Yang
Yuxing Long
Zhuoyuan Yu
Zihan Yang
Minghan Wang
...
Y. Wang
Ziyan Yu
Wenzhe Cai
Lei Kang
Hao Dong
64
0
0
09 Oct 2025
Learning to Navigate Socially Through Proactive Risk Perception
Erjia Xiao
Lingfeng Zhang
Yingbo Tang
Hao Cheng
Zhanchen Zhu
Wenbo Ding
Lei Zhou
L. Chen
Hangjun Ye
Xiaoshuai Hao
136
0
0
09 Oct 2025
IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Yandu Chen
Kefan Gu
Yuqing Wen
Yucheng Zhao
Tiancai Wang
Liqiang Nie
LM&Ro
LRM
45
0
0
09 Oct 2025
Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
Suresh Damodaran
Paul D. Rowe
AAML
76
8
0
07 Oct 2025
Active Semantic Perception
Huayi Tang
Pratik Chaudhari
3DV
105
0
0
06 Oct 2025
SegMASt3R: Geometry Grounded Segment Matching
Rohit Jayanti
Swayam Agrawal
Vansh Garg
Siddharth Tourani
Muhammad Haris Khan
Sourav Garg
Madhava Krishna
3DV
217
0
0
06 Oct 2025
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
Jiahao Wang
Luoxin Ye
Taiming Lu
Junfei Xiao
Jiahan Zhang
...
Xijun Liu
Rama Chellappa
Cheng-Fang Peng
Alan Yuille
Jieneng Chen
VGen
93
1
0
01 Oct 2025
MUVLA: Learning to Explore Object Navigation via Map Understanding
Peilong Han
Fan Jia
Min Zhang
Yutao Qiu
Hongyao Tang
Yan Zheng
Tiancai Wang
Jianye Hao
64
1
0
30 Sep 2025
OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation
Xinda Xue
Junjun Hu
Minghua Luo
Xie Shichao
Jintao Chen
Zixun Xie
Quan Kuichen
Guo Wei
Mu Xu
Zedong Chu
160
3
0
30 Sep 2025
DepthLM: Metric Depth From Vision Language Models
Zhipeng Cai
Ching-Feng Yeh
Hu Xu
Zhuang Liu
Gregory Meyer
X. Lei
Changsheng Zhao
Shang-Wen Li
Vikas Chandra
Yangyang Shi
VLM
3DV
174
1
0
29 Sep 2025
SSR-ZSON: Zero-Shot Object Navigation via Spatial-Semantic Relations within a Hierarchical Exploration Framework
Xiangyi Meng
D. Li
Zihao Mao
Yi Yang
Wenjie Song
76
1
0
29 Sep 2025
LLM-RG: Referential Grounding in Outdoor Scenarios using Large Language Models
Pranav Saxena
A. Bhattacharya
Ji Zhang
Wenshan Wang
119
1
0
29 Sep 2025
FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention
Hangtian Zhao
Xiang Chen
Yizhe Li
Qianhao Wang
Haibo Lu
Fei Gao
MDE
82
0
0
28 Sep 2025
HELIOS: Hierarchical Exploration for Language-grounded Interaction in Open Scenes
Katrina Ashton
Chahyon Ku
Shrey Shah
W. Jiang
Kostas Daniilidis
Bernadette Bucher
LM&Ro
55
0
0
26 Sep 2025
PersONAL: Towards a Comprehensive Benchmark for Personalized Embodied Agents
Filippo Ziliotto
Jelin Raphael Akkara
Alessandro Daniele
Lamberto Ballan
Luciano Serafini
Tommaso Campari
LM&Ro
76
0
0
24 Sep 2025
DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction
Bo Liu
Runlong Li
Li Zhou
Yan Zhou
61
2
0
21 Sep 2025
Agentic Aerial Cinematography: From Dialogue Cues to Cinematic Trajectories
Yifan Lin
Sophie Ziyu Liu
Ran Qi
George Z. Xue
Xinping Song
Chao Qin
Hugh H. T. Liu
VGen
89
0
0
19 Sep 2025
FiLM-Nav: Efficient and Generalizable Navigation via VLM Fine-tuning
Naoki Yokoyama
Sehoon Ha
LM&Ro
48
0
0
19 Sep 2025
Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI
Fei Ni
Min Zhang
Pengyi Li
Yifu Yuan
Lingfeng Zhang
...
Yuzheng Zhuang
Yingxue Zhang
Yan Zheng
Hongyao Tang
Jianye Hao
ELM
130
1
0
18 Sep 2025
PA-MPPI: Perception-Aware Model Predictive Path Integral Control for Quadrotor Navigation in Unknown Environments
Yifan Zhai
Rudolf Reiter
Davide Scaramuzza
97
2
0
18 Sep 2025
1
2
3
4
5
6
7
8
Next