Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI

16 September 2021
Santhosh Kumar Ramakrishnan
Aaron Gokaslan
Erik Wijmans
Oleksandr Maksymets
Alexander Clegg
John Turner
Eric Undersander
Wojciech Galuba
Andrew Westbury
Angel X. Chang
Manolis Savva
Yili Zhao
Dhruv Batra

Papers citing "Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI"

50 / 382 papers shown
FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention
Hangtian Zhao
Xiang Chen
Yizhe Li
Qianhao Wang
Haibo Lu
Fei Gao
MDE
118
0
0
28 Sep 2025
HELIOS: Hierarchical Exploration for Language-grounded Interaction in Open Scenes
Katrina Ashton
Chahyon Ku
Shrey Shah
W. Jiang
Kostas Daniilidis
Bernadette Bucher
LM&Ro
119
0
0
26 Sep 2025
PersONAL: Towards a Comprehensive Benchmark for Personalized Embodied Agents
Filippo Ziliotto
Jelin Raphael Akkara
Alessandro Daniele
Lamberto Ballan
Luciano Serafini
Tommaso Campari
LM&Ro
121
0
0
24 Sep 2025
DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction
Bo Liu
Runlong Li
Li Zhou
Yan Zhou
102
2
0
21 Sep 2025
Agentic Aerial Cinematography: From Dialogue Cues to Cinematic Trajectories
Yifan Lin
Sophie Ziyu Liu
Ran Qi
George Z. Xue
Xinping Song
Chao Qin
Hugh H. T. Liu
VGen
145
0
0
19 Sep 2025
FiLM-Nav: Efficient and Generalizable Navigation via VLM Fine-tuning
Naoki Yokoyama
Sehoon Ha
LM&Ro
135
2
0
19 Sep 2025
PA-MPPI: Perception-Aware Model Predictive Path Integral Control for Quadrotor Navigation in Unknown Environments
Yifan Zhai
Rudolf Reiter
Davide Scaramuzza
157
2
0
18 Sep 2025
Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI
Fei Ni
Min Zhang
Pengyi Li
Yifu Yuan
Lingfeng Zhang
...
Yuzheng Zhuang
Yingxue Zhang
Yan Zheng
Hongyao Tang
Jianye Hao
ELM
198
1
0
18 Sep 2025
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
Xu Zheng
Chenfei Liao
Ziqiao Weng
Kaiyu Lei
Zihao Dongfang
...
D. Paudel
Kailun Yang
L. Zhang
Luc Van Gool
Xuming Hu
174
3
0
16 Sep 2025
Synthetic vs. Real Training Data for Visual Navigation
Lauri Suomela
Sasanka Kuruppu Arachchige
German F. Torres
Harry Edelman
Joni-Kristian Kämäräinen
108
2
0
15 Sep 2025
ParaEQsA: Parallel and Asynchronous Embodied Questions Scheduling and Answering
Haisheng Wang
Weiming Zhi
108
0
0
15 Sep 2025
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
Weipeng Zhong
Peizhou Cao
Yichen Jin
Li Ray Luo
Wenzhe Cai
...
Zhaoyang Lyu
Tai Wang
Bo Dai
Xudong Xu
Jiangmiao Pang
3DV
292
1
0
13 Sep 2025
OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning
Yuecheng Liu
Dafeng Chi
Shiguang Wu
Zhanguang Zhang
Yuzheng Zhuang
...
Pengwei Xie
David Gamaliel Arcos Bravo
Yingxue Zhang
Jianye Hao
Xingyue Quan
LM&Ro
LRM
190
2
0
11 Sep 2025
ObjectReact: Learning Object-Relative Control for Visual Navigation
Sourav Garg
Dustin Craggs
Vineeth Bhat
Lachlan Mares
Stefan Podgorski
Madhava Krishna
Feras Dayoub
Ian Reid
140
1
0
11 Sep 2025
TANGO: Traversability-Aware Navigation with Local Metric Control for Topological Goals
IEEE International Conference on Robotics and Automation (ICRA), 2025
Stefan Podgorski
Sourav Garg
M. Hosseinzadeh
Lachlan Mares
Feras Dayoub
Ian Reid
168
3
0
10 Sep 2025
OpenGuide: Assistive Object Retrieval in Indoor Spaces for Individuals with Visual Impairments
Yifan Xu
Qianwei Wang
V. Kamat
Carol Menassa
146
0
0
02 Sep 2025
TopoNav: Topological Graphs as a Key Enabler for Advanced Object Navigation
Peiran Liu
Qiang Zhang
Daojie Peng
Lingfeng Zhang
Yihao Qin
Hang Zhou
Jun Ma
Zhanchen Zhu
Yiding Ji
127
4
0
01 Sep 2025
ActLoc: Learning to Localize on the Move via Active Viewpoint Selection
Jiajie Li
Boyang Sun
Luca Di Giammarino
Hermann Blum
Marc Pollefeys
108
0
0
28 Aug 2025
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Shouwei Ruan
Liyuan Wang
Caixin Kang
Qihui Zhu
Songming Liu
Xingxing Wei
Hang Su
LM&Ro
147
5
0
24 Aug 2025
SIGN: Safety-Aware Image-Goal Navigation for Autonomous Drones via Reinforcement Learning
Zichen Yan
Rui Huang
Lei He
Shao Guo
Tianyuan Chen
155
1
0
17 Aug 2025
Distilling LLM Prior to Flow Model for Generalizable Agent's Imagination in Object Goal Navigation
B. Li
Ren-jie Lu
Yu Zhou
Jingke Meng
Wei-Shi Zheng
193
0
0
13 Aug 2025
Imaginative World Modeling with Scene Graphs for Embodied Agent Navigation
Yue Hu
Junzhe Wu
Ruihan Xu
Hang Liu
Avery Xi
Henry X. Liu
Ram Vasudevan
Maani Ghaffari
LM&Ro
132
2
0
09 Aug 2025
SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps
Haojun Xu
Jiaqi Xiang
Wu Wei
Jinyu Chen
Linqing Zhong
Linjiang Huang
Hongyu Yang
Si Liu
156
0
0
05 Aug 2025
NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks
Zhihao Luo
Wentao Yan
Jingyu Gong
Min Wang
Zhizhong Zhang
Xuhong Wang
Yuan Xie
Xin Tan
202
5
0
04 Aug 2025
VPN: Visual Prompt Navigation
Shuo Feng
Zihan Wang
Yuchen Li
Rui Kong
Hengyi Cai
Shuaiqiang Wang
Gim Hee Lee
Piji Li
Shuqiang Jiang
249
0
0
03 Aug 2025
The Missing Parts: Augmenting Fact Verification with Half-Truth Detection
Yixuan Tang
Jincheng Wang
A. Tung
HILM
207
10
0
01 Aug 2025
PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction
Jiahui Ren
Mochu Xiang
Jiajun Zhu
Yuchao Dai
125
1
0
29 Jul 2025
Recursive Visual Imagination and Adaptive Linguistic Grounding for Vision Language Navigation
Bolei Chen
Jiaxu Kang
Yifei Wang
Ping Zhong
Qi Wu
Jianxin Wang
LM&Ro
109
0
0
29 Jul 2025
LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments
Junhao Chen
Zhen Zhang
Chengrui Zhu
Xiaojun Hou
T. Hu
Huifeng Wu
Y. Liu
3DV
123
0
0
29 Jul 2025
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
Liuyi Wang
Xinyuan Xia
Hui Zhao
Hanqing Wang
Tai Wang
Yilun Chen
Chengju Liu
Qijun Chen
Jiangmiao Pang
LM&Ro
208
3
0
17 Jul 2025
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation
Zhen Xu
Hongyu Zhou
Sida Peng
Haotong Lin
Haoyu Guo
...
Yue Wang
Ruizhen Hu
Yiyi Liao
Xiaowei Zhou
Hujun Bao
VLM
177
3
0
15 Jul 2025
GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding
Zijun Lin
Shuting He
Cheston Tan
Bihan Wen
AI4TS
307
2
0
26 Jun 2025
GeNIE: A Generalizable Navigation System for In-the-Wild Environments
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Jiaming Wang
Diwen Liu
Jizhuo Chen
Jiaxuan Da
Nuowen Qian
Tram Minh Man
Harold Soh
3DV
176
1
0
22 Jun 2025
General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting
Bernard Lange
Anil Yildiz
Mansur Arief
Shehryar Khattak
Mykel J. Kochenderfer
Georgios Georgakis
LM&Ro
158
1
0
20 Jun 2025
Co-VisiON: Co-Visibility ReasONing on Sparse Image Sets of Indoor Scenes
Chao-Yeh Chen
Nobel Dang
Juexiao Zhang
Wenkai Sun
Pengfei Zheng
Xuhang He
Yimeng Ye
Taarun Srinivas
Chen Feng
3DV
358
0
0
20 Jun 2025
Uncertainty-Informed Active Perception for Open Vocabulary Object Goal Navigation
European Conference on Mobile Robots (ECMR), 2025
Utkarsh Bajpai
Julius Ruckin
Cyrill Stachniss
Marija Popović
272
0
0
16 Jun 2025
EQA-RM: A Generative Embodied Reward Model with Test-time Scaling
Yuhang Chen
Zhen Tan
Tianlong Chen
381
1
0
12 Jun 2025
LEO-VL: Efficient Scene Representation for Scalable 3D Vision-Language Learning
J. Huang
Xiaojian Ma
Xiongkun Linghu
Yue Fan
Junchao He
...
Qing Li
Song-Chun Zhu
Yixin Chen
Baoxiong Jia
Siyuan Huang
287
2
0
11 Jun 2025
OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization
Yixuan Yang
Zhen Luo
Tongsheng Ding
Junru Lu
Mingqi Gao
Jinyu Yang
Victor Sanchez
Feng Zheng
3DV
258
0
0
09 Jun 2025
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations
Linjie Li
Mahtab Bigverdi
Jiawei Gu
Zixian Ma
Yinuo Yang
Ziang Li
Yejin Choi
Ranjay Krishna
LRM
230
8
0
05 Jun 2025
ArtVIP: Articulated Digital Assets of Visual Realism, Modular Interaction, and Physical Fidelity for Robot Learning
Zhao Jin
Zhengping Che
Zhen Zhao
Kun Wu
Yuheng Zhang
...
Qiang Zhang
Xiaozhu Ju
Jing Tian
Yousong Xue
Jian Tang
VGen
379
4
0
05 Jun 2025
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models
Arnab Debnath
Gregory J. Stein
Jana Kosecka
LM&Ro
256
1
0
04 Jun 2025
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Junjie Li
Nan Zhang
Xiaoyang Qu
Kai Lu
Guokuan Li
Jiguang Wan
Jianzong Wang
277
2
0
03 Jun 2025
GraphPad: Inference-Time 3D Scene Graph Updates for Embodied Question Answering
Muhammad Qasim Ali
Saeejith Nair
Alexander Wong
Yuchen Cui
Yuhao Chen
231
1
0
01 Jun 2025
Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration
Zeying Gong
Rong Li
Tianshuai Hu
Ronghe Qiu
Lingdong Kong
Lingfeng Zhang
Yiyi Ding
Leying Zhang
Junwei Liang
475
0
0
29 May 2025
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation
Tianjun Gu
Linfeng Li
Xuhong Wang
Chenghua Gong
Jingyu Gong
Zhizhong Zhang
Yuan Xie
Lizhuang Ma
Xin Tan
LM&Ro
517
1
0
28 May 2025
SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes
Dicong Qiu
Jiadi You
Zeying Gong
Ronghe Qiu
Hui Xiong
Junwei Liang
177
0
0
24 May 2025
SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence
Jiabin Chen
Haiping Wang
Jinpeng Li
Yuan Liu
Zhen Dong
Bisheng Yang
391
2
0
19 May 2025
Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild
Derek Ming Siang Tan
Shailesh
Boyang Liu
Alok Raj
Qi Xuan Ang
...
Tanishq Duhan
Jimmy Chiun
Yuhong Cao
Florian Shkurti
Guillaume Sartoretti
675
1
0
16 May 2025
Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities
Zachary Ravichandran
Fernando Cladera
Jason Hughes
Varun Murali
M. Hsieh
George J. Pappas
Camillo J Taylor
Vijay Kumar
LM&Ro
327
2
0
14 May 2025