2109.08238
Cited By
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
16 September 2021
Santhosh Kumar Ramakrishnan
Aaron Gokaslan
Erik Wijmans
Oleksandr Maksymets
Alexander Clegg
John Turner
Eric Undersander
Wojciech Galuba
Andrew Westbury
Angel X. Chang
Manolis Savva
Yili Zhao
Dhruv Batra
Papers citing
"Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI"
50 / 373 papers shown
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
Chuanneng Sun
Songjun Huang
D. Pompili
LLMAG
286
60
0
17 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
280
30
0
16 May 2024
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
Computer Vision and Pattern Recognition (CVPR), 2024
Yunhao Ge
Yihe Tang
Lyne Tchapmi
Cem Gokmen
Chengshu Li
...
Miao Liu
Pengchuan Zhang
Ruohan Zhang
Fei-Fei Li
Jiajun Wu
VGen
154
11
0
15 May 2024
Learning Latent Dynamic Robust Representations for World Models
International Conference on Machine Learning (ICML), 2024
Ruixiang Sun
Hongyu Zang
Xin-hui Li
Riashat Islam
195
11
0
10 May 2024
An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas
Mira Slavcheva
Dave Gausebeck
Kevin Chen
David Buchhofer
Azwad Sabik
Chen Ma
Sachal Dhillon
Olaf Brandt
Alan Dolhasz
136
8
0
06 May 2024
PhilHumans: Benchmarking Machine Learning for Personal Health
Vadim Liventsev
Vivek Kumar
Allmin Pradhap Singh Susaiyah
Zixiu "Alex" Wu
Ivan Rodin
...
Milan Petkovic
Diego Reforgiato Recupero
Ehud Reiter
Daniele Riboni
Raymond Sterling
AI4MH
LM&MA
207
0
0
04 May 2024
CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications
J. Blumenkamp
Steven D. Morad
Jennifer Gielis
Amanda Prorok
257
6
0
02 May 2024
Following the Human Thread in Social Navigation
Luca Scofano
Alessio Sampieri
Tommaso Campari
Valentino Sacco
Indro Spinelli
Lamberto Ballan
Yuta Kyuragi
287
3
0
17 Apr 2024
Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration
Jing Zeng
Yanxu Li
Jiahao Sun
Qi Ye
Yunlong Ran
Jiming Chen
151
5
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
180
4
0
15 Apr 2024
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
Computer Vision and Pattern Recognition (CVPR), 2024
Mukul Khanna
Ram Ramrakhya
Gunjan Chhablani
Sriram Yenamandra
Théophile Gervet
Matthew Chang
Z. Kira
Devendra Singh Chaplot
Dhruv Batra
Roozbeh Mottaghi
LM&Ro
196
59
0
09 Apr 2024
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
International Conference on Learning Representations (ICLR), 2024
Francis Engelmann
Fabian Manhardt
Michael Niemeyer
Keisuke Tateno
Marc Pollefeys
Federico Tombari
VLM
237
49
1
04 Apr 2024
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2024
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Junjie Hu
Ming Jiang
Shuqiang Jiang
180
44
0
02 Apr 2024
PRISM-TopoMap: Online Topological Mapping with Place Recognition and Scan Matching
IEEE Robotics and Automation Letters (RA-L), 2024
K. Muravyev
Alexander Melekhin
Dmitriy Yudin
Konstantin Yakovlev
409
7
0
02 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakharov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Andrei Ambrus
ViT
238
20
0
01 Apr 2024
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark
Ziyang Chen
I. D. Gebru
Christian Richardt
Anurag Kumar
William Laney
Andrew Owens
Alexander Richard
184
34
0
27 Mar 2024
Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
P. Sadler
Sherzod Hakimov
David Schlangen
LLMAG
140
4
0
26 Mar 2024
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
Ganlong Zhao
Guanbin Li
Weikai Chen
Yizhou Yu
212
13
0
26 Mar 2024
Explore until Confident: Efficient Exploration for Embodied Question Answering
Allen Z. Ren
Jaden Clark
Anushri Dixit
Masha Itkina
Anirudha Majumdar
Dorsa Sadigh
346
60
0
23 Mar 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Mu Hu
Wei Yin
C. Zhang
Yong Deng
Xiaoxiao Long
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
464
292
0
22 Mar 2024
Leveraging Large Language Model-based Room-Object Relationships Knowledge for Enhancing Multimodal-Input Object Goal Navigation
Leyuan Sun
Asako Kanezaki
Guillaume Caron
Yusuke Yoshiyasu
LM&Ro
184
7
0
21 Mar 2024
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding
European Conference on Computer Vision (ECCV), 2024
Qirui Wu
Sonia Raychaudhuri
Daniel E. Ritchie
Manolis Savva
Angel X. Chang
3DPC
156
3
0
18 Mar 2024
SceneSense: Diffusion Models for 3D Occupancy Synthesis from Partial Observation
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Alec Reed
Brendan Crowe
Doncey Albin
Lorin Achey
Bradley Hayes
Christoffer Heckman
DiffM
182
4
0
18 Mar 2024
Agent3D-Zero: An Agent for Zero-shot 3D Understanding
European Conference on Computer Vision (ECCV), 2024
Sha Zhang
Di Huang
Jiajun Deng
Weizhen He
Wanli Ouyang
Tong He
Yanyong Zhang
VGen
141
29
0
18 Mar 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
European Conference on Computer Vision (ECCV), 2024
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
198
16
0
18 Mar 2024
Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Vishnu Sashank Dorbala
Sanjoy Chowdhury
Dinesh Manocha
LM&Ro
320
6
0
18 Mar 2024
Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Rao Fu
Jingyu Liu
Xilun Chen
Yixin Nie
Wenhan Xiong
LM&Ro
LRM
201
131
0
18 Mar 2024
Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer
Mengying Lin
Shugao Liu
Dong Zhao
Yaran Chen
Zhaoran Wang
Haoran Li
Dongbin Zhao
381
6
0
15 Mar 2024
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition
IEEE International Conference on Robotics and Automation (ICRA), 2024
Roberto Bigazzi
Lorenzo Baraldi
Shreyas Kousik
Rita Cucchiara
Marco Pavone
130
4
0
11 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
Il-Chul Moon
214
10
0
02 Mar 2024
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
Ji Ma
Hongming Dai
Yao Mu
Pengying Wu
Hao Wang
Yatian Wang
Yang Fei
Shanghang Zhang
Chang-rui Liu
204
8
0
29 Feb 2024
Opening Articulated Structures in the Real World
Arjun Gupta
Michelle Zhang
Rishik Sathua
Saurabh Gupta
239
1
0
27 Feb 2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu
Junting Chen
Qinglong Zhang
Shoufa Chen
Qiaojun Yu
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Mingyu Ding
Ping Luo
216
44
0
25 Feb 2024
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
X. Lei
Min Wang
Wen-gang Zhou
Li Li
Houqiang Li
227
16
0
25 Feb 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Dongbin Zhao
Wang He
LM&Ro
491
137
0
24 Feb 2024
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation
Hanxiao Jiang
Binghao Huang
Ruihai Wu
Zhuoran Li
Shubham Garg
H. Nayyeri
Shenlong Wang
Yunzhu Li
231
45
0
23 Feb 2024
Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps
G. Monaci
L. Antsfeld
Boris Chidlovskii
Christian Wolf
167
0
0
21 Feb 2024
Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality
Christian Marinoni
R. F. Gramaccioni
Changan Chen
A. Uncini
Danilo Comminiello
120
7
0
14 Feb 2024
Unsupervised Discovery of Object-Centric Neural Fields
Rundong Luo
Hong-Xing Yu
Jiajun Wu
3DPC
OCL
339
7
0
12 Feb 2024
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
Dennis Hoftijzer
Gertjan J. Burghouts
Luuk J. Spreeuwers
193
3
0
07 Feb 2024
Belief Scene Graphs: Expanding Partial Scenes with Objects through Computation of Expectation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Mario A. V. Saucedo
Akash Patel
Akshit Saradagi
Christoforos Kanellakis
G. Nikolakopoulos
121
6
0
06 Feb 2024
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen
Oier Mees
Aviral Kumar
Sergey Levine
VLM
LM&Ro
303
41
0
05 Feb 2024
Learning to navigate efficiently and precisely in real environments
Computer Vision and Pattern Recognition (CVPR), 2024
G. Bono
Hervé Poirier
L. Antsfeld
G. Monaci
Boris Chidlovskii
Christian Wolf
202
5
0
25 Jan 2024
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
Computer Vision and Pattern Recognition (CVPR), 2024
Yining Hong
Zishuo Zheng
Peihao Chen
Yian Wang
Junyan Li
Chuang Gan
189
49
0
16 Jan 2024
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model
Pengying Wu
Yao Mu
Bingxian Wu
Yi Hou
Ji Ma
Shanghang Zhang
Chang-rui Liu
LM&Ro
190
59
0
05 Jan 2024
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
Tai Wang
Xiaohan Mao
Chenming Zhu
Runsen Xu
Ruiyuan Lyu
...
Tianfan Xue
Xihui Liu
Cewu Lu
Dahua Lin
Jiangmiao Pang
LM&Ro
223
124
0
26 Dec 2023
360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception
Zhijie Shen
Chunyu Lin
Junsong Zhang
Lang Nie
K. Liao
Yao Zhao
77
7
0
26 Dec 2023
LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset
Haolin Liu
Chongjie Ye
Y. Nie
Yingfan He
Xiaoguang Han
3DV
176
6
0
19 Dec 2023
Holodeck: Language Guided Generation of 3D Embodied AI Environments
Computer Vision and Pattern Recognition (CVPR), 2024
Yue Yang
Fan-Yun Sun
Luca Weihs
Eli VanderBilt
Alvaro Herrasti
...
Lingjie Liu
Chris Callison-Burch
Mark Yatskar
Aniruddha Kembhavi
Christopher Clark
LM&Ro
367
170
0
14 Dec 2023
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation
Xiaobo Hu
Youfang Lin
Hehe Fan
Shuo Wang
Zhihao Wu
Kai Lv
205
9
0
06 Dec 2023