Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.02209
Cited By
Building Generalizable Agents with a Realistic and Rich 3D Environment
7 January 2018
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Building Generalizable Agents with a Realistic and Rich 3D Environment"
50 / 225 papers shown
Title
HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Yifei Dong
Fengyi Wu
Qi He
Heng Li
Minghan Li
...
Yuxuan Zhou
Jingdong Sun
Qi Dai
Zhi-Qi Cheng
Alexander G. Hauptmann
LM&Ro
38
0
0
18 Mar 2025
Robotic Sim-to-Real Transfer for Long-Horizon Pick-and-Place Tasks in the Robotic Sim2Real Competition
Ming Yang
Hongyu Cao
Lixuan Zhao
Chenrui Zhang
Yaran Chen
44
0
0
14 Mar 2025
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Fangwei Zhong
Kui Wu
Churan Wang
Hao Chen
Hai Ci
Zhoujun Li
Yizhou Wang
VGen
38
0
0
31 Dec 2024
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation
Umamaheswaran Raman Kumar
A. Fayjie
Jurgen Hannaert
Patrick Vandewalle
3DV
3DPC
75
1
0
20 Nov 2024
Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings
Aditya Sanghi
Aliasghar Khani
Pradyumna Reddy
Arianna Rampini
Derek Cheung
Kamal Rahimi Malekshan
Kanika Madan
Hooman Shayani
34
3
0
12 Nov 2024
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Ye Tian
Yue Yang
Kaixin Ma
Xiaoman Pan
Yangqiu Song
Dong Yu
LM&Ro
33
3
0
03 Oct 2024
Multi-modal Situated Reasoning in 3D Scenes
Xiongkun Linghu
Jiangyong Huang
Xuesong Niu
Xiaojian Ma
Baoxiong Jia
Siyuan Huang
34
11
0
04 Sep 2024
Narrowing the Gap between Vision and Action in Navigation
Yue Zhang
Parisa Kordjamshidi
26
2
0
19 Aug 2024
Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City Navigation without Instructions
Qingbin Zeng
Qinglong Yang
Shunan Dong
Heming Du
Liang Zheng
Fengli Xu
Yong Li
LLMAG
LM&Ro
31
8
0
08 Aug 2024
3D Question Answering for City Scene Understanding
Penglei Sun
Yaoxian Song
Xiang Liu
Xiaofei Yang
Qiang-qiang Wang
Tiefeng Li
Yang Yang
Xiaowen Chu
16
0
0
24 Jul 2024
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
Jiefu Ou
Arda Uzunoglu
Benjamin Van Durme
Daniel Khashabi
LM&Ro
VGen
30
3
0
10 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Y. Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
48
47
0
09 Jul 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
Minghan Li
Heng Li
Zhi-Qi Cheng
Yifei Dong
Yuxuan Zhou
Jun-Yan He
Qi Dai
Teruko Mitamura
Alexander G. Hauptmann
LM&Ro
35
4
0
27 Jun 2024
Map-based Modular Approach for Zero-shot Embodied Question Answering
Koya Sakamoto
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
25
3
0
26 May 2024
Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation: Do We Need Artificial Augmentation?
D. Ignatov
Andrey D. Ignatov
Radu Timofte
MDE
32
3
0
15 Apr 2024
Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes
Ali Younis
Erik B. Sudderth
28
4
0
12 Apr 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
34
3
0
12 Apr 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
30
8
0
18 Mar 2024
Language to Map: Topological map generation from natural language path instructions
Hideki Deguchi
Kazuki Shibata
Shun Taguchi
26
3
0
15 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
35
2
0
22 Feb 2024
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
Dennis Hoftijzer
Gertjan J. Burghouts
Luuk J. Spreeuwers
13
1
0
07 Feb 2024
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments
Qinhong Zhou
Sunli Chen
Yisong Wang
Haozhe Xu
Weihua Du
Hongxin Zhang
Yilun Du
Josh Tenenbaum
Chuang Gan
AI4CE
20
12
0
23 Jan 2024
Make-A-Shape: a Ten-Million-scale 3D Shape Model
Ka-Hei Hui
Aditya Sanghi
Arianna Rampini
Kamal Rahimi Malekshan
Zhengzhe Liu
Hooman Shayani
Chi-Wing Fu
DiffM
21
17
0
20 Jan 2024
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
Jijia Liu
Chao Yu
Jiaxuan Gao
Yuqing Xie
Qingmin Liao
Yi Wu
Yu Wang
LLMAG
LM&Ro
82
35
0
23 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
13
2
0
05 Dec 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Jingkang Yang
Yuhao Dong
Shuai Liu
Bo-wen Li
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
44
45
0
12 Oct 2023
FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests
Ali Abdari
Alex Falcon
Giuseppe Serra
30
2
0
06 Sep 2023
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Ruipu Luo
Jiwen Zhang
Zhongyu Wei
VLM
16
0
0
16 Jul 2023
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
Xiwen Liang
Liang Ma
Shanshan Guo
Jianhua Han
Hang Xu
Shikui Ma
Xiaodan Liang
LM&Ro
LLMAG
79
4
0
17 Jun 2023
Estimating Generic 3D Room Structures from 2D Annotations
D. Rozumnyi
S. Popov
Kevis-Kokitsi Maninis
Matthias Nießner
V. Ferrari
3DV
3DPC
11
6
0
15 Jun 2023
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning
Kibeom Kim
Hyun-Dong Lee
Min Whoo Lee
Moonheon Lee
Minsu Lee
Byoung-Tak Zhang
18
1
0
23 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
27
94
0
18 May 2023
Modality-invariant Visual Odometry for Embodied Vision
Marius Memmel
Roman Bachmann
Amir Zamir
54
8
0
29 Apr 2023
USA-Net: Unified Semantic and Affordance Representations for Robot Memory
Benjamin Bolte
Austin S. Wang
Jimmy Yang
Mustafa Mukadam
Mrinal Kalakrishnan
Chris Paxton
3DV
LM&Ro
19
13
0
24 Apr 2023
Human Pose Estimation in Monocular Omnidirectional Top-View Images
Jingrui Yu
Tobias Scheck
Roman Seidel
Yukti Adya
Dipankar Nandi
G. Hirtz
30
3
0
17 Apr 2023
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision
Jiahui Lei
Congyue Deng
Karl Schmeckpeper
Leonidas J. Guibas
Kostas Daniilidis
3DPC
24
21
0
27 Mar 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
17
19
0
07 Mar 2023
Analyzing Effects of Fake Training Data on the Performance of Deep Learning Systems
Pratinav Seth
Akshat Bhandari
Kumud Lakara
15
0
0
02 Mar 2023
A Short Survey of Systematic Generalization
Yuanpeng Li
AI4CE
22
1
0
22 Nov 2022
Scalable Modular Synthetic Data Generation for Advancing Aerial Autonomy
Mehrnaz Sabet
Praveen Palanisamy
Sakshi Mishra
20
4
0
10 Nov 2022
Towards Versatile Embodied Navigation
H. Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
42
20
0
30 Oct 2022
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu
Tom Zahavy
Volodymyr Mnih
Satinder Singh
SSL
25
7
0
19 Oct 2022
On the Learning Mechanisms in Physical Reasoning
Shiqian Li
Ke Wu
Chi Zhang
Yixin Zhu
AI4CE
44
13
0
05 Oct 2022
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Kanishk Jain
Varun Chhangani
Amogh Tiwari
K. M. Krishna
Vineet Gandhi
LM&Ro
16
27
0
24 Sep 2022
Active Particle Filter Networks: Efficient Active Localization in Continuous Action Spaces and Large Maps
Daniel Honerkamp
Suresh Guttikonda
Abhinav Valada
25
2
0
20 Sep 2022
Meta-simulation for the Automated Design of Synthetic Overhead Imagery
Handi Yu
Simiao Ren
L. Collins
Jordan M. Malof
11
1
0
19 Sep 2022
Monocular Camera-based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning
Jianchuan Ding
Lingping Gao
Wenxi Liu
Haiyin Piao
Jia-Yu Pan
Z. Du
Xin Yang
Baocai Yin
9
12
0
01 Sep 2022
A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes
Tianjiao Zhang
Yuen-Fui Lau
Qifeng Chen
22
4
0
30 Aug 2022
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning
Vasu Sharma
Prasoon Goyal
Kaixiang Lin
Govind Thattai
Qiaozi Gao
Gaurav Sukhatme
15
5
0
26 Aug 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
28
234
0
14 Jun 2022
1
2
3
4
5
Next