ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.02209
  4. Cited By
Building Generalizable Agents with a Realistic and Rich 3D Environment

Building Generalizable Agents with a Realistic and Rich 3D Environment

7 January 2018
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
    3DV
ArXivPDFHTML

Papers citing "Building Generalizable Agents with a Realistic and Rich 3D Environment"

50 / 225 papers shown
Title
HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Yifei Dong
Fengyi Wu
Qi He
Heng Li
Minghan Li
...
Yuxuan Zhou
Jingdong Sun
Qi Dai
Zhi-Qi Cheng
Alexander G. Hauptmann
LM&Ro
38
0
0
18 Mar 2025
Robotic Sim-to-Real Transfer for Long-Horizon Pick-and-Place Tasks in the Robotic Sim2Real Competition
Ming Yang
Hongyu Cao
Lixuan Zhao
Chenrui Zhang
Yaran Chen
44
0
0
14 Mar 2025
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Fangwei Zhong
Kui Wu
Churan Wang
Hao Chen
Hai Ci
Zhoujun Li
Yizhou Wang
VGen
38
0
0
31 Dec 2024
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D
  Point Cloud Semantic Segmentation
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation
Umamaheswaran Raman Kumar
A. Fayjie
Jurgen Hannaert
Patrick Vandewalle
3DV
3DPC
75
1
0
20 Nov 2024
Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model
  with Compact Wavelet Encodings
Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings
Aditya Sanghi
Aliasghar Khani
Pradyumna Reddy
Arianna Rampini
Derek Cheung
Kamal Rahimi Malekshan
Kanika Madan
Hooman Shayani
34
3
0
12 Nov 2024
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes
  and Objects
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Ye Tian
Yue Yang
Kaixin Ma
Xiaoman Pan
Yangqiu Song
Dong Yu
LM&Ro
33
3
0
03 Oct 2024
Multi-modal Situated Reasoning in 3D Scenes
Multi-modal Situated Reasoning in 3D Scenes
Xiongkun Linghu
Jiangyong Huang
Xuesong Niu
Xiaojian Ma
Baoxiong Jia
Siyuan Huang
34
11
0
04 Sep 2024
Narrowing the Gap between Vision and Action in Navigation
Narrowing the Gap between Vision and Action in Navigation
Yue Zhang
Parisa Kordjamshidi
26
2
0
19 Aug 2024
Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City
  Navigation without Instructions
Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City Navigation without Instructions
Qingbin Zeng
Qinglong Yang
Shunan Dong
Heming Du
Liang Zheng
Fengli Xu
Yong Li
LLMAG
LM&Ro
31
8
0
08 Aug 2024
3D Question Answering for City Scene Understanding
3D Question Answering for City Scene Understanding
Penglei Sun
Yaoxian Song
Xiang Liu
Xiaofei Yang
Qiang-qiang Wang
Tiefeng Li
Yang Yang
Xiaowen Chu
16
0
0
24 Jul 2024
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
Jiefu Ou
Arda Uzunoglu
Benjamin Van Durme
Daniel Khashabi
LM&Ro
VGen
30
3
0
10 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on
  Embodied AI
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Y. Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
48
47
0
09 Jul 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to
  Reality with Dynamic Human Interactions
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
Minghan Li
Heng Li
Zhi-Qi Cheng
Yifei Dong
Yuxuan Zhou
Jun-Yan He
Qi Dai
Teruko Mitamura
Alexander G. Hauptmann
LM&Ro
35
4
0
27 Jun 2024
Map-based Modular Approach for Zero-shot Embodied Question Answering
Map-based Modular Approach for Zero-shot Embodied Question Answering
Koya Sakamoto
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
25
3
0
26 May 2024
Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation:
  Do We Need Artificial Augmentation?
Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation: Do We Need Artificial Augmentation?
D. Ignatov
Andrey D. Ignatov
Radu Timofte
MDE
32
3
0
15 Apr 2024
Differentiable and Stable Long-Range Tracking of Multiple Posterior
  Modes
Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes
Ali Younis
Erik B. Sudderth
28
4
0
12 Apr 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia
  Sensor Event Analysis
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
34
3
0
12 Apr 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
30
8
0
18 Mar 2024
Language to Map: Topological map generation from natural language path
  instructions
Language to Map: Topological map generation from natural language path instructions
Hideki Deguchi
Kazuki Shibata
Shun Taguchi
26
3
0
15 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
27
2
0
22 Feb 2024
Language-Based Augmentation to Address Shortcut Learning in Object Goal
  Navigation
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
Dennis Hoftijzer
Gertjan J. Burghouts
Luuk J. Spreeuwers
13
1
0
07 Feb 2024
HAZARD Challenge: Embodied Decision Making in Dynamically Changing
  Environments
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments
Qinhong Zhou
Sunli Chen
Yisong Wang
Haozhe Xu
Weihua Du
Hongxin Zhang
Yilun Du
Josh Tenenbaum
Chuang Gan
AI4CE
20
12
0
23 Jan 2024
Make-A-Shape: a Ten-Million-scale 3D Shape Model
Make-A-Shape: a Ten-Million-scale 3D Shape Model
Ka-Hei Hui
Aditya Sanghi
Arianna Rampini
Kamal Rahimi Malekshan
Zhengzhe Liu
Hooman Shayani
Chi-Wing Fu
DiffM
21
17
0
20 Jan 2024
LLM-Powered Hierarchical Language Agent for Real-time Human-AI
  Coordination
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
Jijia Liu
Chao Yu
Jiaxuan Gao
Yuqing Xie
Qingmin Liao
Yi Wu
Yu Wang
LLMAG
LM&Ro
82
35
0
23 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
13
2
0
05 Dec 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Jingkang Yang
Yuhao Dong
Shuai Liu
Bo-wen Li
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
39
45
0
12 Oct 2023
FArMARe: a Furniture-Aware Multi-task methodology for Recommending
  Apartments based on the user interests
FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests
Ali Abdari
Alex Falcon
Giuseppe Serra
30
2
0
06 Sep 2023
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for
  Vision and Language Decision Making
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Ruipu Luo
Jiwen Zhang
Zhongyu Wei
VLM
16
0
0
16 Jul 2023
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot
  Vision-and-Language Navigation
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
Xiwen Liang
Liang Ma
Shanshan Guo
Jianhua Han
Hang Xu
Shikui Ma
Xiaodan Liang
LM&Ro
LLMAG
79
4
0
17 Jun 2023
Estimating Generic 3D Room Structures from 2D Annotations
Estimating Generic 3D Room Structures from 2D Annotations
D. Rozumnyi
S. Popov
Kevis-Kokitsi Maninis
Matthias Nießner
V. Ferrari
3DV
3DPC
11
6
0
15 Jun 2023
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement
  Learning
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning
Kibeom Kim
Hyun-Dong Lee
Min Whoo Lee
Moonheon Lee
Minsu Lee
Byoung-Tak Zhang
18
1
0
23 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language
  Models
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
27
94
0
18 May 2023
Modality-invariant Visual Odometry for Embodied Vision
Modality-invariant Visual Odometry for Embodied Vision
Marius Memmel
Roman Bachmann
Amir Zamir
54
8
0
29 Apr 2023
USA-Net: Unified Semantic and Affordance Representations for Robot
  Memory
USA-Net: Unified Semantic and Affordance Representations for Robot Memory
Benjamin Bolte
Austin S. Wang
Jimmy Yang
Mustafa Mukadam
Mrinal Kalakrishnan
Chris Paxton
3DV
LM&Ro
19
13
0
24 Apr 2023
Human Pose Estimation in Monocular Omnidirectional Top-View Images
Human Pose Estimation in Monocular Omnidirectional Top-View Images
Jingrui Yu
Tobias Scheck
Roman Seidel
Yukti Adya
Dipankar Nandi
G. Hirtz
30
3
0
17 Apr 2023
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object
  Segmentation Without Scene Supervision
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision
Jiahui Lei
Congyue Deng
Karl Schmeckpeper
Leonidas J. Guibas
Kostas Daniilidis
3DPC
24
21
0
27 Mar 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation
  Using Scene Object Spectrum Grounding
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
17
19
0
07 Mar 2023
Analyzing Effects of Fake Training Data on the Performance of Deep
  Learning Systems
Analyzing Effects of Fake Training Data on the Performance of Deep Learning Systems
Pratinav Seth
Akshat Bhandari
Kumud Lakara
13
0
0
02 Mar 2023
A Short Survey of Systematic Generalization
A Short Survey of Systematic Generalization
Yuanpeng Li
AI4CE
22
1
0
22 Nov 2022
Scalable Modular Synthetic Data Generation for Advancing Aerial Autonomy
Scalable Modular Synthetic Data Generation for Advancing Aerial Autonomy
Mehrnaz Sabet
Praveen Palanisamy
Sakshi Mishra
18
4
0
10 Nov 2022
Towards Versatile Embodied Navigation
Towards Versatile Embodied Navigation
H. Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
40
20
0
30 Oct 2022
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu
Tom Zahavy
Volodymyr Mnih
Satinder Singh
SSL
25
7
0
19 Oct 2022
On the Learning Mechanisms in Physical Reasoning
On the Learning Mechanisms in Physical Reasoning
Shiqian Li
Ke Wu
Chi Zhang
Yixin Zhu
AI4CE
44
13
0
05 Oct 2022
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Kanishk Jain
Varun Chhangani
Amogh Tiwari
K. M. Krishna
Vineet Gandhi
LM&Ro
16
27
0
24 Sep 2022
Active Particle Filter Networks: Efficient Active Localization in
  Continuous Action Spaces and Large Maps
Active Particle Filter Networks: Efficient Active Localization in Continuous Action Spaces and Large Maps
Daniel Honerkamp
Suresh Guttikonda
Abhinav Valada
25
2
0
20 Sep 2022
Meta-simulation for the Automated Design of Synthetic Overhead Imagery
Meta-simulation for the Automated Design of Synthetic Overhead Imagery
Handi Yu
Simiao Ren
L. Collins
Jordan M. Malof
11
1
0
19 Sep 2022
Monocular Camera-based Complex Obstacle Avoidance via Efficient Deep
  Reinforcement Learning
Monocular Camera-based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning
Jianchuan Ding
Lingping Gao
Wenxi Liu
Haiyin Piao
Jia-Yu Pan
Z. Du
Xin Yang
Baocai Yin
9
12
0
01 Sep 2022
A Portable Multiscopic Camera for Novel View and Time Synthesis in
  Dynamic Scenes
A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes
Tianjiao Zhang
Yuen-Fui Lau
Qifeng Chen
22
4
0
30 Aug 2022
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous
  Multi-Agent Reinforcement Learning
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning
Vasu Sharma
Prasoon Goyal
Kaixiang Lin
Govind Thattai
Qiaozi Gao
Gaurav Sukhatme
15
5
0
26 Aug 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
28
234
0
14 Jun 2022
12345
Next