Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

9 July 2024
Y. Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
    LM&Ro
    SyDa
    AI4CE

Papers citing "Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI"

50 / 52 papers shown
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian-jun Sun
Gang Wang
AI4CE
38
0
0
08 May 2025
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
43
0
0
01 May 2025
Generative AI in Embodied Systems: System-Level Analysis of Performance, Efficiency and Scalability
Zishen Wan
Jiayi Qian
Yuhang Du
Jason J. Jabbour
Yilun Du
Yang Katie Zhao
A. Raychowdhury
Tushar Krishna
Vijay Janapa Reddi
LM&Ro
86
0
0
26 Apr 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Y. Liu
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Liang Lin
47
2
0
14 Mar 2025
Air-Ground Collaborative Robots for Fire and Rescue Missions: Towards Mapping and Navigation Perspective
Ying Zhang
Haibao Yan
Danni Zhu
Jiankun Wang
Cui-Hua Zhang
Weili Ding
Xi Luo
C. Hua
M. Meng
AI4CE
36
1
0
30 Dec 2024
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Y. Liu
Weikai Chen
Guanbin Li
Liang Lin
108
3
0
12 Dec 2024
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
Rory Young
Nicolas Pugeault
AAML
27
0
0
14 Oct 2024
GRUtopia: Dream General Robots in a City at Scale
Hanqing Wang
Jiahe Chen
Wensi Huang
Qingwei Ben
Tai Wang
...
Ying Zhao
Zhongying Tu
Yu Qiao
Dahua Lin
Jiangmiao Pang
LM&Ro
VGen
39
1
0
15 Jul 2024
ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities
Chenming Zhu
Tai Wang
Wenwei Zhang
Kai Chen
Xihui Liu
ReLM
LRM
34
16
0
01 Jul 2024
IRASim: Learning Interactive Real-Robot Action Simulators
Fangqi Zhu
Hongtao Wu
Song Guo
Yuxiao Liu
Chilam Cheang
Tao Kong
72
11
0
20 Jun 2024
Embodied Question Answering via Multi-LLM Systems
Bhrij Patel
Vishnu Sashank Dorbala
Dinesh Manocha
Amrit Singh Bedi
47
1
0
16 Jun 2024
Pandora: Towards General World Model with Natural Language Actions and Video States
Jiannan Xiang
Guangyi Liu
Yi Gu
Qiyue Gao
Yuting Ning
...
Shibo Hao
Yemin Shi
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
VGen
48
4
0
12 Jun 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
29
21
0
19 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
76
35
0
06 May 2024
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
Yandan Yang
Baoxiong Jia
Peiyuan Zhi
Siyuan Huang
LM&Ro
VGen
33
41
0
15 Apr 2024
Explore until Confident: Efficient Exploration for Embodied Question Answering
Allen Z. Ren
Jaden Clark
Anushri Dixit
Masha Itkina
Anirudha Majumdar
Dorsa Sadigh
32
28
0
23 Mar 2024
3D-VLA: A 3D Vision-Language-Action Generative World Model
Haoyu Zhen
Xiaowen Qiu
Peihao Chen
Jincheng Yang
Xin Yan
Yilun Du
Yining Hong
Chuang Gan
LM&Ro
VGen
PINN
27
81
0
14 Mar 2024
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation
M. Torné
Anthony Simeonov
Zechu Li
April Chan
Tao Chen
Abhishek Gupta
Pulkit Agrawal
34
51
0
06 Mar 2024
PointMamba: A Simple State Space Model for Point Cloud Analysis
Dingkang Liang
Xin Zhou
Wei Xu
Xingkui Zhu
Zhikang Zou
Xiaoqing Ye
Xinyu Wang
Xiang Bai
77
87
0
16 Feb 2024
Reasoning Grasping via Multimodal Large Language Model
Shiyu Jin
Jinxuan Xu
Yutian Lei
Liangjun Zhang
LRM
26
19
0
09 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
26
17
0
05 Feb 2024
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Zipeng Fu
Tony Zhao
Chelsea Finn
98
98
0
04 Jan 2024
Point Transformer V3: Simpler, Faster, Stronger
Xiaoyang Wu
Li Jiang
Peng-Shuai Wang
Zhijian Liu
Xihui Liu
Yu Qiao
Wanli Ouyang
Tong He
Hengshuang Zhao
63
205
0
15 Dec 2023
Video Language Planning
Yilun Du
Mengjiao Yang
Peter R. Florence
Fei Xia
Ayzaan Wahid
...
Pieter Abbeel
Josh Tenenbaum
L. Kaelbling
Andy Zeng
Jonathan Tompson
PINN
LM&Ro
84
83
0
16 Oct 2023
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
104
51
0
18 Sep 2023
General In-Hand Object Rotation with Vision and Touch
Haozhi Qi
Brent Yi
Sudharshan Suresh
Mike Lambeta
Y. Ma
Roberto Calandra
Jitendra Malik
47
33
0
18 Sep 2023
Dynamic Planning with a LLM
Gautier Dagan
Frank Keller
A. Lascarides
LLMAG
80
34
0
11 Aug 2023
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
Dongyan An
H. Wang
Wenguan Wang
Zun Wang
Yan Huang
Keji He
Liang Wang
42
61
0
06 Apr 2023
Chat with the Environment: Interactive Multimodal Perception Using Large Language Models
Xufeng Zhao
Mengdi Li
C. Weber
Muhammad Burhan Hafez
S. Wermter
LLMAG
LM&Ro
LRM
85
32
0
14 Mar 2023
NeU-NBV: Next Best View Planning Using Uncertainty Estimation in Image-Based Neural Rendering
Liren Jin
Xieyuanli Chen
Julius Ruckin
Marija Popović
36
29
0
02 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Yangfan Zhan
Zhitong Xiong
Yuan Yuan
66
103
0
23 Oct 2022
Mask3D: Mask Transformer for 3D Semantic Instance Segmentation
Jonas Schult
Francis Engelmann
Alexander Hermans
Or Litany
Siyu Tang
Bastian Leibe
ISeg
39
164
0
06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
208
2,413
0
06 Oct 2022
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Yanmin Wu
Xinhua Cheng
Renrui Zhang
Zesen Cheng
Jian Zhang
31
62
0
29 Sep 2022
Self-Supervised Visuo-Tactile Pretraining to Locate and Follow Garment Features
J. Kerr
Huang Huang
Albert Wilcox
Ryan Hoque
Jeffrey Ichnowski
Roberto Calandra
Ken Goldberg
41
27
0
26 Sep 2022
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
D. Fox
Jesse Thomason
Animesh Garg
LM&Ro
LLMAG
104
616
0
22 Sep 2022
Semantic Visual Simultaneous Localization and Mapping: A Survey
Kaiqi Chen
Jianhua Zhang
Jialing Liu
Qiyi Tong
Ruyu Liu
Shengyong Chen
Jianhua Zhang
Arash Ajoudani
Shengyong Chen
27
12
0
14 Sep 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
136
430
0
10 Jul 2022
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
Xiaofeng Gao
Qiaozi Gao
Ran Gong
Kaixiang Lin
Govind Thattai
Gaurav Sukhatme
LM&Ro
73
69
0
27 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
313
8,261
0
28 Jan 2022
FILM: Following Instructions in Language with Modular Methods
So Yeon Min
Devendra Singh Chaplot
Pradeep Ravikumar
Yonatan Bisk
Ruslan Salakhutdinov
LM&Ro
187
159
0
12 Oct 2021
Skill Induction and Planning with Latent Language
Pratyusha Sharma
Antonio Torralba
Jacob Andreas
LM&Ro
178
108
0
04 Oct 2021
ShapeMap 3-D: Efficient shape mapping through dense touch and vision
Sudharshan Suresh
Zilin Si
Joshua G. Mangelson
Wenzhen Yuan
Michael Kaess
58
56
0
20 Sep 2021
iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks
Chengshu Li
Fei Xia
Roberto Martín-Martín
Michael Lingelbach
S. Srivastava
...
Karen Liu
H. Gweon
Jiajun Wu
Li Fei-Fei
Silvio Savarese
LM&Ro
134
154
0
06 Aug 2021
Fast Contact-Implicit Model-Predictive Control
Simon Le Cleac'h
Taylor A. Howell
Shuo Yang
Chia-Yen Lee
John Z. Zhang
Arun L. Bishop
Mac Schwager
Zachary Manchester
79
79
0
12 Jul 2021
Tactile Object Pose Estimation from the First Touch with Geometric Contact Rendering
Maria Bauzá
Eric Valls
Bryan Lim
Theo Sechopoulos
Alberto Rodriguez
63
67
0
09 Dec 2020
Autonomous Spot: Long-Range Autonomous Exploration of Extreme Environments with Legged Locomotion
Amanda Bouman
M. Ginting
Nikhilesh Alatur
M. Palieri
David D. Fan
...
T. Pailevanian
Sung-Kyun Kim
K. Otsu
J. W. Burdick
Ali-akbar Agha-mohammadi
88
128
0
19 Oct 2020
SAPIEN: A SimulAted Part-based Interactive ENvironment
Fanbo Xiang
Yuzhe Qin
Kaichun Mo
Yikuan Xia
Hao Zhu
...
He-Nan Wang
Li Yi
Angel X. Chang
Leonidas J. Guibas
Hao Su
195
482
0
19 Mar 2020
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
117
126
0
26 Oct 2018