ResearchTrend.AI
Towards Learning a Generalist Model for Embodied Navigation

4 December 2023
Duo Zheng
Shijia Huang
Lin Zhao
Yiwu Zhong
Liwei Wang
    LM&Ro
arXiv: 2312.02010

Papers citing "Towards Learning a Generalist Model for Embodied Navigation"

32 / 32 papers shown
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
45
0
0
01 May 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Y. Zhang
Chuan Qin
Bo Li
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
43
0
0
23 Apr 2025
Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation
Luo Ling
Bai Qianqian
LM&Ro
39
0
0
09 Apr 2025
The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?
Weichen Zhang
Ruiying Peng
Chen Gao
Jianjie Fang
Xin Zeng
...
Z. Wang
Jinqiang Cui
Xin Wang
Xinlei Chen
Y. Li
LRM
71
0
0
06 Apr 2025
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Zike Yan
Qi Wu
Zhihua Wei
J. Liu
48
0
0
31 Mar 2025
P3Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction
Yufeng Zhong
Chengjian Feng
Feng Yan
Fanfan Liu
Liming Zheng
Lin Ma
44
0
0
24 Mar 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Y. Liu
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Liang Lin
51
2
0
14 Mar 2025
SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation
Xiangyu Shi
Zerui Li
Wenqi Lyu
Jiatong Xia
Feras Dayoub
Yanyuan Qiao
Qi Wu
46
0
0
13 Mar 2025
Understanding and Evaluating Hallucinations in 3D Visual Language Models
Ruiying Peng
Kaiyuan Li
Weichen Zhang
Chen Gao
Xinlei Chen
Y. Li
38
0
0
18 Feb 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Mohit Bansal
Parisa Kordjamshidi
LRM
51
17
0
31 Dec 2024
EscapeBench: Pushing Language Models to Think Outside the Box
Cheng Qian
Peixuan Han
Qinyu Luo
Bingxiang He
X. Chen
...
Jiarui Yao
Xiaocheng Yang
Denghui Zhang
Yunzhu Li
Heng Ji
LLMAG
LRM
80
3
0
18 Dec 2024
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Y. Liu
Weikai Chen
Guanbin Li
Liang Lin
117
3
0
12 Dec 2024
doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation
Parthib Roy
Srinivasa Perisetla
Shashank Shriram
Harsha Krishnaswamy
Aryan Keskar
Ross Greer
VGen
72
2
0
08 Dec 2024
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World
Weixin Mao
Weiheng Zhong
Zhou Jiang
Dong Fang
Zhongyue Zhang
...
Fan Jia
Tiancai Wang
Haoqiang Fan
Osamu Yoshie
114
4
0
29 Nov 2024
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Y. Yang
32
3
0
18 Oct 2024
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Yanyuan Qiao
Wenqi Lyu
Hui Wang
Zixu Wang
Zerui Li
Yuan Zhang
Mingkui Tan
Qi Wu
LRM
36
2
0
27 Sep 2024
Navigation with VLM framework: Go to Any Language
Zecheng Yin
Chonghao Cheng
Lizhen
LM&Ro
32
0
0
18 Sep 2024
Learning to Move Like Professional Counter-Strike Players
David Durst
Feng Xie
Vishnu Sarukkai
Brennan Shacklett
I. Frosio
...
Carly Taylor
Gilbert Bernstein
Sanjiban Choudhury
Pat Hanrahan
Kayvon Fatahalian
23
0
0
25 Aug 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
42
18
0
17 Jul 2024
KUNPENG: An Embodied Large Model for Intelligent Maritime
Naiyao Wang
Tongbang Jiang
Ye Wang
Shaoyang Qiu
Bo Zhang
...
Yiyang Wang
Hongxiang Ren
Ruili Wang
Hongjun Shan
Hongbo Liu
28
3
0
12 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Y. Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
48
47
0
09 Jul 2024
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee Kenneth Wong
LM&Ro
44
8
0
08 Jul 2024
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models
Mengfei Du
Binhao Wu
Zejun Li
Xuanjing Huang
Zhongyu Wei
21
8
0
09 Jun 2024
A Survey of Useful LLM Evaluation
Ji-Lun Peng
Sijia Cheng
Egil Diau
Yung-Yu Shih
Po-Heng Chen
Yen-Ting Lin
Yun-Nung Chen
LLMAG
ELM
24
12
0
03 Jun 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
29
12
0
16 May 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
60
19
0
12 Mar 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Zhizheng Zhang
Wang He
LM&Ro
27
44
0
24 Feb 2024
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
56
9
0
01 Nov 2023
Instruction Tuning with GPT-4
Baolin Peng
Chunyuan Li
Pengcheng He
Michel Galley
Jianfeng Gao
SyDa
ALM
LM&MA
157
576
0
06 Apr 2023
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
D. Fox
LM&Ro
146
449
0
12 Sep 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
139
430
0
10 Jul 2022
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
240
495
0
07 Jun 2018