Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.15509
Cited By
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts
Computer Vision and Pattern Recognition (CVPR), 2022
31 May 2022
Bingqian Lin
Yi Zhu
Zicong Chen
Xiwen Liang
Jian-zhuo Liu
Xiaodan Liang
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts"
40 / 40 papers shown
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
138
0
0
18 Oct 2025
Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Kun Xiang
Terry Jingchen Zhang
Yinya Huang
Jixi He
Zirong Liu
...
J. N. Han
Hang Xu
Han Li
Bin Dong
Xiaodan Liang
PINN
AI4CE
376
1
0
06 Oct 2025
Landmark-Guided Knowledge for Vision-and-Language Navigation
International Conference on Intelligent Computing (ICIC), 2025
Dongsheng Yang
Meiling Zhu
Yinfeng Yu
LM&Ro
134
0
0
30 Sep 2025
Teaching RL Agents to Act Better: VLM as Action Advisor for Online Reinforcement Learning
Xiefeng Wu
Jing Zhao
Shu Zhang
Mingyu Hu
OffRL
96
1
0
25 Sep 2025
GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System
Quang H. Nguyen
T. H. Le
Huy Le Nguyen
T. Vo
Tung D. Ta
Baoru Huang
Minh Nhat Vu
Anh-Tien Nguyen
228
0
0
23 Jun 2025
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Computer Vision and Pattern Recognition (CVPR), 2025
Akhil Perincherry
Jacob Krantz
Stefan Lee
LM&Ro
261
7
0
20 Mar 2025
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Junyou Zhu
Yanyuan Qiao
Siqi Zhang
Xingjian He
Qi Wu
Jing Liu
VLM
382
4
0
27 Sep 2024
GSON: A Group-based Social Navigation Framework with Large Multimodal Model
IEEE Robotics and Automation Letters (RA-L), 2024
Shangyi Luo
Peng Sun
Ji Zhu
Yuhong Deng
Cunjun Yu
Anxing Xiao
Xueqian Wang
LM&Ro
461
6
0
26 Sep 2024
Foundation Models for Autonomous Robots in Unstructured Environments
Hossein Naderi
Alireza Shojaei
Lifu Huang
LM&Ro
291
5
0
19 Jul 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
309
74
0
17 Jul 2024
Towards Open-World Grasping with Large Vision-Language Models
Georgios Tziafas
Hamidreza Kasaei
LM&Ro
LRM
339
21
0
26 Jun 2024
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Mengjiao Shen
Jingwei Yang
Chengju Liu
Qijun Chen
VLM
329
3
0
25 Jun 2024
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi
Yicong Hong
Yuankai Qi
Qi Wu
Shirui Pan
Javen Qinfeng Shi
220
18
0
03 Jun 2024
MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains
Zhaohuan Zhan
Lisha Yu
Sijie Yu
Guang Tan
LLMAG
LM&Ro
311
21
0
17 May 2024
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
271
41
0
16 Apr 2024
Scaling Vision-and-Language Navigation With Offline RL
Valay Bundele
Mahesh Bhupati
Biplab Banerjee
Aditya Grover
OffRL
180
1
0
27 Mar 2024
Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation
Bowen Huang
Yanwei Zheng
Chuanlin Lan
Xinpeng Zhao
Yifei Zou
Dongxiao Yu
303
0
0
23 Mar 2024
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
IEEE Robotics and Automation Letters (RA-L), 2024
Ran Xu
Yan Shen
Xiaoqi Li
Kai Cheng
Hao Dong
LM&Ro
214
15
0
13 Mar 2024
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
IEEE International Conference on Multimedia and Expo (ICME), 2024
Dingbang Li
Wenzhou Chen
Xin Lin
LLMAG
LM&Ro
185
11
0
13 Mar 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
364
76
0
12 Mar 2024
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Ronghao Dang
Huiyi Chen
Chengju Liu
Qi Chen
230
3
0
06 Mar 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
319
10
0
05 Feb 2024
MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jiaqi Chen
Bingqian Lin
Ran Xu
Zhenhua Chai
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
LLMAG
268
73
0
14 Jan 2024
VaQuitA: Enhancing Alignment in LLM-Assisted Video Understanding
Yizhou Wang
Ruiyi Zhang
Haoliang Wang
Uttaran Bhattacharya
Yun Fu
Gang Wu
MLLM
234
19
0
04 Dec 2023
Transfer Learning in Robotics: An Upcoming Breakthrough? A Review of Promises and Challenges
Noémie Jaquier
Michael C. Welle
A. Gams
Kunpeng Yao
Bernardo Fichera
A. Billard
Aleš Ude
Tamim Asfour
Danica Kragic
260
35
0
29 Nov 2023
Robot Learning in the Era of Foundation Models: A Survey
Xuan Xiao
Jiahang Liu
Zhipeng Wang
Yanmin Zhou
Yong Qi
Qian Cheng
Bin He
Shuo Jiang
AI4CE
LM&Ro
428
48
0
24 Nov 2023
Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework
IEEE International Conference on Robotics and Automation (ICRA), 2023
Weiqin Zu
Wenbin Song
Ruiqing Chen
Ze Guo
Fanglei Sun
Zheng Tian
Wei Pan
Jun Wang
216
29
0
14 Nov 2023
Evaluating Explanation Methods for Vision-and-Language Navigation
European Conference on Artificial Intelligence (ECAI), 2023
Guanqi Chen
Lei Yang
Guanhua Chen
Jia Pan
XAI
206
1
0
10 Oct 2023
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Yanyuan Qiao
Zheng Yu
Qi Wu
VLM
179
25
0
20 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
325
85
0
09 Aug 2023
GridMM: Grid Memory Map for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
423
106
0
24 Jul 2023
Learning Vision-and-Language Navigation from YouTube Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Kun-Li Channing Lin
Peihao Chen
Di Huang
Thomas H. Li
Zhuliang Yu
Chuang Gan
LM&Ro
213
44
0
22 Jul 2023
Embodied Executable Policy Learning with Language-based Scene Summarization
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Jielin Qiu
Mengdi Xu
William Jongwon Han
Seungwhan Moon
Ding Zhao
LM&Ro
156
9
0
09 Jun 2023
A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Liuyi Wang
Zongtao He
Jiagui Tang
Ronghao Dang
Naijia Wang
Chengju Liu
Qi Chen
252
24
0
05 May 2023
Improving Vision-and-Language Navigation by Generating Future-View Image Semantics
Computer Vision and Pattern Recognition (CVPR), 2023
Jialu Li
Joey Tianyi Zhou
240
56
0
11 Apr 2023
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Dongyan An
Hongru Wang
Wenguan Wang
Zun Wang
Yan Huang
Keji He
Liang Wang
479
140
0
06 Apr 2023
MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2023
Jingjing Jiang
Nanning Zheng
MoE
306
12
0
02 Mar 2023
ESceme: Vision-and-Language Navigation with Episodic Scene Memory
International Journal of Computer Vision (IJCV), 2023
Qinjie Zheng
Daqing Liu
Chaoyue Wang
Jing Zhang
Dadong Wang
Dacheng Tao
LM&Ro
208
10
0
02 Mar 2023
VLN-Trans: Translator for the Vision and Language Navigation Agent
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yue Zhang
Parisa Kordjamshidi
243
24
0
18 Feb 2023
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Bingqian Lin
Yi Zhu
Xiaodan Liang
Liang Lin
Jian-zhuo Liu
CoGe
LM&Ro
295
5
0
13 Feb 2023
1