Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.06453
Cited By
Episodic Transformer for Vision-and-Language Navigation
13 May 2021
Alexander Pashevich
Cordelia Schmid
Chen Sun
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Episodic Transformer for Vision-and-Language Navigation"
50 / 139 papers shown
Title
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
121
81
0
18 Sep 2023
Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation
Hongchen Wang
Andy Guan Hong Chen
Xiaoqi Li
Mingdong Wu
Hao Dong
16
14
0
15 Sep 2023
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation
Yibo Cui
Liang Xie
Yakun Zhang
Meishan Zhang
Ye Yan
Erwei Yin
LM&Ro
29
16
0
24 Aug 2023
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation
Yi-Chiao Su
Dongyan An
Yuan Xu
Kehan Chen
Yan Huang
42
2
0
22 Aug 2023
Multi-Level Compositional Reasoning for Interactive Instruction Following
Suvaansh Bhambri
Byeonghwi Kim
Jonghyun Choi
LM&Ro
27
11
0
18 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
17
28
0
14 Aug 2023
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents
Byeonghwi Kim
Jinyeon Kim
Yuyeong Kim
Cheol-Hui Min
Jonghyun Choi
LM&Ro
30
26
0
14 Aug 2023
Object Goal Navigation with Recursive Implicit Maps
Shizhe Chen
Thomas Chabal
Ivan Laptev
Cordelia Schmid
22
19
0
10 Aug 2023
Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI
Hangjie Shi
Leslie Ball
Govind Thattai
Desheng Zhang
Lu Hu
...
Michael Johnston
Akshaya Iyengar
Arindam Mandal
Premkumar Natarajan
R. Ghanadan
25
5
0
09 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
8
48
0
09 Aug 2023
LEMMA: Learning Language-Conditioned Multi-Robot Manipulation
Ran Gong
Xiaofeng Gao
Qiaozi Gao
Suhaila Shakiah
Govind Thattai
Gaurav Sukhatme
LM&Ro
8
8
0
02 Aug 2023
MAEA: Multimodal Attribution for Embodied AI
Vidhi Jain
Jayant Sravan Tamarapalli
Sahiti Yerramilli
Yonatan Bisk
34
0
0
25 Jul 2023
GridMM: Grid Memory Map for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
26
50
0
24 Jul 2023
Learning Navigational Visual Representations with Semantic Map Supervision
Yicong Hong
Yang Zhou
Ruiyi Zhang
Franck Dernoncourt
Trung Bui
Stephen Gould
Hao Tan
SSL
30
21
0
23 Jul 2023
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Ruipu Luo
Jiwen Zhang
Zhongyu Wei
VLM
16
0
0
16 Jul 2023
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Zilai Zeng
Ce Zhang
Shijie Wang
Chen Sun
OffRL
27
5
0
07 Jul 2023
Improving Long-Horizon Imitation Through Instruction Prediction
Joey Hejna
Pieter Abbeel
Lerrel Pinto
9
7
0
21 Jun 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
31
16
0
20 Jun 2023
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments
Xiulong Liu
Sudipta Paul
Moitreya Chatterjee
A. Cherian
23
8
0
06 Jun 2023
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou
Yicong Hong
Qi Wu
ELM
LM&Ro
LLMAG
LRM
23
140
0
26 May 2023
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
Yue Fan
Jing Gu
Kaizhi Zheng
Xin Eric Wang
24
4
0
23 May 2023
Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
P. Sadler
Sherzod Hakimov
David Schlangen
27
1
0
22 May 2023
Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning
Georgia Chalvatzaki
A. Younes
Daljeet Nandha
An T. Le
Leonardo F. R. Ribeiro
Iryna Gurevych
LM&Ro
LRM
LLMAG
30
30
0
12 May 2023
Multimodal Contextualized Plan Prediction for Embodied Task Completion
Mert Inan
Aishwarya Padmakumar
Spandana Gella
P. Lange
Dilek Z. Hakkani-Tür
LM&Ro
44
0
0
10 May 2023
Pretrained Language Models as Visual Planners for Human Assistance
Dhruvesh Patel
H. Eghbalzadeh
Nitin Kamra
Michael L. Iuzzolino
Unnat Jain
Ruta Desai
LM&Ro
19
24
0
17 Apr 2023
ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Ran Gong
Jiangyong Huang
Yizhou Zhao
Haoran Geng
Xiaofeng Gao
...
Ziheng Zhou
D. Terzopoulos
Song-Chun Zhu
Baoxiong Jia
Siyuan Huang
LM&Ro
37
45
0
09 Apr 2023
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following
Mingyu Ding
Yan Xu
Zhenfang Chen
David D. Cox
Ping Luo
J. Tenenbaum
Chuang Gan
LM&Ro
51
21
0
07 Apr 2023
Lana: A Language-Capable Navigator for Instruction Following and Generation
Xiaohan Wang
Wenguan Wang
Jiayi Shao
Yi Yang
LLMAG
LM&Ro
36
37
0
15 Mar 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
17
19
0
07 Mar 2023
Alexa Arena: A User-Centric Interactive Platform for Embodied AI
Qiaozi Gao
Govind Thattai
Suhaila Shakiah
Xiaofeng Gao
Shreyas Pansare
...
Michael Johnston
R. Ghanadan
Arindam Mandal
Dilek Z. Hakkani-Tür
Premkumar Natarajan
6
25
0
02 Mar 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents
Wenlong Huang
Fei Xia
Dhruv Shah
Danny Driess
Andy Zeng
...
Pete Florence
Igor Mordatch
Sergey Levine
Karol Hausman
Brian Ichter
LM&Ro
19
41
0
01 Mar 2023
Multimodal Speech Recognition for Language-Guided Embodied Agents
Allen Chang
Xiaoyuan Zhu
Aarav Monga
Seoho Ahn
Tejas Srinivasan
Jesse Thomason
AuLLM
16
3
0
27 Feb 2023
Learning by Asking for Embodied Visual Navigation and Task Completion
Ying Shen
Ismini Lourentzou
20
1
0
09 Feb 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
19
47
0
09 Feb 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
24
0
29 Dec 2022
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
Yu Gu
Xiang Deng
Yu-Chuan Su
LLMAG
26
52
0
19 Dec 2022
RT-1: Robotics Transformer for Real-World Control at Scale
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Joseph Dabis
...
Ted Xiao
Peng-Tao Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
28
1,013
0
13 Dec 2022
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
Chan Hee Song
Jiaman Wu
Clay Washington
Brian M. Sadler
Wei-Lun Chao
Yu-Chuan Su
LLMAG
LM&Ro
11
381
0
08 Dec 2022
Layout-aware Dreamer for Embodied Referring Expression Grounding
Mingxiao Li
Zehao Wang
Tinne Tuytelaars
Marie-Francine Moens
LM&Ro
9
6
0
30 Nov 2022
Prompter: Utilizing Large Language Model Prompting for a Data Efficient Embodied Instruction Following
Y. Inoue
Hiroki Ohashi
LM&Ro
30
43
0
07 Nov 2022
Towards Versatile Embodied Navigation
H. Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
42
20
0
30 Oct 2022
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Ziqiao Ma
B. VanDerPloeg
Cristian-Paul Bara
Yidong Huang
Eui-In Kim
Felix Gervits
M. Marge
J. Chai
52
7
0
22 Oct 2022
DANLI: Deliberative Agent for Following Natural Language Instructions
Yichi Zhang
Jianing Yang
Jiayi Pan
Shane Storks
N. Devraj
Ziqiao Ma
Keunwoo Peter Yu
Yuwei Bao
J. Chai
LM&Ro
48
16
0
22 Oct 2022
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma
Silong Yong
Zilong Zheng
Qing Li
Yitao Liang
Song-Chun Zhu
Siyuan Huang
LM&Ro
22
129
0
14 Oct 2022
Retrospectives on the Embodied AI Workshop
Matt Deitke
Dhruv Batra
Yonatan Bisk
Tommaso Campari
Angel X. Chang
...
Jesse Thomason
Alexander Toshev
Joanne Truong
Luca Weihs
Jiajun Wu
LM&Ro
35
50
0
13 Oct 2022
Multi-Object Navigation with dynamically learned neural implicit representations
Pierre Marza
L. Matignon
Olivier Simonin
Christian Wolf
27
23
0
11 Oct 2022
Generating Executable Action Plans with Environmentally-Aware Language Models
Maitrey Gramopadhye
D. Szafir
LM&Ro
LLMAG
10
22
0
10 Oct 2022
Don't Copy the Teacher: Data and Model Challenges in Embodied Dialogue
So Yeon Min
Hao Zhu
Ruslan Salakhutdinov
Yonatan Bisk
LM&Ro
58
12
0
10 Oct 2022
Dialog Acts for Task-Driven Embodied Agents
Spandana Gella
Aishwarya Padmakumar
P. Lange
Dilek Z. Hakkani-Tür
LM&Ro
22
16
0
26 Sep 2022
NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields
Jiankai Sun
Yan Xu
Mingyu Ding
Hongwei Yi
Chen Wang
Jingdong Wang
Liangjun Zhang
Mac Schwager
40
12
0
24 Sep 2022
Previous
1
2
3
Next