v1v2v3v4 (latest)

Interactive Grounded Language Acquisition and Generalization in a 2D World

31 January 2018

Papers citing "Interactive Grounded Language Acquisition and Generalization in a 2D World"

50 / 54 papers shown

CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection

Guankun Wang

Han Xiao

Huxin Gao

Renrui Zhang

Long Bai

Xiaoxiao Yang

Zhen Li

Hongsheng Li

Hongliang Ren

227

10 Oct 2024

Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural languageInternational Conference on Computer Science and Artificial Intelligence (ICCSAI), 2023

238

21 Sep 2023

Type-to-Track: Retrieve Any Object via Prompt-based TrackingNeural Information Processing Systems (NeurIPS), 2023

285

22 May 2023

Distilling Internet-Scale Vision-Language Models into Embodied AgentsInternational Conference on Machine Learning (ICML), 2023

Arun Ahuja

218

29 Jan 2023

Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World ModellingInternational Conference on Machine Learning (ICML), 2023

Kolby Nottingham

Prithviraj Ammanabrolu

Yejin Choi

251

105

28 Jan 2023

A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled RobotsConference on Robot Learning (CoRL), 2023

Tianchen Ji

Katherine Driggs-Campbell

188

23 Jan 2023

A Short Survey of Systematic Generalization

Yuanpeng Li

AI4CE

304

22 Nov 2022

Contrastive language and vision learning of general fashion conceptsScientific Reports (Sci Rep), 2022

425

08 Apr 2022

CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation TasksIEEE Robotics and Automation Letters (RA-L), 2021

Wolfram Burgard

539

417

06 Dec 2021

Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive LearningNeural Information Processing Systems (NeurIPS), 2021

Yizhen Zhang

Minkyu Choi

Kuan Han

Zhongming Liu

VLM

136

13 Nov 2021

In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications

Borja G. Leon

Murray Shanahan

Francesco Belardinelli

AI4CE

280

18 Oct 2021

Systematic Generalization on gSCAN: What is Nearly Solved and What is Next?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

169

25 Sep 2021

Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning

Fan Wang

Haoyi Xiong

Yang Cao

366

08 Sep 2021

Learning Visual-Audio Representations for Voice-Controlled RobotsIEEE International Conference on Robotics and Automation (ICRA), 2021

Peixin Chang

Shuijing Liu

D. L. McPherson

Katherine Driggs-Campbell

SSL

237

07 Sep 2021

Vision-Language Navigation: A Survey and Taxonomy

333

26 Aug 2021

Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-SceneIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

Qi Wu

Cheng-Ju Wu

Yixin Zhu

Jungseock Joo

242

05 Aug 2021

Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors

Christian Arzate Cruz

Takeo Igarashi

233

27 May 2021

Fast and Slow Learning of Recurrent Independent MechanismsInternational Conference on Learning Representations (ICLR), 2021

250

18 May 2021

Language in a (Search) Box: Grounding Language Learning in Real-World Human-Machine InteractionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Federico Bianchi

C. Greco

Jacopo Tagliabue

189

18 Apr 2021

Reinforcement Learning of Implicit and Explicit Control Flow in InstructionsInternational Conference on Machine Learning (ICML), 2021

Ethan A. Brooks

Janarthanan Rajendran

Richard L. Lewis

Satinder Singh

139

25 Feb 2021

LTL2Action: Generalizing LTL Instructions for Multi-Task RLInternational Conference on Machine Learning (ICML), 2021

Pashootan Vaezipoor

327

13 Feb 2021

A Survey on Deep Reinforcement Learning for Audio-Based ApplicationsArtificial Intelligence Review (AIR), 2021

338

01 Jan 2021

Visually Grounding Language Instruction for History-Dependent ManipulationIEEE International Conference on Robotics and Automation (ICRA), 2020

Dongheui Lee

280

16 Dec 2020

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2020

361

01 Nov 2020

Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020

128

01 Sep 2020

Compositional Networks Enable Systematic Generalization for Grounded Language UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Yen-Ling Kuo

Boris Katz

Andrei Barbu

393

06 Aug 2020

Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning

Borja G. Leon

Murray Shanahan

Francesco Belardinelli

NAI AI4CE

266

12 Jun 2020

Probing Emergent Semantics in Predictive Agents via Question AnsweringInternational Conference on Machine Learning (ICML), 2020

...

Arun Ahuja

226

01 Jun 2020

Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

221

19 May 2020

Language Conditioned Imitation Learning over Unstructured Data

Corey Lynch

P. Sermanet

LM&Ro

321

282

15 May 2020

Robust and Interpretable Grounding of Spatial References with Relation NetworksFindings (Findings), 2020

Tsung-Yen Yang

Andrew S. Lan

Karthik Narasimham

280

02 May 2020

Zero-Shot Compositional Policy Learning via Language Grounding

224

15 Apr 2020

Exploiting Language Instructions for Interpretable and Compositional Reinforcement Learning

Michiel van der Meer

Matteo Pirotta

Elia Bruni

164

13 Jan 2020

Look, Listen, and Act: Towards Audio-Visual Embodied NavigationIEEE International Conference on Robotics and Automation (ICRA), 2019

Chuang Gan

Yiwei Zhang

Jiajun Wu

Boqing Gong

J. Tenenbaum

215

151

25 Dec 2019

Learning to Request Guidance in Emergent CommunicationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

153

11 Dec 2019

Automated curriculum generation for Policy Gradients from Demonstrations

A. Srinivasan

Dzmitry Bahdanau

Maxime Chevalier-Boisvert

Yoshua Bengio

129

01 Dec 2019

Emergence of Numeric Concepts in Multi-Agent Autonomous Communication

Shangmin Guo

LLMAG

172

04 Nov 2019

Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order

Vladislav Kurenkov

Bulat Maksudov

Adil Khan

27 Oct 2019

Environmental drivers of systematicity and generalization in a situated agentInternational Conference on Learning Representations (ICLR), 2019

355

109

01 Oct 2019

Robot Sound Interpretation: Combining Sight and Sound in Learning-Based ControlIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2019

Peixin Chang

Shuijing Liu

Haonan Chen

Katherine Driggs-Campbell

199

19 Sep 2019

Mastering emergent language: learning to guide in simulated navigation

145

14 Aug 2019

Why Build an Assistant in Minecraft?

...

Jason Weston

276

22 Jul 2019

A Survey of Reinforcement Learning Informed by Natural LanguageInternational Joint Conference on Artificial Intelligence (IJCAI), 2019

218

302

10 Jun 2019

REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments

Qi Wu

Chunhua Shen

350

426

23 Apr 2019

ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning

Harris Chan

Yuhuai Wu

J. Kiros

Sanja Fidler

Jimmy Ba

195

12 Feb 2019

Embodied Multimodal Multitask Learning

Devendra Singh Chaplot

Devi Parikh

181

04 Feb 2019

Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning

379

247

03 Dec 2018

Object-oriented Targets for Visual Navigation using Rich Semantic Representations

Jean-Benoit Delbrouck

Stéphane Dupont

154

22 Nov 2018

Neural Modular Control for Embodied Question Answering

Devi Parikh

423

141

26 Oct 2018

Visual Semantic Navigation using Scene Priors

276

351

15 Oct 2018