v1v2v3v4 (latest)

Learning to Understand Goal Specifications by Modelling Reward

5 June 2018

Papers citing "Learning to Understand Goal Specifications by Modelling Reward"

50 / 105 papers shown

Complex Instruction Following with Diverse Style Policies in Football Games

25 Nov 2025

IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation

418

16 Mar 2025

Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement LearningMachine-mediated learning (ML), 2025

Negin Hashemi Dijujin

Seyed Roozbeh Razavi Rohani

Mohammad Samiei

M. Baghshah

277

28 Jan 2025

Few-Shot Task Learning through Inverse Generative ModelingNeural Information Processing Systems (NeurIPS), 2024

484

07 Nov 2024

Compositional Automata Embeddings for Goal-Conditioned Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

Beyazit Yalcinkaya

Niklas Lauffer

Marcell Vazquez-Chanlatte

Sanjit A. Seshia

AI4CE

485

31 Oct 2024

A Survey on Complex Tasks for Goal-Directed Interactive Agents

Mareike Hartmann

Alexander Koller

LM&Ro LLMAG

293

27 Sep 2024

VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought

699

20 Jun 2024

LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning

596

09 Jun 2024

Transfer Q Star: Principled Decoding for LLM Alignment

Ming Yin

Mengdi Wang

Furong Huang

277

30 May 2024

Reward Design for Justifiable Sequential Decision-Making

A. Sukovic

Goran Radanović

187

24 Feb 2024

GLIDE-RL: Grounded Language Instruction through DEmonstration in RLAdaptive Agents and Multi-Agent Systems (AAMAS), 2024

195

03 Jan 2024

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

262

11 Dec 2023

Axiomatic Preference Modeling for Longform Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Corby Rosset

Guoqing Zheng

Victor C. Dibia

Ahmed Hassan Awadallah

Paul Bennett

SyDa

147

02 Dec 2023

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Lifeng Shang

Xin Jiang

Qun Liu

Kam-Fai Wong

KELM HILM

545

12 Oct 2023

Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural languageInternational Conference on Computer Science and Artificial Intelligence (ICCSAI), 2023

234

21 Sep 2023

Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning

...

Ling Li

279

04 Sep 2023

Distilled Feature Fields Enable Few-Shot Language-Guided ManipulationConference on Robot Learning (CoRL), 2023

315

146

27 Jul 2023

Designing Fiduciary Artificial IntelligenceConference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023

Sebastian Benthall

David Shekman

172

27 Jul 2023

Improving Long-Horizon Imitation Through Instruction PredictionAAAI Conference on Artificial Intelligence (AAAI), 2023

Joey Hejna

Pieter Abbeel

Lerrel Pinto

182

21 Jun 2023

Semantic HELM: A Human-Readable Memory for Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

299

15 Jun 2023

Language to Rewards for Robotic Skill SynthesisConference on Robot Learning (CoRL), 2023

Wenhao Yu

...

Dorsa Sadigh

Jie Tan

Yuval Tassa

F. Xia

LM&Ro

256

353

14 Jun 2023

Reward Collapse in Aligning Large Language ModelsJournal of Data Science (JDS), 2023

240

28 May 2023

RRHF: Rank Responses to Align Language Models with Human Feedback without tearsNeural Information Processing Systems (NeurIPS), 2023

Zheng Yuan

Hongyi Yuan

Chuanqi Tan

402

475

11 Apr 2023

Conceptual Reinforcement Learning for Language-Conditioned TasksAAAI Conference on Artificial Intelligence (AAAI), 2023

Ling Li

181

09 Mar 2023

Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction ManualsNeural Information Processing Systems (NeurIPS), 2023

257

09 Feb 2023

Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning

Ziyuan Cao

Reshma A. Ramachandra

K. Yu

171

08 Feb 2023

Grounding Large Language Models in Interactive Environments with Online Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023

384

236

06 Feb 2023

Distilling Internet-Scale Vision-Language Models into Embodied AgentsInternational Conference on Machine Learning (ICML), 2023

Arun Ahuja

217

29 Jan 2023

PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNavComputer Vision and Pattern Recognition (CVPR), 2023

428

18 Jan 2023

Inverse Reinforcement Learning for Text SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yujiao Fu

Deyi Xiong

Yue Dong

315

19 Dec 2022

Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models

426

21 Nov 2022

Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning

Rémi Tachet des Combes

OffRL AI4CE

239

01 Nov 2022

Language Control Diffusion: Efficiently Scaling through Space, Time, and TasksInternational Conference on Learning Representations (ICLR), 2022

240

27 Oct 2022

Trust in Language Grounding: a new AI challenge for human-robot teams

David M. Bossens

C. Evers

232

05 Sep 2022

A Computational Interface to Translate Strategic Intent from Unstructured Language in a Low-Data SettingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

244

17 Aug 2022

RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning AgentsInternational Conference on Machine Learning (ICML), 2022

Rafael Rodríguez-Sánchez

150

12 Aug 2022

Compositional Generalization in Grounded Language Learning via Induced Model SparsityNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Sam Spilsbury

Alexander Ilin

204

06 Jul 2022

Leveraging Language for Accelerated Learning of Tool ManipulationConference on Robot Learning (CoRL), 2022

Allen Z. Ren

Anirudha Majumdar

179

27 Jun 2022

EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RLNeural Information Processing Systems (NeurIPS), 2022

364

20 Jun 2022

How to talk so AI will learn: Instructions, descriptions, and autonomyNeural Information Processing Systems (NeurIPS), 2022

Dylan Hadfield-Menell

LM&Ro

455

16 Jun 2022

Language and Culture Internalisation for Human-Like Autotelic AI

230

02 Jun 2022

Inferring Rewards from Language in ContextAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Daniel Fried

211

05 Apr 2022

Preprocessing Reward Functions for Interpretability

Erik Jenner

Adam Gleave

218

25 Mar 2022

Perceiving the World: Question-guided Reinforcement Learning for Text-based GamesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

211

20 Mar 2022

Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022

Carroll L. Wainwright

...

2.1K

17,490

04 Mar 2022

Improving Intrinsic Exploration with Language AbstractionsNeural Information Processing Systems (NeurIPS), 2022

280

17 Feb 2022

Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

192

26 Jan 2022

Safe Deep RL in 3D Environments using Human Feedback

208

20 Jan 2022

Learning to Guide and to Be Guided in the Architect-Builder Problem

320

14 Dec 2021

LILA: Language-Informed Latent ActionsConference on Robot Learning (CoRL), 2021

Dorsa Sadigh

232

05 Nov 2021

All Papers

Learning to Understand Goal Specifications by Modelling Reward

Papers citing "Learning to Understand Goal Specifications by Modelling Reward"