Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
1806.01946
Cited By
v1
v2
v3
v4 (latest)
Learning to Understand Goal Specifications by Modelling Reward
5 June 2018
Dzmitry Bahdanau
Felix Hill
Jan Leike
Edward Hughes
Seyedarian Hosseini
Pushmeet Kohli
Edward Grefenstette
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning to Understand Goal Specifications by Modelling Reward"
50 / 105 papers shown
Complex Instruction Following with Diverse Style Policies in Football Games
Chenglu Sun
Shuo Shen
Haonan Hu
Wei Zhou
Chen Chen
84
0
0
25 Nov 2025
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
In-Chang Baek
Sung-Hyun Kim
Seo-Young Lee
Dong-Hyeun Kim
Kyung-Joong Kim
418
2
0
16 Mar 2025
Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning
Machine-mediated learning (ML), 2025
Negin Hashemi Dijujin
Seyed Roozbeh Razavi Rohani
Mohammad Samiei
M. Baghshah
277
0
0
28 Jan 2025
Few-Shot Task Learning through Inverse Generative Modeling
Neural Information Processing Systems (NeurIPS), 2024
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
484
4
0
07 Nov 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Beyazit Yalcinkaya
Niklas Lauffer
Marcell Vazquez-Chanlatte
Sanjit A. Seshia
AI4CE
485
13
0
31 Oct 2024
A Survey on Complex Tasks for Goal-Directed Interactive Agents
Mareike Hartmann
Alexander Koller
LM&Ro
LLMAG
293
1
0
27 Sep 2024
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought
Gabriel H. Sarch
Lawrence Jang
Michael J. Tarr
William W. Cohen
Kenneth Marino
Katerina Fragkiadaki
LLMAG
699
0
0
20 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
596
3
0
09 Jun 2024
Transfer Q Star: Principled Decoding for LLM Alignment
Souradip Chakraborty
Soumya Suvra Ghosal
Ming Yin
Dinesh Manocha
Mengdi Wang
Amrit Singh Bedi
Furong Huang
277
42
0
30 May 2024
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
187
0
0
24 Feb 2024
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Chaitanya Kharyal
S. Gottipati
Tanmay Kumar Sinha
Srijita Das
Matthew E. Taylor
LLMAG
195
2
0
03 Jan 2024
LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Ching-An Cheng
Andrey Kolobov
Dipendra Kumar Misra
Allen Nie
Adith Swaminathan
262
24
0
11 Dec 2023
Axiomatic Preference Modeling for Longform Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Corby Rosset
Guoqing Zheng
Victor C. Dibia
Ahmed Hassan Awadallah
Paul Bennett
SyDa
147
7
0
02 Dec 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Boyang Xue
Weichao Wang
Hongru Wang
Fei Mi
Rui Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
KELM
HILM
545
22
0
12 Oct 2023
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language
International Conference on Computer Science and Artificial Intelligence (ICCSAI), 2023
Zhourui Guo
Meng Yao
Yang Yu
Qiyue Yin
OnRL
234
1
0
21 Sep 2023
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning
Shaohui Peng
Xingui Hu
Qi Yi
Rui Zhang
Jiaming Guo
...
Rui Chen
Zidong Du
Qi Guo
Yunji Chen
Ling Li
LLMAG
LRM
LM&Ro
279
11
0
04 Sep 2023
Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation
Conference on Robot Learning (CoRL), 2023
Bokui (William) Shen
Ge Yang
Alan Yu
J. Wong
L. Kaelbling
Phillip Isola
VLM
315
146
0
27 Jul 2023
Designing Fiduciary Artificial Intelligence
Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Sebastian Benthall
David Shekman
172
9
0
27 Jul 2023
Improving Long-Horizon Imitation Through Instruction Prediction
AAAI Conference on Artificial Intelligence (AAAI), 2023
Joey Hejna
Pieter Abbeel
Lerrel Pinto
182
12
0
21 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
299
17
0
15 Jun 2023
Language to Rewards for Robotic Skill Synthesis
Conference on Robot Learning (CoRL), 2023
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
256
353
0
14 Jun 2023
Reward Collapse in Aligning Large Language Models
Journal of Data Science (JDS), 2023
Ziang Song
Tianle Cai
Jason D. Lee
Weijie J. Su
ALM
240
26
0
28 May 2023
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Neural Information Processing Systems (NeurIPS), 2023
Zheng Yuan
Hongyi Yuan
Chuanqi Tan
Wei Wang
Songfang Huang
Feiran Huang
ALM
402
475
0
11 Apr 2023
Conceptual Reinforcement Learning for Language-Conditioned Tasks
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shaohui Peng
Xingui Hu
Rui Zhang
Jiaming Guo
Qi Yi
Rui Chen
Zidong Du
Ling Li
Qi Guo
Yunji Chen
OffRL
181
12
0
09 Mar 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Neural Information Processing Systems (NeurIPS), 2023
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
257
54
0
09 Feb 2023
Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning
Ziyuan Cao
Reshma A. Ramachandra
K. Yu
171
2
0
08 Feb 2023
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Thomas Carta
Clément Romac
Thomas Wolf
Sylvain Lamprier
Olivier Sigaud
Pierre-Yves Oudeyer
LM&Ro
LLMAG
384
236
0
06 Feb 2023
Distilling Internet-Scale Vision-Language Models into Embodied Agents
International Conference on Machine Learning (ICML), 2023
T. Sumers
Kenneth Marino
Arun Ahuja
Rob Fergus
Ishita Dasgupta
LM&Ro
217
33
0
29 Jan 2023
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav
Computer Vision and Pattern Recognition (CVPR), 2023
Ram Ramrakhya
Dhruv Batra
Erik Wijmans
Abhishek Das
OffRL
428
95
0
18 Jan 2023
Inverse Reinforcement Learning for Text Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yujiao Fu
Deyi Xiong
Yue Dong
315
5
0
19 Dec 2022
Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models
Ted Xiao
Harris Chan
P. Sermanet
Ayzaan Wahid
Anthony Brohan
Karol Hausman
Sergey Levine
Jonathan Tompson
VLM
LM&Ro
426
77
0
21 Nov 2022
Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning
Riashat Islam
Hongyu Zang
Anirudh Goyal
Alex Lamb
Kenji Kawaguchi
Xin-hui Li
Romain Laroche
Yoshua Bengio
Rémi Tachet des Combes
OffRL
AI4CE
239
12
0
01 Nov 2022
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
International Conference on Learning Representations (ICLR), 2022
Edwin Zhang
Yujie Lu
William Wang
Amy Zhang
DiffM
LM&Ro
240
26
0
27 Oct 2022
Trust in Language Grounding: a new AI challenge for human-robot teams
David M. Bossens
C. Evers
232
1
0
05 Sep 2022
A Computational Interface to Translate Strategic Intent from Unstructured Language in a Low-Data Setting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Pradyumna Tambwekar
Lakshita Dodeja
Nathan Vaska
Wei Xu
Matthew C. Gombolay
244
1
0
17 Aug 2022
RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents
International Conference on Machine Learning (ICML), 2022
Rafael Rodríguez-Sánchez
Benjamin A. Spiegel
Jenny Wang
Roma Patel
Stefanie Tellex
George Konidaris
150
4
0
12 Aug 2022
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Sam Spilsbury
Alexander Ilin
204
8
0
06 Jul 2022
Leveraging Language for Accelerated Learning of Tool Manipulation
Conference on Robot Learning (CoRL), 2022
Allen Z. Ren
Bharat Govil
Tsung-Yen Yang
Karthik Narasimhan
Anirudha Majumdar
LM&Ro
179
42
0
27 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Neural Information Processing Systems (NeurIPS), 2022
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
364
29
0
20 Jun 2022
How to talk so AI will learn: Instructions, descriptions, and autonomy
Neural Information Processing Systems (NeurIPS), 2022
T. Sumers
Robert D. Hawkins
Mark K. Ho
Thomas Griffiths
Dylan Hadfield-Menell
LM&Ro
455
26
0
16 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
230
34
0
02 Jun 2022
Inferring Rewards from Language in Context
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jessy Lin
Daniel Fried
Dan Klein
Anca Dragan
LM&Ro
211
65
0
05 Apr 2022
Preprocessing Reward Functions for Interpretability
Erik Jenner
Adam Gleave
218
8
0
25 Mar 2022
Perceiving the World: Question-guided Reinforcement Learning for Text-based Games
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Qiufeng Wang
Chengqi Zhang
OffRL
211
22
0
20 Mar 2022
Training language models to follow instructions with human feedback
Neural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
2.1K
17,490
0
04 Mar 2022
Improving Intrinsic Exploration with Language Abstractions
Neural Information Processing Systems (NeurIPS), 2022
Jesse Mu
Victor Zhong
Roberta Raileanu
Minqi Jiang
Noah D. Goodman
Tim Rocktaschel
Edward Grefenstette
280
72
0
17 Feb 2022
Learning Invariable Semantical Representation from Language for Extensible Policy Generalization
Yihan Li
Jinsheng Ren
Tianrun Xu
Tianren Zhang
Haichuan Gao
Feng Chen
192
1
0
26 Jan 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
208
5
0
20 Jan 2022
Learning to Guide and to Be Guided in the Architect-Builder Problem
Paul Barde
Tristan Karch
Derek Nowrouzezahrai
Clément Moulin-Frier
C. Pal
Pierre-Yves Oudeyer
320
5
0
14 Dec 2021
LILA: Language-Informed Latent Actions
Conference on Robot Learning (CoRL), 2021
Siddharth Karamcheti
Megha Srivastava
Abigail Z. Jacobs
Dorsa Sadigh
LM&Ro
232
35
0
05 Nov 2021
1
2
3
Next