ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.05064
  4. Cited By
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement
  Learning

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

15 June 2017
Junhyuk Oh
Satinder Singh
Honglak Lee
Pushmeet Kohli
    OffRL
ArXivPDFHTML

Papers citing "Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning"

43 / 43 papers shown
Title
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
Mathias Jackermeier
Alessandro Abate
OffRL
34
1
0
06 Oct 2024
Inductive Generalization in Reinforcement Learning from Specifications
Inductive Generalization in Reinforcement Learning from Specifications
Vignesh Subramanian
Rohit Kushwah
Subhajit Roy
Suguman Bansal
OffRL
33
0
0
05 Jun 2024
Reward Machines for Deep RL in Noisy and Uncertain Environments
Reward Machines for Deep RL in Noisy and Uncertain Environments
Andrew C. Li
Zizhao Chen
Toryn Q. Klassen
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
46
6
0
31 May 2024
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active
  Perception
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
Yiran Qin
Enshen Zhou
Qichang Liu
Zhen-fei Yin
Lu Sheng
Ruimao Zhang
Yu Qiao
Jing Shao
LM&Ro
30
39
0
12 Dec 2023
OceanChat: Piloting Autonomous Underwater Vehicles in Natural Language
OceanChat: Piloting Autonomous Underwater Vehicles in Natural Language
Jia Huang
Mengxue Hou
Junkai Wang
Fumin Zhang
32
5
0
27 Sep 2023
Conceptual Reinforcement Learning for Language-Conditioned Tasks
Conceptual Reinforcement Learning for Language-Conditioned Tasks
Shaohui Peng
Xingui Hu
Rui Zhang
Jiaming Guo
Qi Yi
Rui Chen
Zidong Du
Ling Li
Qi Guo
Yunji Chen
OffRL
33
8
0
09 Mar 2023
Describe, Explain, Plan and Select: Interactive Planning with Large
  Language Models Enables Open-World Multi-Task Agents
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
Zihao Wang
Shaofei Cai
Guanzhou Chen
Anji Liu
Xiaojian Ma
Yitao Liang
LM&Ro
LLMAG
58
315
0
03 Feb 2023
Does Zero-Shot Reinforcement Learning Exist?
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
28
38
0
29 Sep 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
S. Feizi
OffRL
17
2
0
28 Aug 2022
Fast Inference and Transfer of Compositional Task Structures for
  Few-shot Task Generalization
Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization
Sungryull Sohn
Hyunjae Woo
Jongwook Choi
lyubing qiang
Izzeddin Gur
Aleksandra Faust
Honglak Lee
BDL
OffRL
24
3
0
25 May 2022
SKILL-IL: Disentangling Skill and Knowledge in Multitask Imitation
  Learning
SKILL-IL: Disentangling Skill and Knowledge in Multitask Imitation Learning
Xihan Bian
Oscar Alejandro Mendez Maldonado
Simon Hadfield
24
6
0
06 May 2022
Environment Generation for Zero-Shot Compositional Reinforcement
  Learning
Environment Generation for Zero-Shot Compositional Reinforcement Learning
Izzeddin Gur
Natasha Jaques
Yingjie Miao
Jongwook Choi
Manoj Kumar Tiwari
Honglak Lee
Aleksandra Faust
28
43
0
21 Jan 2022
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical
  Reinforcement Learning
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning
Zichuan Lin
Junyou Li
Jianing Shi
Deheng Ye
Qiang Fu
Wei Yang
BDL
32
34
0
07 Dec 2021
In a Nutshell, the Human Asked for This: Latent Goals for Following
  Temporal Specifications
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
AI4CE
18
15
0
18 Oct 2021
Learning Language-Conditioned Robot Behavior from Offline Data and
  Crowd-Sourced Annotation
Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation
Suraj Nair
E. Mitchell
Kevin Chen
Brian Ichter
Silvio Savarese
Chelsea Finn
LM&Ro
OffRL
28
154
0
02 Sep 2021
Learning to Synthesize Programs as Interpretable and Generalizable
  Policies
Learning to Synthesize Programs as Interpretable and Generalizable Policies
Dweep Trivedi
Jesse Zhang
Shao-Hua Sun
Joseph J. Lim
NAI
9
71
0
31 Aug 2021
Pointer Value Retrieval: A new benchmark for understanding the limits of
  neural network generalization
Pointer Value Retrieval: A new benchmark for understanding the limits of neural network generalization
Chiyuan Zhang
M. Raghu
Jon M. Kleinberg
Samy Bengio
OOD
24
30
0
27 Jul 2021
gComm: An environment for investigating generalization in Grounded
  Language Acquisition
gComm: An environment for investigating generalization in Grounded Language Acquisition
Rishi Hazra
Sonu Dixit
20
0
0
09 May 2021
Grounding Language to Entities and Dynamics for Generalization in
  Reinforcement Learning
Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning
H. Wang
Victor Zhong
Karthik Narasimhan
78
53
0
19 Jan 2021
Evolving Graphical Planner: Contextual Global Planning for
  Vision-and-Language Navigation
Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation
Zhiwei Deng
Karthik Narasimhan
Olga Russakovsky
13
86
0
11 Jul 2020
Systematic Generalisation through Task Temporal Logic and Deep
  Reinforcement Learning
Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
NAI
AI4CE
31
28
0
12 Jun 2020
Human Instruction-Following with Deep Reinforcement Learning via
  Transfer-Learning from Text
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
11
81
0
19 May 2020
Language Conditioned Imitation Learning over Unstructured Data
Language Conditioned Imitation Learning over Unstructured Data
Corey Lynch
P. Sermanet
LM&Ro
30
241
0
15 May 2020
IKEA Furniture Assembly Environment for Long-Horizon Complex
  Manipulation Tasks
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
28
120
0
17 Nov 2019
Exploration via Hindsight Goal Generation
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
22
84
0
10 Jun 2019
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Tom Schaul
Diana Borsa
Joseph Modayil
Razvan Pascanu
11
63
0
25 Apr 2019
Good-Enough Compositional Data Augmentation
Good-Enough Compositional Data Augmentation
Jacob Andreas
19
230
0
21 Apr 2019
Learning To Follow Directions in Street View
Learning To Follow Directions in Street View
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
16
66
0
01 Mar 2019
Transfer in Deep Reinforcement Learning Using Successor Features and
  Generalised Policy Improvement
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
André Barreto
Diana Borsa
John Quan
Tom Schaul
David Silver
Matteo Hessel
D. Mankowitz
Augustin Žídek
Rémi Munos
OffRL
31
161
0
30 Jan 2019
Guiding Policies with Language via Meta-Learning
Guiding Policies with Language via Meta-Learning
John D. Co-Reyes
Abhishek Gupta
Suvansh Sanjeev
Nick Altieri
Jacob Andreas
John DeNero
Pieter Abbeel
Sergey Levine
LM&Ro
26
62
0
19 Nov 2018
Visual Semantic Navigation using Scene Priors
Visual Semantic Navigation using Scene Priors
Wei Yang
X. Wang
Ali Farhadi
Abhinav Gupta
Roozbeh Mottaghi
LM&Ro
22
320
0
15 Oct 2018
Learning Invariances for Policy Generalization
Learning Invariances for Policy Generalization
Rémi Tachet des Combes
Philip Bachman
H. V. Seijen
12
12
0
07 Sep 2018
Mapping Instructions to Actions in 3D Environments with Visual Goal
  Prediction
Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction
Dipendra Kumar Misra
Andrew Bennett
Valts Blukis
Eyvind Niklasson
Max Shatkhin
Yoav Artzi
LM&Ro
16
186
0
04 Sep 2018
Automatically Composing Representation Transformations as a Means for
  Generalization
Automatically Composing Representation Transformations as a Means for Generalization
Michael Chang
Abhishek Gupta
Sergey Levine
Thomas L. Griffiths
21
68
0
12 Jul 2018
A Dissection of Overfitting and Generalization in Continuous
  Reinforcement Learning
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
19
176
0
20 Jun 2018
Learning to Understand Goal Specifications by Modelling Reward
Learning to Understand Goal Specifications by Modelling Reward
Dzmitry Bahdanau
Felix Hill
Jan Leike
Edward Hughes
Seyedarian Hosseini
Pushmeet Kohli
Edward Grefenstette
16
157
0
05 Jun 2018
Transfer Learning for Related Reinforcement Learning Tasks via
  Image-to-Image Translation
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Shani Gamrian
Yoav Goldberg
24
104
0
31 May 2018
Zero-Shot Dialog Generation with Cross-Domain Latent Actions
Zero-Shot Dialog Generation with Cross-Domain Latent Actions
Tiancheng Zhao
M. Eskénazi
VLM
13
76
0
13 May 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
36
1,303
0
12 Mar 2018
Interactive Grounded Language Acquisition and Generalization in a 2D
  World
Interactive Grounded Language Acquisition and Generalization in a 2D World
Haonan Yu
Haichao Zhang
W. Xu
LLMAG
LM&Ro
14
77
0
31 Jan 2018
Building Generalizable Agents with a Realistic and Rich 3D Environment
Building Generalizable Agents with a Realistic and Rich 3D Environment
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
3DV
50
338
0
07 Jan 2018
HoME: a Household Multimodal Environment
HoME: a Household Multimodal Environment
Simon Brodeur
Ethan Perez
Ankesh Anand
Florian Golemo
Luca Herranz-Celotti
Florian Strub
Jean Rouat
Hugo Larochelle
Aaron Courville
LM&Ro
31
103
0
29 Nov 2017
FiLM: Visual Reasoning with a General Conditioning Layer
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
72
2,144
0
22 Sep 2017
1