ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.06342
  4. Cited By
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning

19 November 2015
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
    OffRL
ArXivPDFHTML

Papers citing "Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning"

50 / 120 papers shown
Title
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
Jiabin Lin
Shana Moothedath
Namrata Vaswani
64
4
0
08 Jan 2025
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
39
0
0
07 Aug 2024
BAKU: An Efficient Transformer for Multi-Task Policy Learning
BAKU: An Efficient Transformer for Multi-Task Policy Learning
Siddhant Haldar
Zhuoran Peng
Lerrel Pinto
OffRL
46
28
0
11 Jun 2024
Shared learning of powertrain control policies for vehicle fleets
Shared learning of powertrain control policies for vehicle fleets
Lindsey Kerbel
B. Ayalew
Andrej Ivanco
31
0
0
27 Apr 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
38
0
0
25 Apr 2024
Towards Multi-Morphology Controllers with Diversity and Knowledge
  Distillation
Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Alican Mertan
Nick Cheney
34
0
0
22 Apr 2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal
  Morphology Control
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Zheng Xiong
Risto Vuorio
Jacob Beck
Matthieu Zimmer
Kun Shao
Shimon Whiteson
39
1
0
09 Feb 2024
Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on
  Light-Weighed Backbones and Effective Measurement of Multi-Task Learning
  Challenges by Feature Disentanglement
Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement
Dayou Mao
Yuhao Chen
Yifan Wu
Maximilian Gilles
Alexander Wong
AAML
41
0
0
05 Feb 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Carlo DÉramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
59
124
0
17 Jan 2024
All by Myself: Learning Individualized Competitive Behaviour with a
  Contrastive Reinforcement Learning optimization
All by Myself: Learning Individualized Competitive Behaviour with a Contrastive Reinforcement Learning optimization
Pablo V. A. Barros
A. Sciutti
SSL
33
3
0
02 Oct 2023
AdaptNet: Policy Adaptation for Physics-Based Character Control
AdaptNet: Policy Adaptation for Physics-Based Character Control
Pei Xu
Kaixiang Xie
Sheldon Andrews
P. Kry
Michael Neff
Morgan McGuire
Ioannis Karamouzas
Victor Zordan
TTA
37
17
0
30 Sep 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for
  Multi-Policy Reuse
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
24
1
0
14 Aug 2023
Collaborative Development of NLP models
Collaborative Development of NLP models
Fereshte Khani
Marco Tulio Ribeiro
38
2
0
20 May 2023
Intelligent multicast routing method based on multi-agent deep
  reinforcement learning in SDWN
Intelligent multicast routing method based on multi-agent deep reinforcement learning in SDWN
Hongwen Hu
Miao Ye
Chenwei Zhao
Qiuxiang Jiang
Yong Wang
Hongbing Qiu
Xiaofang Deng
22
2
0
12 May 2023
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement
  Learning
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Tuomas Haarnoja
Ben Moran
Guy Lever
Sandy H. Huang
Dhruva Tirumala
...
Andrea Huber
N. Hurley
F. Nori
R. Hadsell
N. Heess
50
143
0
26 Apr 2023
Reinforcement Learning in the Wild with Maximum Likelihood-based Model
  Transfer
Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer
Hannes Eriksson
D. Basu
Tommy Tram
Mina Alibeigi
Christos Dimitrakakis
21
1
0
18 Feb 2023
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Raghav Goyal
E. Mavroudi
Xitong Yang
Sainbayar Sukhbaatar
Leonid Sigal
Matt Feiszli
Lorenzo Torresani
Du Tran
26
7
0
16 Feb 2023
Transferring Multiple Policies to Hotstart Reinforcement Learning in an
  Air Compressor Management Problem
Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem
Hélène Plisnier
Denis Steckelmacher
Jeroen Willems
B. Depraetere
Ann Nowé
OffRL
32
1
0
30 Jan 2023
Offline Q-Learning on Diverse Multi-Task Data Both Scales And
  Generalizes
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
44
48
0
28 Nov 2022
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended
  Exploration
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Giulia Vezzani
Dhruva Tirumala
Markus Wulfmeier
Dushyant Rao
A. Abdolmaleki
...
Tim Hertweck
Thomas Lampe
Fereshteh Sadeghi
N. Heess
Martin Riedmiller
OffRL
41
6
0
24 Nov 2022
Multi-Environment Pretraining Enables Transfer to Action Limited
  Datasets
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
25
5
0
23 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task
  Learning with Model-Accelerator Co-design
M3^33ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
42
81
0
26 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022
Model-based Lifelong Reinforcement Learning with Bayesian Exploration
Model-based Lifelong Reinforcement Learning with Bayesian Exploration
Haotian Fu
Shangqun Yu
Michael Littman
George Konidaris
BDL
OffRL
24
12
0
20 Oct 2022
Hypernetworks in Meta-Reinforcement Learning
Hypernetworks in Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Shimon Whiteson
OffRL
27
30
0
20 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement
  Learning
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
31
15
0
19 Oct 2022
Meta Reinforcement Learning for Optimal Design of Legged Robots
Meta Reinforcement Learning for Optimal Design of Legged Robots
Álvaro Belmonte-Baeza
Joonho Lee
Giorgio Valsecchi
Marco Hutter
43
17
0
06 Oct 2022
On the Convergence Theory of Meta Reinforcement Learning with
  Personalized Policies
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies
Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
28
0
0
21 Sep 2022
Distilling Deep RL Models Into Interpretable Neuro-Fuzzy Systems
Distilling Deep RL Models Into Interpretable Neuro-Fuzzy Systems
Arne Gevaert
Jonathan Peck
Yvan Saeys
23
1
0
07 Sep 2022
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
Chang Yang
Ruiyu Wang
Xinrun Wang
Zhen Wang
OffRL
27
3
0
07 Aug 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic
  Reinforcement Learning
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRL
OnRL
27
32
0
11 Jul 2022
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Shunyu Liu
Kaixuan Chen
Na Yu
Mingli Song
Zunlei Feng
Mingli Song
47
1
0
05 Jul 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to
  Accelerate Progress
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
23
33
0
29 May 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement
  Learning
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Remo Sasso
M. Sabatelli
M. Wiering
49
9
0
28 May 2022
How to Spend Your Robot Time: Bridging Kickstarting and Offline
  Reinforcement Learning for Vision-based Robotic Manipulation
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Alex X. Lee
Coline Devin
Jost Tobias Springenberg
Yuxiang Zhou
Thomas Lampe
A. Abdolmaleki
Konstantinos Bousmalis
OffRL
OnRL
24
15
0
06 May 2022
Learn-to-Race Challenge 2022: Benchmarking Safe Learning and
  Cross-domain Generalisation in Autonomous Racing
Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing
Jonathan M Francis
Bingqing Chen
Siddha Ganju
Sidharth Kathpal
Jyotish Poonganam
...
Ivan Zhukov
Max Kumskoy
Anirudh Koul
Jean Oh
Eric Nyberg
19
12
0
05 May 2022
Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Jiaqi Yang
Qi Lei
Jason D. Lee
S. Du
43
16
0
29 Mar 2022
MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
Shiming Chen
Ziming Hong
Guosen Xie
Wenhan Wang
Qinmu Peng
Kai Wang
Jian-jun Zhao
Xinge You
VLM
23
100
0
07 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in
  Actor-Critic Algorithms
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Romain Laroche
Rémi Tachet des Combes
46
2
0
15 Feb 2022
Transferred Q-learning
Transferred Q-learning
Elynn Y. Chen
Michael I. Jordan
Sai Li
OffRL
OnRL
33
4
0
09 Feb 2022
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Vitaly Kurin
Alessandro De Palma
Ilya Kostrikov
Shimon Whiteson
M. P. Kumar
39
74
0
11 Jan 2022
Curriculum Learning for Safe Mapless Navigation
Curriculum Learning for Safe Mapless Navigation
Luca Marzari
Davide Corsi
Enrico Marchesini
Alessandro Farinelli
24
14
0
23 Dec 2021
CoMPS: Continual Meta Policy Search
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL
OffRL
28
16
0
08 Dec 2021
Conflict-Averse Gradient Descent for Multi-task Learning
Conflict-Averse Gradient Descent for Multi-task Learning
Bo Liu
Xingchao Liu
Xiaojie Jin
Peter Stone
Qiang Liu
47
298
0
26 Oct 2021
Learning Multi-Objective Curricula for Robotic Policy Learning
Learning Multi-Objective Curricula for Robotic Policy Learning
Jikun Kang
Miao Liu
Abhinav Gupta
C. Pal
Xue Liu
Jie Fu
42
4
0
06 Oct 2021
On The Transferability of Deep-Q Networks
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
37
2
0
06 Oct 2021
SLAW: Scaled Loss Approximate Weighting for Efficient Multi-Task
  Learning
SLAW: Scaled Loss Approximate Weighting for Efficient Multi-Task Learning
M. Crawshaw
Jana Kosecka
26
7
0
16 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
123
Next