ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1309.6821
  4. Cited By
Sample Complexity of Multi-task Reinforcement Learning

Sample Complexity of Multi-task Reinforcement Learning

Conference on Uncertainty in Artificial Intelligence (UAI), 2013
26 September 2013
Emma Brunskill
Lihong Li
ArXiv (abs)PDFHTML

Papers citing "Sample Complexity of Multi-task Reinforcement Learning"

50 / 73 papers shown
Learning to Reason as Action Abstractions with Scalable Mid-Training RL
Learning to Reason as Action Abstractions with Scalable Mid-Training RL
Shenao Zhang
Donghan Yu
Yihao Feng
Bowen Jin
Zhaoran Wang
John Peebles
Zirui Wang
OffRLReLMLRM
323
0
0
30 Sep 2025
Efficient Morphology-Aware Policy Transfer to New Embodiments
Efficient Morphology-Aware Policy Transfer to New Embodiments
Michael Przystupa
Hongyao Tang
Martin Jägersand
Santiago Miret
Mariano Phielipp
Matthew E. Taylor
Glen Berseth
163
1
0
05 Aug 2025
J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization
J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization
Austin Xu
Yilun Zhou
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
ELMLRM
599
7
0
19 May 2025
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from
  Shifted-Dynamics Data
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics DataInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRLOnRL
286
6
0
06 Nov 2024
LIMT: Language-Informed Multi-Task Visual World Models
LIMT: Language-Informed Multi-Task Visual World Models
Elie Aljalbout
Nikolaos Sotirakis
Patrick van der Smagt
Maximilian Karl
Nutan Chen
386
5
0
18 Jul 2024
Three Dogmas of Reinforcement Learning
Three Dogmas of Reinforcement Learning
David Abel
Mark K. Ho
Anna Harutyunyan
415
12
0
15 Jul 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy
  Evaluation
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
Constantine Caramanis
Yonathan Efroni
OffRL
431
6
0
03 Jun 2024
Statistical Context Detection for Deep Lifelong Reinforcement Learning
Statistical Context Detection for Deep Lifelong Reinforcement Learning
Jeffery Dick
Saptarshi Nath
Christos Peridis
Eseoghene Ben-Iwhiwhu
Soheil Kolouri
Andrea Soltoggio
OffRL
270
4
0
29 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine
  Learning
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
346
6
0
18 May 2024
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
Ruitao Chen
Liwei Wang
349
1
0
18 May 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement
  Learning with Diverse Tasks
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Ziping Xu
Zifan Xu
Runxuan Jiang
Peter Stone
Ambuj Tewari
367
2
0
03 Mar 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Sharing Knowledge in Multi-Task Deep Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2020
Carlo DÉramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
380
143
0
17 Jan 2024
Social Contract AI: Aligning AI Assistants with Implicit Group Norms
Social Contract AI: Aligning AI Assistants with Implicit Group Norms
Jan-Philipp Fränken
Sam Kwok
Peixuan Ye
Kanishk Gandhi
Dilip Arumugam
Jared Moore
Alex Tamkin
Tobias Gerstenberg
Noah D. Goodman
344
9
0
26 Oct 2023
Provable Benefits of Multi-task RL under Non-Markovian Decision Making
  Processes
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
Ruiquan Huang
Yuan Cheng
Jing Yang
Vincent Tan
Yingbin Liang
253
0
0
20 Oct 2023
Prospective Side Information for Latent MDPs
Prospective Side Information for Latent MDPsInternational Conference on Machine Learning (ICML), 2023
Jeongyeol Kwon
Yonathan Efroni
Shie Mannor
Constantine Caramanis
369
7
0
11 Oct 2023
Scaling Distributed Multi-task Reinforcement Learning with Experience
  Sharing
Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing
Sanae Amani
Khushbu Pahwa
samani
Lin F. Yang
LRM
214
1
0
11 Jul 2023
Addressing computational challenges in physical system simulations with
  machine learning
Addressing computational challenges in physical system simulations with machine learning
S. Ahamed
M. Uddin
AI4CE
244
2
0
16 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Bayesian Reinforcement Learning with Limited Cognitive LoadOpen Mind (OM), 2023
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
244
16
0
05 May 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward LearningNeural Information Processing Systems (NeurIPS), 2023
Dingwen Kong
Lin F. Yang
269
16
0
18 Apr 2023
Robust Knowledge Transfer in Tiered Reinforcement Learning
Robust Knowledge Transfer in Tiered Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Jiawei Huang
Niao He
OffRL
422
1
0
10 Feb 2023
Adversarial Online Multi-Task Reinforcement Learning
Adversarial Online Multi-Task Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2023
Quan Nguyen
Nishant A. Mehta
197
1
0
11 Jan 2023
Learning Mixtures of Markov Chains and MDPs
Learning Mixtures of Markov Chains and MDPsInternational Conference on Machine Learning (ICML), 2022
Chinmaya Kausik
Kevin Tan
Ambuj Tewari
312
13
0
17 Nov 2022
Curriculum-based Asymmetric Multi-task Reinforcement Learning
Curriculum-based Asymmetric Multi-task Reinforcement LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
H. Huang
Deheng Ye
Li Shen
Wen Liu
257
21
0
07 Nov 2022
Group Distributionally Robust Reinforcement Learning with Hierarchical
  Latent Variables
Group Distributionally Robust Reinforcement Learning with Hierarchical Latent VariablesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Mengdi Xu
Peide Huang
Yaru Niu
Visak C. V. Kumar
Jielin Qiu
...
Kuan-Hui Lee
Xuewei Qi
Henry Lam
Yue Liu
Ding Zhao
OOD
230
9
0
21 Oct 2022
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent
  Markov Decision Processes
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022
Runlong Zhou
Ruosong Wang
S. Du
383
3
0
20 Oct 2022
On the Power of Pre-training for Generalization in RL: Provable Benefits
  and Hardness
On the Power of Pre-training for Generalization in RL: Provable Benefits and HardnessInternational Conference on Machine Learning (ICML), 2022
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
319
8
0
19 Oct 2022
Multi-User Reinforcement Learning with Low Rank Rewards
Multi-User Reinforcement Learning with Low Rank RewardsInternational Conference on Machine Learning (ICML), 2022
Naman Agarwal
Prateek Jain
S. Kowshik
Dheeraj M. Nagaraj
Praneeth Netrapalli
OffRL
243
1
0
11 Oct 2022
Tractable Optimality in Episodic Latent MABs
Tractable Optimality in Episodic Latent MABsNeural Information Processing Systems (NeurIPS), 2022
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
310
3
0
05 Oct 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
202
5
0
05 Oct 2022
Model-Free Generative Replay for Lifelong Reinforcement Learning:
  Application to Starcraft-2
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2
Z. Daniels
Aswin Raghavan
Jesse Hostetler
Abrar Rahman
Indranil Sur
M. Piacentino
Ajay Divakaran
CLLOffRL
273
16
0
09 Aug 2022
Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement
  Learning
Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement LearningComputational Geosciences (Comput. Geosci.), 2022
Y. Nasir
L. Durlofsky
155
11
0
21 Jul 2022
Minimum Description Length Control
Minimum Description Length ControlInternational Conference on Learning Representations (ICLR), 2022
Theodore H. Moskovitz
Ta-Chu Kao
M. Sahani
M. Botvinick
253
1
0
17 Jul 2022
Provably Efficient Lifelong Reinforcement Learning with Linear Function
  Approximation
Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation
Sanae Amani
Lin F. Yang
Ching-An Cheng
OffRL
214
2
0
01 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Provable Benefits of Representational Transfer in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2022
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
342
40
0
29 May 2022
PAC-Bayesian Lifelong Learning For Multi-Armed Bandits
PAC-Bayesian Lifelong Learning For Multi-Armed BanditsData mining and knowledge discovery (DMKD), 2022
H. Flynn
David Reeb
M. Kandemir
Jan Peters
224
8
0
07 Mar 2022
Provably Efficient Causal Model-Based Reinforcement Learning for
  Systematic Generalization
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic GeneralizationAAAI Conference on Artificial Intelligence (AAAI), 2022
Mirco Mutti
Ric De Santi
Emanuele Rossi
J. Calderón
Michael M. Bronstein
Marcello Restelli
399
16
0
14 Feb 2022
Coordinated Attacks against Contextual Bandits: Fundamental Limits and
  Defense Mechanisms
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense MechanismsInternational Conference on Machine Learning (ICML), 2022
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
AAML
265
6
0
30 Jan 2022
Learning Mixtures of Linear Dynamical Systems
Learning Mixtures of Linear Dynamical SystemsInternational Conference on Machine Learning (ICML), 2022
Yanxi Chen
H. Vincent Poor
305
22
0
26 Jan 2022
Meta Learning MDPs with Linear Transition Models
Meta Learning MDPs with Linear Transition ModelsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Robert Muller
Aldo Pacchiano
245
4
0
21 Jan 2022
Reinforcement Learning in Reward-Mixing MDPs
Reinforcement Learning in Reward-Mixing MDPs
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
394
19
0
07 Oct 2021
Learning Meta Representations for Agents in Multi-Agent Reinforcement
  Learning
Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning
Shenao Zhang
Li Shen
Lei Han
Li Shen
259
8
0
30 Aug 2021
Provably Efficient Multi-Task Reinforcement Learning with Model Transfer
Provably Efficient Multi-Task Reinforcement Learning with Model TransferNeural Information Processing Systems (NeurIPS), 2021
Chicheng Zhang
Zhi Wang
OffRL
318
19
0
19 Jul 2021
Multitasking Inhibits Semantic Drift
Multitasking Inhibits Semantic DriftNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Athul Paul Jacob
M. Lewis
Jacob Andreas
234
13
0
15 Apr 2021
RL for Latent MDPs: Regret Guarantees and a Lower Bound
RL for Latent MDPs: Regret Guarantees and a Lower BoundNeural Information Processing Systems (NeurIPS), 2021
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
316
88
0
09 Feb 2021
Near-optimal Representation Learning for Linear Bandits and Linear RL
Near-optimal Representation Learning for Linear Bandits and Linear RLInternational Conference on Machine Learning (ICML), 2021
Jiachen Hu
Xiaoyu Chen
Chi Jin
Lihong Li
Liwei Wang
OffRL
375
58
0
08 Feb 2021
When Is Generalizable Reinforcement Learning Tractable?
When Is Generalizable Reinforcement Learning Tractable?Neural Information Processing Systems (NeurIPS), 2021
Dhruv Malik
Yuanzhi Li
Pradeep Ravikumar
OffRL
557
27
0
01 Jan 2021
Multitask Bandit Learning Through Heterogeneous Feedback Aggregation
Multitask Bandit Learning Through Heterogeneous Feedback AggregationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Zhi Wang
Chicheng Zhang
Manish Singh
L. Riek
Kamalika Chaudhuri
456
26
0
29 Oct 2020
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in
  Multi-Agent RL and Inventory Control
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control
Weichao Mao
Jianchao Tan
Ruihao Zhu
D. Simchi-Levi
Tamer Bacsar
364
17
0
07 Oct 2020
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Amy Zhang
Shagun Sodhani
Khimya Khetarpal
Joelle Pineau
377
5
0
14 Jul 2020
Sequential Transfer in Reinforcement Learning with a Generative Model
Sequential Transfer in Reinforcement Learning with a Generative Model
Andrea Tirinzoni
Riccardo Poiani
Marcello Restelli
202
26
0
01 Jul 2020
12
Next
Page 1 of 2