ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.03743
  4. Cited By
Reinforcement Learning in Reward-Mixing MDPs

Reinforcement Learning in Reward-Mixing MDPs

7 October 2021
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
ArXivPDFHTML

Papers citing "Reinforcement Learning in Reward-Mixing MDPs"

14 / 14 papers shown
Title
A Classification View on Meta Learning Bandits
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
20
0
0
06 Apr 2025
Test-Time Regret Minimization in Meta Reinforcement Learning
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti
Aviv Tamar
18
4
0
04 Jun 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy
  Evaluation
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
C. Caramanis
Yonathan Efroni
OffRL
24
2
0
03 Jun 2024
Tractable Optimality in Episodic Latent MABs
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
37
3
0
05 Oct 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
27
5
0
05 Oct 2022
Learning in Observable POMDPs, without Computationally Intractable
  Oracles
Learning in Observable POMDPs, without Computationally Intractable Oracles
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
8
26
0
07 Jun 2022
Reinforcement Learning with Brain-Inspired Modulation can Improve
  Adaptation to Environmental Changes
Reinforcement Learning with Brain-Inspired Modulation can Improve Adaptation to Environmental Changes
Eric Chalmers
Artur Luczak
14
3
0
19 May 2022
When Is Partially Observable Reinforcement Learning Not Scary?
When Is Partially Observable Reinforcement Learning Not Scary?
Qinghua Liu
Alan Chung
Csaba Szepesvári
Chi Jin
9
92
0
19 Apr 2022
Understanding Curriculum Learning in Policy Optimization for Online
  Combinatorial Optimization
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization
Runlong Zhou
Zelin He
Yuandong Tian
Yi Wu
S. Du
OffRL
10
3
0
11 Feb 2022
Coordinated Attacks against Contextual Bandits: Fundamental Limits and
  Defense Mechanisms
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
AAML
43
6
0
30 Jan 2022
Planning in Observable POMDPs in Quasipolynomial Time
Planning in Observable POMDPs in Quasipolynomial Time
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
14
27
0
12 Jan 2022
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
107
166
0
06 Jan 2021
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
104
194
0
07 Feb 2020
Learning mixtures of structured distributions over discrete domains
Learning mixtures of structured distributions over discrete domains
Siu On Chan
Ilias Diakonikolas
Rocco A. Servedio
Xiaorui Sun
59
83
0
02 Oct 2012
1