ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.11424
  4. Cited By
Regret Minimization for Partially Observable Deep Reinforcement Learning

Regret Minimization for Partially Observable Deep Reinforcement Learning

31 October 2017
Peter H. Jin
Kurt Keutzer
Sergey Levine
ArXivPDFHTML

Papers citing "Regret Minimization for Partially Observable Deep Reinforcement Learning"

16 / 16 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
62
8
0
02 Aug 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and
  Continuous Motion Planning in Dynamic Environments
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
32
1
0
17 Mar 2024
Reinforcement Learning of Display Transfer Robots in Glass Flow Control
  Systems: A Physical Simulation-Based Approach
Reinforcement Learning of Display Transfer Robots in Glass Flow Control Systems: A Physical Simulation-Based Approach
Hwajong Lee
Chan Kim
Seong-Woo Kim
21
0
0
12 Oct 2023
ReMIX: Regret Minimization for Monotonic Value Function Factorization in
  Multiagent Reinforcement Learning
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
40
11
0
11 Feb 2023
Unified Policy Optimization for Continuous-action Reinforcement Learning
  in Non-stationary Tasks and Games
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Rongjun Qin
Fan Luo
Hong Qian
Yang Yu
30
2
0
19 Aug 2022
Let's Collaborate: Regret-based Reactive Synthesis for Robotic
  Manipulation
Let's Collaborate: Regret-based Reactive Synthesis for Robotic Manipulation
Karan Muvvala
Peter Amorese
Morteza Lahijanian
32
12
0
14 Mar 2022
Dual Behavior Regularized Reinforcement Learning
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
23
1
0
19 Sep 2021
Accelerating the Learning of TAMER with Counterfactual Explanations
Accelerating the Learning of TAMER with Counterfactual Explanations
Jakob Karalus
F. Lindner
OffRL
29
4
0
03 Aug 2021
Optimize Neural Fictitious Self-Play in Regret Minimization Thinking
Optimize Neural Fictitious Self-Play in Regret Minimization Thinking
Yuxuan Chen
Li Zhang
Shijian Li
Gang Pan
21
2
0
22 Apr 2021
Adversarial jamming attacks and defense strategies via adaptive deep
  reinforcement learning
Adversarial jamming attacks and defense strategies via adaptive deep reinforcement learning
Feng Wang
Chen Zhong
M. C. Gursoy
Senem Velipasalar
AAML
23
8
0
12 Jul 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free
  learning
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
36
53
0
18 Jun 2020
Algorithms in Multi-Agent Systems: A Holistic Perspective from
  Reinforcement Learning and Game Theory
Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory
Yunlong Lu
Kai Yan
AI4CE
15
13
0
17 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Actor-Critic Policy Optimization in Partially Observable Multiagent
  Environments
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
13
148
0
21 Oct 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
16
8
0
10 Sep 2018
Online Convex Optimization for Sequential Decision Processes and
  Extensive-Form Games
Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games
Gabriele Farina
Christian Kroer
T. Sandholm
24
59
0
10 Sep 2018
1