ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1203.0203
  4. Cited By
Fast Reinforcement Learning with Large Action Sets using
  Error-Correcting Output Codes for MDP Factorization

Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization

29 February 2012
Gabriel Dulac-Arnold
Ludovic Denoyer
Philippe Preux
Patrick Gallinari
    OffRL
ArXivPDFHTML

Papers citing "Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization"

15 / 15 papers shown
Title
On-line Policy Improvement using Monte-Carlo Search
On-line Policy Improvement using Monte-Carlo Search
Gerald Tesauro
Gregory R. Galperin
92
270
0
09 Jan 2025
Stochastic Q-learning for Large Discrete Action Spaces
Stochastic Q-learning for Large Discrete Action Spaces
Fares Fourati
Vaneet Aggarwal
Mohamed-Slim Alouini
OffRL
39
2
0
16 May 2024
Dynamic Neighborhood Construction for Structured Large Discrete Action
  Spaces
Dynamic Neighborhood Construction for Structured Large Discrete Action Spaces
F. Akkerman
Julius Luy
W. V. Heeswijk
Maximilian Schiffer
34
1
0
31 May 2023
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect
  Modeling
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
Yuta Saito
Qingyang Ren
Thorsten Joachims
CML
OffRL
19
22
0
14 May 2023
Off-Policy Evaluation for Large Action Spaces via Embeddings
Off-Policy Evaluation for Large Action Spaces via Embeddings
Yuta Saito
Thorsten Joachims
OffRL
27
43
0
13 Feb 2022
Action Redundancy in Reinforcement Learning
Action Redundancy in Reinforcement Learning
Nir Baram
Guy Tennenholtz
Shie Mannor
24
7
0
22 Feb 2021
Training a Resilient Q-Network against Observational Interference
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
26
14
0
18 Feb 2021
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from
  forbidden action
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin
Philippe Preux
Olivier Pietquin
16
12
0
04 Oct 2019
Neural Architecture Search in Embedding Space
Neural Architecture Search in Embedding Space
Chunmiao Liu
19
0
0
09 Sep 2019
Toybox: A Suite of Environments for Experimental Evaluation of Deep
  Reinforcement Learning
Toybox: A Suite of Environments for Experimental Evaluation of Deep Reinforcement Learning
Emma Tosch
Kaleigh Clary
John Foley
David D. Jensen
OffRL
17
9
0
07 May 2019
Approximate Dynamic Programming with Neural Networks in Linear Discrete
  Action Spaces
Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces
W. V. Heeswijk
H. L. Poutré
18
9
0
26 Feb 2019
Learn What Not to Learn: Action Elimination with Deep Reinforcement
  Learning
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Tom Zahavy
Matan Haroush
Nadav Merlis
D. Mankowitz
Shie Mannor
18
184
0
06 Sep 2018
Reinforcement Learning with Function-Valued Action Spaces for Partial
  Differential Equation Control
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
Yangchen Pan
Amir-massoud Farahmand
Martha White
S. Nabi
P. Grover
D. Nikovski
38
18
0
13 Jun 2018
Deep Reinforcement Learning in Large Discrete Action Spaces
Deep Reinforcement Learning in Large Discrete Action Spaces
Gabriel Dulac-Arnold
Richard Evans
H. V. Hasselt
P. Sunehag
Timothy Lillicrap
Jonathan J. Hunt
Timothy A. Mann
T. Weber
T. Degris
Ben Coppin
OffRL
9
565
0
24 Dec 2015
Rollout Sampling Approximate Policy Iteration
Rollout Sampling Approximate Policy Iteration
Christos Dimitrakakis
M. Lagoudakis
OffRL
69
74
0
14 May 2008
1