ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.14171
  4. Cited By
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

25 June 2020
Shengyi Huang
Santiago Ontañón
ArXivPDFHTML

Papers citing "A Closer Look at Invalid Action Masking in Policy Gradient Algorithms"

32 / 82 papers shown
Title
Automatic Design Method of Building Pipeline Layout Based on Deep
  Reinforcement Learning
Automatic Design Method of Building Pipeline Layout Based on Deep Reinforcement Learning
Chen Yang
Zhe Zheng
Jiali Lin
AI4CE
11
1
0
18 May 2023
Discovery of Optimal Quantum Error Correcting Codes via Reinforcement
  Learning
Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning
V. P. Su
ChunJun Cao
Hong-Ye Hu
Y. Yanay
C. Tahan
Brian Swingle
15
16
0
10 May 2023
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs
  Transformation
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation
Guoliang He
Sean Parker
Eiko Yoneki
19
2
0
28 Apr 2023
Centralized control for multi-agent RL in a complex Real-Time-Strategy
  game
Centralized control for multi-agent RL in a complex Real-Time-Strategy game
Roger Creus Castanyer
21
2
0
25 Apr 2023
Learning policies for resource allocation in business processes
Learning policies for resource allocation in business processes
J. Middelhuis
R. Bianco
E. Scherzer
Z. A. Bukhsh
I. Adan
R. Dijkman
9
6
0
19 Apr 2023
Frontier Semantic Exploration for Visual Target Navigation
Frontier Semantic Exploration for Visual Target Navigation
Bangguo Yu
H. Kasaei
M. Cao
19
12
0
11 Apr 2023
Multi-Agent Reinforcement Learning with Action Masking for UAV-enabled
  Mobile Communications
Multi-Agent Reinforcement Learning with Action Masking for UAV-enabled Mobile Communications
D. Rizvi
David P. Boyle
14
4
0
29 Mar 2023
Optimization of Topology-Aware Job Allocation on a High-Performance
  Computing Cluster by Neural Simulated Annealing
Optimization of Topology-Aware Job Allocation on a High-Performance Computing Cluster by Neural Simulated Annealing
Zekang Lan
Yan Xu
Ying-Min Huang
Dianxun Huang
Sheng-zhong Feng
11
1
0
06 Feb 2023
Task Placement and Resource Allocation for Edge Machine Learning: A
  GNN-based Multi-Agent Reinforcement Learning Paradigm
Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm
Yihong Li
Xiaoxi Zhang
Tian Zeng
Jingpu Duan
Chuanxi Wu
Di Wu
Xu Chen
8
15
0
01 Feb 2023
Learning to Generate All Feasible Actions
Learning to Generate All Feasible Actions
Mirco Theile
Daniele Bernardini
Raphael Trumpp
C. Piazza
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
27
2
0
26 Jan 2023
Learning to solve arithmetic problems with a virtual abacus
Learning to solve arithmetic problems with a virtual abacus
Flavio Petruzzellis
Ling-Hao Chen
Alberto Testolin
34
1
0
17 Jan 2023
Transformers as Policies for Variable Action Environments
Transformers as Policies for Variable Action Environments
Niklas Zwingenberger
17
2
0
09 Jan 2023
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with
  Robotic and Human Co-Workers
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
10
17
0
22 Dec 2022
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement
  Learning
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon
Alberto Pozanco
Daniel Borrajo
Sumitra Ganesh
OffRL
13
0
0
28 Nov 2022
Beyond CAGE: Investigating Generalization of Learned Autonomous Network
  Defense Policies
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
M. Wolk
A. Applebaum
Camron Dennler
P. Dwyer
M. Moskowitz
...
N. Nichols
Nicole Park
Paul Rachwalski
Frank Rau
A. Webster
OffRL
AAML
17
16
0
28 Nov 2022
Learning to design without prior data: Discovering generalizable design
  strategies using deep learning and tree search
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
23
9
0
28 Nov 2022
Provably Safe Reinforcement Learning via Action Projection using
  Reachability Analysis and Polynomial Zonotopes
Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes
Niklas Kochdumper
Hanna Krasowski
Xiao Wang
Stanley Bak
Matthias Althoff
14
28
0
19 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing:
  Benchmarks, Baselines, and Building Blocks for Natural Language Policy
  Optimization
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
31
239
0
03 Oct 2022
Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement
  Learning
Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement Learning
Y. Nasir
L. Durlofsky
15
3
0
21 Jul 2022
Safe and Psychologically Pleasant Traffic Signal Control with
  Reinforcement Learning using Action Masking
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking
Arthur Muller
M. Sabatelli
11
8
0
21 Jun 2022
Reinforcement Learning Approach for Mapping Applications to
  Dataflow-Based Coarse-Grained Reconfigurable Array
Reinforcement Learning Approach for Mapping Applications to Dataflow-Based Coarse-Grained Reconfigurable Array
Andre Xian Ming Chang
Parth Khopkar
Bashar Romanous
Abhishek Chaurasia
Patrick Estep
Skyler Windh
Douglas J Vanesko
Sheik Dawood Beer Mohideen
Eugenio Culurciello
57
5
0
26 May 2022
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and
  Benchmarking
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking
Hanna Krasowski
Jakob Thumm
Marlon Müller
Lukas Schäfer
Xiao Wang
Matthias Althoff
80
19
0
13 May 2022
A Reinforcement Learning Approach to Domain-Knowledge Inclusion Using
  Grammar Guided Symbolic Regression
A Reinforcement Learning Approach to Domain-Knowledge Inclusion Using Grammar Guided Symbolic Regression
Laure Crochepierre
Lydia Boudjeloud
Vincent Barbesant
6
4
0
09 Feb 2022
CleanRL: High-quality Single-file Implementations of Deep Reinforcement
  Learning Algorithms
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms
Shengyi Huang
Rousslan Fernand Julien Dossa
Chang Ye
Jeff Braga
OffRL
11
0
0
16 Nov 2021
Edge Rewiring Goes Neural: Boosting Network Resilience without Rich
  Features
Edge Rewiring Goes Neural: Boosting Network Resilience without Rich Features
Shanchao Yang
Kaili Ma
Baoxiang Wang
Tianshu Yu
H. Zha
AAML
25
0
0
18 Oct 2021
Gym-$μ$RTS: Toward Affordable Full Game Real-time Strategy Games
  Research with Deep Reinforcement Learning
Gym-μμμRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning
Sheng-Jun Huang
Santiago Ontañón
Chris Bamford
Lukasz Grela
OffRL
6
35
0
21 May 2021
Reinforcement Learning With Sparse-Executing Actions via Sparsity
  Regularization
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
Jing-Cheng Pang
Tian Xu
Shengyi Jiang
Yu-Ren Liu
Yang Yu
10
0
0
18 May 2021
Generalising Discrete Action Spaces with Conditional Action Trees
Generalising Discrete Action Spaces with Conditional Action Trees
Christopher Bamford
Alvaro Ovalle
9
8
0
15 Apr 2021
A Reinforcement Learning Environment For Job-Shop Scheduling
A Reinforcement Learning Environment For Job-Shop Scheduling
Pierre Tassel
M. Gebser
Konstantin Schekotihin
OffRL
14
48
0
08 Apr 2021
Understanding Continual Learning Settings with Data Distribution Drift
  Analysis
Understanding Continual Learning Settings with Data Distribution Drift Analysis
Timothée Lesort
Massimo Caccia
Irina Rish
25
55
0
04 Apr 2021
Deep Reinforcement Learning for Constrained Field Development
  Optimization in Subsurface Two-phase Flow
Deep Reinforcement Learning for Constrained Field Development Optimization in Subsurface Two-phase Flow
Y. Nasir
Jincong He
Chaoshun Hu
Shusei Tanaka
Kainan Wang
X. Wen
AI4CE
9
18
0
31 Mar 2021
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards
  for Real-time Strategy Games
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
Shengyi Huang
Santiago Ontañón
6
10
0
05 Oct 2020
Previous
12