Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.14171
Cited By
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
25 June 2020
Shengyi Huang
Santiago Ontañón
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Closer Look at Invalid Action Masking in Policy Gradient Algorithms"
32 / 82 papers shown
Title
Automatic Design Method of Building Pipeline Layout Based on Deep Reinforcement Learning
Chen Yang
Zhe Zheng
Jiali Lin
AI4CE
11
1
0
18 May 2023
Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning
V. P. Su
ChunJun Cao
Hong-Ye Hu
Y. Yanay
C. Tahan
Brian Swingle
15
16
0
10 May 2023
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation
Guoliang He
Sean Parker
Eiko Yoneki
19
2
0
28 Apr 2023
Centralized control for multi-agent RL in a complex Real-Time-Strategy game
Roger Creus Castanyer
21
2
0
25 Apr 2023
Learning policies for resource allocation in business processes
J. Middelhuis
R. Bianco
E. Scherzer
Z. A. Bukhsh
I. Adan
R. Dijkman
9
6
0
19 Apr 2023
Frontier Semantic Exploration for Visual Target Navigation
Bangguo Yu
H. Kasaei
M. Cao
19
12
0
11 Apr 2023
Multi-Agent Reinforcement Learning with Action Masking for UAV-enabled Mobile Communications
D. Rizvi
David P. Boyle
14
4
0
29 Mar 2023
Optimization of Topology-Aware Job Allocation on a High-Performance Computing Cluster by Neural Simulated Annealing
Zekang Lan
Yan Xu
Ying-Min Huang
Dianxun Huang
Sheng-zhong Feng
11
1
0
06 Feb 2023
Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm
Yihong Li
Xiaoxi Zhang
Tian Zeng
Jingpu Duan
Chuanxi Wu
Di Wu
Xu Chen
8
15
0
01 Feb 2023
Learning to Generate All Feasible Actions
Mirco Theile
Daniele Bernardini
Raphael Trumpp
C. Piazza
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
27
2
0
26 Jan 2023
Learning to solve arithmetic problems with a virtual abacus
Flavio Petruzzellis
Ling-Hao Chen
Alberto Testolin
34
1
0
17 Jan 2023
Transformers as Policies for Variable Action Environments
Niklas Zwingenberger
17
2
0
09 Jan 2023
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
10
17
0
22 Dec 2022
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon
Alberto Pozanco
Daniel Borrajo
Sumitra Ganesh
OffRL
13
0
0
28 Nov 2022
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
M. Wolk
A. Applebaum
Camron Dennler
P. Dwyer
M. Moskowitz
...
N. Nichols
Nicole Park
Paul Rachwalski
Frank Rau
A. Webster
OffRL
AAML
17
16
0
28 Nov 2022
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
23
9
0
28 Nov 2022
Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes
Niklas Kochdumper
Hanna Krasowski
Xiao Wang
Stanley Bak
Matthias Althoff
14
28
0
19 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
31
239
0
03 Oct 2022
Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement Learning
Y. Nasir
L. Durlofsky
15
3
0
21 Jul 2022
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking
Arthur Muller
M. Sabatelli
11
8
0
21 Jun 2022
Reinforcement Learning Approach for Mapping Applications to Dataflow-Based Coarse-Grained Reconfigurable Array
Andre Xian Ming Chang
Parth Khopkar
Bashar Romanous
Abhishek Chaurasia
Patrick Estep
Skyler Windh
Douglas J Vanesko
Sheik Dawood Beer Mohideen
Eugenio Culurciello
57
5
0
26 May 2022
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking
Hanna Krasowski
Jakob Thumm
Marlon Müller
Lukas Schäfer
Xiao Wang
Matthias Althoff
80
19
0
13 May 2022
A Reinforcement Learning Approach to Domain-Knowledge Inclusion Using Grammar Guided Symbolic Regression
Laure Crochepierre
Lydia Boudjeloud
Vincent Barbesant
6
4
0
09 Feb 2022
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms
Shengyi Huang
Rousslan Fernand Julien Dossa
Chang Ye
Jeff Braga
OffRL
11
0
0
16 Nov 2021
Edge Rewiring Goes Neural: Boosting Network Resilience without Rich Features
Shanchao Yang
Kaili Ma
Baoxiang Wang
Tianshu Yu
H. Zha
AAML
25
0
0
18 Oct 2021
Gym-
μ
μ
μ
RTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning
Sheng-Jun Huang
Santiago Ontañón
Chris Bamford
Lukasz Grela
OffRL
6
35
0
21 May 2021
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
Jing-Cheng Pang
Tian Xu
Shengyi Jiang
Yu-Ren Liu
Yang Yu
10
0
0
18 May 2021
Generalising Discrete Action Spaces with Conditional Action Trees
Christopher Bamford
Alvaro Ovalle
9
8
0
15 Apr 2021
A Reinforcement Learning Environment For Job-Shop Scheduling
Pierre Tassel
M. Gebser
Konstantin Schekotihin
OffRL
14
48
0
08 Apr 2021
Understanding Continual Learning Settings with Data Distribution Drift Analysis
Timothée Lesort
Massimo Caccia
Irina Rish
25
55
0
04 Apr 2021
Deep Reinforcement Learning for Constrained Field Development Optimization in Subsurface Two-phase Flow
Y. Nasir
Jincong He
Chaoshun Hu
Shusei Tanaka
Kainan Wang
X. Wen
AI4CE
9
18
0
31 Mar 2021
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
Shengyi Huang
Santiago Ontañón
6
10
0
05 Oct 2020
Previous
1
2