A Closer Look at Invalid Action Masking in Policy Gradient Algorithms


25 June 2020
Shengyi Huang
Santiago Ontañón

Papers citing "A Closer Look at Invalid Action Masking in Policy Gradient Algorithms"

50 / 82 papers shown
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
Bofei Liu
Dong Ye
Zunhao Yao
Zhaowei Sun
28
0
0
04 May 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
35
0
0
16 Apr 2025
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Mirko Stappert
Bernhard Lutz
Niklas Goby
Dirk Neumann
OffRL
31
0
0
03 Apr 2025
Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control
Eloy Anguiano Batanero
Ángela Fernández
Álvaro Barbero
72
0
0
26 Mar 2025
Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree
Mahsa Khosravi
Zhanhong Jiang
Joshua R. Waite
Sarah Jones
Hernan Torres
Arti Singh
Baskar Ganapathysubramanian
Asheesh Kumar Singh
S. Sarkar
36
0
0
23 Mar 2025
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Minori Narita
Ryo Kuroiwa
J. Christopher Beck
42
0
0
20 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
J. Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
139
0
0
05 Mar 2025
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Fangqi Liu
Rishav Sen
J. P. Talusan
Ava Pettet
Aaron Kandel
Yoshinori Suzue
Ayan Mukhopadhyay
A. Dubey
OffRL
36
0
0
24 Feb 2025
Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning
Austin Yubo He
Zi-Wen Liu
97
3
0
21 Feb 2025
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?
Michael Doherty
Robin Matzner
Rasoul Sadeghi
Polina Bayvel
Alejandra Beghelli
65
0
0
18 Feb 2025
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Shilong Zhang
Wenbo Li
Shoufa Chen
Chongjian Ge
Peize Sun
Y. Zhang
Yi-Xin Jiang
Zehuan Yuan
Binyue Peng
Ping Luo
DiffM
VGen
99
0
0
07 Feb 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
48
0
0
24 Jan 2025
Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control
Dickness Kwesiga
Suyash Chandra Vishnoi
Angshuman Guin
Michael Hunter
66
0
0
28 Nov 2024
Effective Analog ICs Floorplanning with Relational Graph Neural Networks and Reinforcement Learning
Davide Basso
Luca Bortolussi
Mirjana Videnovic-Misic
Husni M. Habal
60
1
0
20 Nov 2024
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
Zhichao Wang
Xinhai Chen
Chunye Gong
Bo Yang
Liang Deng
Yufei Sun
Yufei Pang
Jie Liu
AI4CE
30
0
0
19 Oct 2024
Multi-Agent Actor-Critics in Autonomous Cyber Defense
Mingjun Wang
Remington Dechene
26
0
0
11 Oct 2024
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Milad Farjadnasab
Shahin Sirouspour
33
0
0
08 Oct 2024
Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen
Miguel Costa
Morten W. Petersen
Arthur Vandervoort
Martin Drews
Karyn Morrissey
Francisco C. Pereira
AI4CE
22
0
0
27 Sep 2024
Revisiting Space Mission Planning: A Reinforcement Learning-Guided Approach for Multi-Debris Rendezvous
Agni Bandyopadhyay
Guenther Waxenegger-Wilfing
19
0
0
25 Sep 2024
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson
William Holmes
Ryan Menzies
Kez Smithson Whitehead
23
0
0
13 Sep 2024
Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning
Jiaming Yin
Weixiong Rao
Yu Xiao
Keshuang Tang
16
0
0
01 Sep 2024
DECAF: a Discrete-Event based Collaborative Human-Robot Framework for Furniture Assembly
Giulio Giacomuzzo
Matteo Terreran
Siddarth Jain
Diego Romeres
21
1
0
28 Aug 2024
Earth Observation Satellite Scheduling with Graph Neural Networks
Antoine Jacquet
Guillaume Infantes
Nicolas Meuleau
Emmanuel Benazera
Stéphanie Roussel
Vincent Baudoui
Jonathan Guerra
20
0
0
27 Aug 2024
Scenario-based Thermal Management Parametrization Through Deep Reinforcement Learning
Thomas Rudolf
Philip Muhl
Sören Hohmann
Lutz Eckstein
26
0
0
04 Aug 2024
Field Deployment of Multi-Agent Reinforcement Learning Based Variable Speed Limit Controllers
Yuhang Zhang
Zhiyao Zhang
Marcos Quiñones-Grueiro
William Barbour
Clay Weston
Gautam Biswas
Daniel Work
17
4
0
10 Jul 2024
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors
Hao Shi
Weili Song
Xinting Zhang
Jiahe Shi
Cuicui Luo
Xiang Ao
Hamid Arian
Luis Seco
21
1
0
26 Jun 2024
Injecting Combinatorial Optimization into MCTS: Application to the Board Game boop
Florian Richoux
14
2
0
13 Jun 2024
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
CLL
19
2
0
06 Jun 2024
HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios
Mingyang Jiang
Yueyuan Li
Songan Zhang
Siyuan Chen
Chunxiang Wang
Ming Yang
45
4
0
31 May 2024
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
A. Politowicz
Sahisnu Mazumder
Bing-Quan Liu
16
0
0
29 May 2024
Egret: Reinforcement Mechanism for Sequential Computation Offloading in Edge Computing
Haosong Peng
Yufeng Zhan
Dihua Zhai
Xiaopu Zhang
Yuanqing Xia
28
1
0
14 Apr 2024
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
Shang Wang
Deepak Ranganatha Sastry Mamillapalli
Tianpei Yang
Matthew E. Taylor
36
0
0
11 Apr 2024
Deep Reinforcement Learning-Based Approach for a Single Vehicle Persistent Surveillance Problem with Fuel Constraints
Manav Mishra
Hritik Bana
Saswata Sarkar
Sujeevraja Sanjeevi
PB Sujit
K. Sundar
21
0
0
09 Apr 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
16
1
0
05 Apr 2024
Solving a Real-World Optimization Problem Using Proximal Policy Optimization with Curriculum Learning and Reward Engineering
Abhijeet Pendyala
Asma Atamna
Tobias Glasmachers
OffRL
19
1
0
03 Apr 2024
Scaling Team Coordination on Graphs with Reinforcement Learning
Manshi Limbu
Zechen Hu
Xuan Wang
Daigo Shishika
Xuesu Xiao
26
4
0
09 Mar 2024
Learning to Solve Job Shop Scheduling under Uncertainty
Guillaume Infantes
Stéphanie Roussel
Pierre Pereira
Antoine Jacquet
Emmanuel Benazera
25
3
0
04 Mar 2024
Circuit Partitioning for Multi-Core Quantum Architectures with Deep Reinforcement Learning
Arnau Pastor
Pau Escofet
Sahar Ben Rached
Eduard Alarcón
Pere Barlet-Ros
S. Abadal
GNN
34
4
0
31 Jan 2024
Introducing PetriRL: An Innovative Framework for JSSP Resolution Integrating Petri nets and Event-based Reinforcement Learning
Sofiene Lassoued
Andreas Schwung
OffRL
8
5
0
23 Jan 2024
Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning
Changyu Chen
Ramesha Karunasena
Thanh Hong Nguyen
Arunesh Sinha
Pradeep Varakantham
21
8
0
26 Nov 2023
MARVEL: Multi-Agent Reinforcement-Learning for Large-Scale Variable Speed Limits
Yuhang Zhang
Marcos Quiñones-Grueiro
Zhiyao Zhang
Yanbing Wang
William Barbour
Gautam Biswas
Dan Work
30
5
0
18 Oct 2023
Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning
Mirco Theile
Harald Bayerlein
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
20
5
0
06 Sep 2023
The Impact of Overall Optimization on Warehouse Automation
H. Yoshitake
Pieter Abbeel
OffRL
23
1
0
11 Aug 2023
Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASH
Nghia T. Nguyen
Long Luu
Phuong Vo
Sang Nguyen
Cuong T. Do
Ngoc-Thanh Nguyen
AI4TS
11
1
0
25 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
42
2
0
21 Jul 2023
Learning Hierarchical Interactive Multi-Object Search for Mobile Manipulation
F. Schmalstieg
Daniel Honerkamp
Tim Welschehold
Abhinav Valada
16
14
0
12 Jul 2023
A Framework for dynamically meeting performance objectives on a service mesh
Forough Shahab Samani
Rolf Stadler
14
3
0
25 Jun 2023
Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning
Shuo Yu
Hongyan Xue
Xiang Ao
Feiyang Pan
Jia He
Dandan Tu
Qing He
AIFin
22
10
0
25 May 2023
MARC: A multi-agent robots control framework for enhancing reinforcement learning in construction tasks
Kangkang Duan
C. W. Suen
Zhengbo Zou
18
1
0
23 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
8
0
0
23 May 2023