Action Space Shaping in Deep Reinforcement Learning

2 April 2020
Anssi Kanervisto
Christian Scheller
Ville Hautamaki
arXiv:2004.00980

Papers citing "Action Space Shaping in Deep Reinforcement Learning"

39 / 39 papers shown
Underactuated Biomimetic Autonomous Underwater Vehicle for Ecosystem Monitoring
Kaustubh Singh
Shivam Kumar
Shashikant Pawar
Sandeep Manjanna
92
0
0
09 Nov 2025
Strategic Coordination for Evolving Multi-agent Systems: A Hierarchical Reinforcement and Collective Learning Approach
Chuhao Qin
Evangelos Pournaras
136
2
0
22 Sep 2025
Reinforcement learning for graph theory, Parallelizing Wagner's approach
Alix Bouffard
Jane Breen
174
0
0
01 Sep 2025
Adapting Vision-Language Models for Evaluating World Models
Mariya Hendriksen
Tabish Rashid
David Bignell
Raluca Georgescu
Abdelhak Lemkhenter
Katja Hofmann
Sam Devlin
Sarah Parisot
257
1
0
22 Jun 2025
Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning
Hao Peng
Xiang Huang
Shuo Sun
Ruitong Zhang
Philip S. Yu
254
0
0
07 May 2025
A General Approach of Automated Environment Design for Learning the Optimal Power Flow
Energy-Efficient Computing and Networking (ECN), 2025
Thomas Wolgast
Astrid Nieße
AI4CE
259
0
0
01 May 2025
The Crucial Role of Problem Formulation in Real-World Reinforcement Learning
Industrial Cyber-Physical Systems (ICPS), 2025
Georg Schafer
Tatjana Krau
Jakob Rehrl
Stefan Huber
Simon Hirlaender
OffRL
308
2
0
26 Mar 2025
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Adaptive Agents and Multi-Agent Systems (AAMAS), 2025
Fangqi Liu
Rishav Sen
J. P. Talusan
Ava Pettet
Aaron Kandel
Yoshinori Suzue
Ayan Mukhopadhyay
A. Dubey
OffRL
309
3
0
24 Feb 2025
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation
Minwoo Kim
Geunsik Bae
Jinwoo Lee
Woojae Shin
Changseung Kim
Myong-Yol Choi
Heejung Shin
H. Oh
619
5
0
04 Feb 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
271
0
0
24 Jan 2025
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson
William Holmes
Ryan Menzies
Kez Smithson Whitehead
229
0
0
13 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
IFAC-PapersOnLine, 2024
Zihan Wang
N. Mahmoudian
273
4
0
13 Sep 2024
Multi-State-Action Tokenisation in Decision Transformers for Multi-Discrete Action Spaces
Perusha Moodley
Pramod S. Kaushik
Dhillu Thambi
Mark Trovinger
Praveen Paruchuri
Xia Hong
Benjamin Rosman
387
0
0
01 Jul 2024
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
A. Politowicz
Sahisnu Mazumder
Bing-Quan Liu
267
1
0
29 May 2024
Embedding-Aligned Language Models
Guy Tennenholtz
Yinlam Chow
Chih-Wei Hsu
Lior Shani
Ethan Liang
Craig Boutilier
AIFin
389
6
0
24 May 2024
Learning the Optimal Power Flow: Environment Design Matters
Thomas Wolgast
Astrid Nieße
AI4CE
217
15
0
26 Mar 2024
Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach
Alaa Selim
Yanzhu Ye
Junbo Zhao
Bo Yang
134
0
0
24 Feb 2024
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Neural Information Processing Systems (NeurIPS), 2023
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
Rohin Shah
378
7
0
05 Dec 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
316
1
0
11 Oct 2023
Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning
Mirco Theile
Harald Bayerlein
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
247
11
0
06 Sep 2023
LoopTune: Optimizing Tensor Computations with Reinforcement Learning
Dejan Grubisic
Bram Wasti
Chris Cummins
John Mellor-Crummey
A. Zlateski
337
3
0
04 Sep 2023
Context-Aware Composition of Agent Policies by Markov Decision Process Entity Embeddings and Agent Ensembles
Nicole Merkle
Ralf Mikut
228
4
0
28 Aug 2023
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models
ACM Conference on Recommender Systems (RecSys), 2023
Kabir Nagrecha
Lingyi Liu
P. Delgado
Prasanna Padmanabhan
OffRL
AI4CE
294
8
0
13 Aug 2023
Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
International Conference on Human Factors in Computing Systems (CHI), 2023
Stephanie Milani
Arthur Juliani
Ida Momennejad
Raluca Georgescu
Jaroslaw Rzepecki
Alison Shaw
Gavin Costello
Fei Fang
Sam Devlin
Katja Hofmann
274
19
0
02 Mar 2023
Automating DBSCAN via Deep Reinforcement Learning
International Conference on Information and Knowledge Management (CIKM), 2022
Ruitong Zhang
Hao Peng
Yingtong Dou
Hongzhi Zhang
Qingyun Sun
Jingyi Zhang
Philip S. Yu
OffRL
143
29
0
09 Aug 2022
Towards Modern Card Games with Large-Scale Action Spaces Through Action Representation
Zhiyuan Yao
Tianyu Shi
Site Li
Yiting Xie
Yu Qin
Xiongjie Xie
Huijuan Lu
Yan Zhang
135
2
0
25 Jun 2022
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking
Arthur Muller
M. Sabatelli
222
12
0
21 Jun 2022
CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Quanfu Fan
Yilai Li
Yuguang Yao
J. M. Cohn
Sijia Liu
S. Vos
M. Cianfrocco
OffRL
155
8
0
15 Apr 2022
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes
M. Mowbray
Dongda Zhang
Ehecatl Antonio del Rio Chanona
OffRL
297
7
0
01 Mar 2022
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
Yaoxin Wu
Wen Song
OffRL
484
18
0
13 Jan 2022
MimicBot: Combining Imitation and Reinforcement Learning to win in Bot Bowl
Nicola Pezzotti
183
1
0
21 Aug 2021
Distilling Reinforcement Learning Tricks for Video Games
Anssi Kanervisto
Christian Scheller
Yanick Schraner
Ville Hautamaki
OffRL
VLM
253
5
0
01 Jul 2021
Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors
Christian Arzate Cruz
Takeo Igarashi
291
7
0
27 May 2021
Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning
Shengyi Huang
Santiago Ontañón
Chris Bamford
Lukasz Grela
OffRL
379
42
0
21 May 2021
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks
Hao Peng
Ruitong Zhang
Yingtong Dou
Renyu Yang
Jingyi Zhang
Philip S. Yu
547
141
0
16 Apr 2021
Generalising Discrete Action Spaces with Conditional Action Trees
Christopher Bamford
Alvaro Ovalle
202
7
0
15 Apr 2021
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
Shengyi Huang
Santiago Ontañón
248
11
0
05 Oct 2020
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
The Florida AI Research Society (FLAIRS), 2020
Shengyi Huang
Santiago Ontañón
509
472
0
25 Jun 2020
MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning
Journal of Artificial Intelligence Research (JAIR), 2019
Raghunandan Rajan
Jessica Lizeth Borja Diaz
Suresh Guttikonda
Fabio Ferreira
André Biedenkapp
Jan Ole von Hartz
Katharina Eggensperger
525
7
0
17 Sep 2019