ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.00824
  4. Cited By
Exploration in Deep Reinforcement Learning: A Survey

Exploration in Deep Reinforcement Learning: A Survey

2 May 2022
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
    OffRL
ArXivPDFHTML

Papers citing "Exploration in Deep Reinforcement Learning: A Survey"

50 / 85 papers shown
Title
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
24
0
0
19 Apr 2025
A Graph-Based Reinforcement Learning Approach with Frontier Potential Based Reward for Safe Cluttered Environment Exploration
A Graph-Based Reinforcement Learning Approach with Frontier Potential Based Reward for Safe Cluttered Environment Exploration
Gabriele Calzolari
Vidya Sumathy
Christoforos Kanellakis
G. Nikolakopoulos
124
0
0
16 Apr 2025
Writing as a testbed for open ended agents
Writing as a testbed for open ended agents
Sian Gooding
Lucia Lopez-Rivilla
Edward Grefenstette
LLMAG
78
1
0
25 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
39
0
0
23 Mar 2025
Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference
Wenjie Qiu
Yi-Chen Li
Xuqin Zhang
Tianyi Zhang
Y. Zhang
Zongzhang Zhang
Yang Yu
ALM
46
0
0
01 Mar 2025
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
Pusen Dong
Tianchen Zhu
Yue Qiu
Haoyi Zhou
Jianxin Li
78
1
0
24 Feb 2025
UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning
Oubo Ma
L. Du
Yang Dai
Chunyi Zhou
Qingming Li
Yuwen Pu
Shouling Ji
41
0
0
28 Jan 2025
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Shresth Verma
Niclas Boehmer
Lingkai Kong
Milind Tambe
69
2
0
17 Jan 2025
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Animesh Singh Basnet
M. C. Ghanem
Dipo Dunsin
Wiktor Sowinski-Mydlarz
AAML
37
0
0
08 Jan 2025
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
45
0
0
03 Jan 2025
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Yan Gu
Zhaoze Liu
Shuhong Dai
Cong Liu
Ying Wang
Shen Wang
Georgios Theodoropoulos
Long Cheng
33
0
0
03 Jan 2025
From Hype to Reality: The Road Ahead of Deploying DRL in 6G Networks
From Hype to Reality: The Road Ahead of Deploying DRL in 6G Networks
Haiyuan Li
Hari Madhukumar
Peizheng Li
Yiran Teng
Shuangyi Yan
Dimitra Simeonidou
OffRL
AI4CE
25
0
0
30 Oct 2024
Utilizing Large Language Models for Event Deconstruction to Enhance
  Multimodal Aspect-Based Sentiment Analysis
Utilizing Large Language Models for Event Deconstruction to Enhance Multimodal Aspect-Based Sentiment Analysis
Xiaoyong Huang
Heli Sun
Qunshu Gao
Wenjie Huang
Ruichen Cao
19
0
0
18 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
35
0
0
17 Oct 2024
Urban Computing for Climate and Environmental Justice: Early
  Perspectives From Two Research Initiatives
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
37
0
0
06 Oct 2024
Multi-agent Reinforcement Learning for Dynamic Dispatching in Material
  Handling Systems
Multi-agent Reinforcement Learning for Dynamic Dispatching in Material Handling Systems
Xian Yeow Lee
Haiyan Wang
Daisuke Katsumata
Takaharu Matsui
Chetan Gupta
29
1
0
27 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion
  Detection
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
36
2
0
25 Sep 2024
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Bo Yue
Jian Li
Guiliang Liu
29
2
0
24 Sep 2024
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
Peng Liu
Jiawei Zhu
Cong Xu
Ming Zhao
Bin Wang
31
1
0
18 Sep 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and
  Practical Applications
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
35
1
0
13 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
29
4
0
06 Aug 2024
Reinforcement Learning for Sustainable Energy: A Survey
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Advanced deep-reinforcement-learning methods for flow control:
  group-invariant and positional-encoding networks improve learning speed and
  quality
Advanced deep-reinforcement-learning methods for flow control: group-invariant and positional-encoding networks improve learning speed and quality
Joogoo Jeon
Jean Rabault
Joel Vasanth
Francisco Alcántara-Ávila
Shilaj Baral
Ricardo Vinuesa
AI4CE
37
5
0
25 Jul 2024
Exploration in Knowledge Transfer Utilizing Reinforcement Learning
Exploration in Knowledge Transfer Utilizing Reinforcement Learning
Adam Jedlicka
Tatiana Valentine Guy
23
0
0
15 Jul 2024
Improving Sample Efficiency of Reinforcement Learning with Background
  Knowledge from Large Language Models
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Fuxiang Zhang
Junyou Li
Yi-Chen Li
Zongzhang Zhang
Yang Yu
Deheng Ye
OffRL
KELM
47
1
0
04 Jul 2024
Model-Free Active Exploration in Reinforcement Learning
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutière
OffRL
16
2
0
30 Jun 2024
External Model Motivated Agents: Reinforcement Learning for Enhanced
  Environment Sampling
External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling
Rishav Bhagat
Jonathan C. Balloch
Zhiyu Lin
Julia Kim
Mark O. Riedl
41
0
0
28 Jun 2024
Beyond Optimism: Exploration With Partially Observable Rewards
Beyond Optimism: Exploration With Partially Observable Rewards
Simone Parisi
Alireza Kazemipour
Michael H. Bowling
OffRL
32
1
0
20 Jun 2024
Toward Enhanced Reinforcement Learning-Based Resource Management via
  Digital Twin: Opportunities, Applications, and Challenges
Toward Enhanced Reinforcement Learning-Based Resource Management via Digital Twin: Opportunities, Applications, and Challenges
Nan Cheng
Xiucheng Wang
Zan Li
Zhisheng Yin
Tom H. Luan
Xuemin Shen
23
14
0
12 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
34
1
0
11 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
37
1
0
06 Jun 2024
Models That Prove Their Own Correctness
Models That Prove Their Own Correctness
Noga Amit
S. Goldwasser
Orr Paradise
G. Rothblum
LRM
36
2
0
24 May 2024
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Cong Lu
Shengran Hu
Jeff Clune
LLMAG
39
9
0
24 May 2024
A social path to human-like artificial intelligence
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z. Leibo
GNN
26
28
0
22 May 2024
Reinforcement Learning for Adaptive MCMC
Reinforcement Learning for Adaptive MCMC
Congye Wang
Wilson Chen
Heishiro Kanagawa
Chris J. Oates
BDL
21
2
0
22 May 2024
Adaptive Exploration for Data-Efficient General Value Function
  Evaluations
Adaptive Exploration for Data-Efficient General Value Function Evaluations
Arushi Jain
Josiah P. Hanna
Doina Precup
26
1
0
13 May 2024
Hindsight PRIORs for Reward Learning from Human Preferences
Hindsight PRIORs for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
40
5
0
12 Apr 2024
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Guangchen Lan
Dong-Jun Han
Abolfazl Hashemi
Vaneet Aggarwal
Christopher G. Brinton
122
15
0
09 Apr 2024
Is Exploration All You Need? Effective Exploration Characteristics for
  Transfer in Reinforcement Learning
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
Jonathan C. Balloch
Rishav Bhagat
Geigh Zollicoffer
Ruoran Jia
Julia Kim
Mark O. Riedl
OffRL
26
1
0
02 Apr 2024
Subequivariant Reinforcement Learning Framework for Coordinated Motion
  Control
Subequivariant Reinforcement Learning Framework for Coordinated Motion Control
Haoyu Wang
Xiaoyu Tan
Xihe Qiu
Chao Qu
33
2
0
22 Mar 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in
  Goal-Oriented Reinforcement Learning
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
32
1
0
11 Mar 2024
Deep Reinforcement Learning for Modelling Protein Complexes
Deep Reinforcement Learning for Modelling Protein Complexes
Ziqi Gao
Tao Feng
Jiaxuan You
Chenyi Zi
Yan Zhou
Chen Zhang
Jia Li
41
5
0
11 Mar 2024
Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich
  Reasoning
Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning
Hanqi Yan
Qinglin Zhu
Xinyu Wang
Lin Gui
Yulan He
LRM
LLMAG
24
4
0
22 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy
  Regularization
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Fuchun Sun
Huazhe Xu
CML
40
11
0
22 Feb 2024
Optimal Parallelization Strategies for Active Flow Control in Deep
  Reinforcement Learning-Based Computational Fluid Dynamics
Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics
Wang Jia
Hang Xu
AI4CE
33
4
0
18 Feb 2024
Monitored Markov Decision Processes
Monitored Markov Decision Processes
Simone Parisi
Montaser Mohammedalamen
Alireza Kazemipour
Matthew E. Taylor
Michael H. Bowling
OffRL
28
3
0
09 Feb 2024
Training Large Language Models for Reasoning through Reverse Curriculum
  Reinforcement Learning
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Zhiheng Xi
Wenxiang Chen
Boyang Hong
Senjie Jin
Rui Zheng
...
Xinbo Zhang
Peng Sun
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
32
20
0
08 Feb 2024
StepCoder: Improve Code Generation with Reinforcement Learning from
  Compiler Feedback
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
Shihan Dou
Yan Liu
Haoxiang Jia
Limao Xiong
Enyu Zhou
...
Tao Ji
Rui Zheng
Qi Zhang
Xuanjing Huang
Tao Gui
LLMAG
57
28
0
02 Feb 2024
Colored Noise in PPO: Improved Exploration and Performance through
  Correlated Action Sampling
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob J. Hollenstein
Georg Martius
J. Piater
14
3
0
18 Dec 2023
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the
  Generative Artificial Intelligence (AI) Research Landscape
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
81
46
0
18 Dec 2023
12
Next