Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

14 September 2021
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
    OffRL
arXiv: 2109.06668

Papers citing "Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain"

39 / 39 papers shown
Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks
Xinyu Wang
Jinbo Bi
Minghu Song
CLL
54
0
0
01 May 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
14
0
0
29 Apr 2025
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles
Sorin Grigorescu
Mihai V. Zaha
AI4CE
31
0
0
02 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Z. Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
I. Kolmanovsky
Dimitar Filev
44
0
0
31 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
87
2
0
24 Mar 2025
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
32
0
0
06 Nov 2024
Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Mohammad Feizabadi
Arman Hosseini
Zakaria Yahouni
26
0
0
01 Nov 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
18
0
0
18 Oct 2024
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
Qihan Qi
Xinsong Yang
Gang Xia
Daniel W. C. Ho
Pengyang Tang
18
0
0
09 Oct 2024
GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation
Yangtao Chen
Zixuan Chen
Junhui Yin
Jing Huo
Pinzhuo Tian
Jieqi Shi
Yang Gao
LM&Ro
35
2
0
30 Sep 2024
Multi-agent Reinforcement Learning for Dynamic Dispatching in Material Handling Systems
Xian Yeow Lee
Haiyan Wang
Daisuke Katsumata
Takaharu Matsui
Chetan Gupta
17
0
0
27 Sep 2024
The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
Renye Yan
Yaozhong Gan
You Wu
Ling Liang
Junliang Xing
Yimao Cai
Ru Huang
22
1
0
19 Aug 2024
VL-TGS: Trajectory Generation and Selection using Vision Language Models in Mapless Outdoor Environments
Daeun Song
Jing Liang
Xuesu Xiao
Dinesh Manocha
44
4
0
05 Aug 2024
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
Yuanyang Zhu
Zhi Wang
Yuanheng Zhu
Chunlin Chen
Dongbin Zhao
11
0
0
01 Aug 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
25
0
0
25 May 2024
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
Qiaosheng Zhang
Chenjia Bai
Shuyue Hu
Zhen Wang
Xuelong Li
19
1
0
30 Apr 2024
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
Shang Wang
Deepak Ranganatha Sastry Mamillapalli
Tianpei Yang
Matthew E. Taylor
26
0
0
11 Apr 2024
Emergent Braitenberg-style Behaviours for Navigating the ViZDoom 'My Way Home' Labyrinth
Caleidgh Bayer
Robert J. Smith
M. Heywood
18
0
0
09 Apr 2024
Multi-Fidelity Reinforcement Learning for Time-Optimal Quadrotor Re-planning
Gilhyun Ryou
Geoffrey Wang
S. Karaman
32
3
0
13 Mar 2024
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
Shihan Dou
Yan Liu
Haoxiang Jia
Limao Xiong
Enyu Zhou
...
Tao Ji
Rui Zheng
Qi Zhang
Xuanjing Huang
Tao Gui
LLMAG
54
9
0
02 Feb 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
21
9
0
22 Jan 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
8
6
0
19 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
13
4
0
15 Dec 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
23
14
0
05 Oct 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Y. Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
31
2
0
27 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
J. Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
16
35
0
24 Jun 2023
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
10
12
0
12 Jun 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
14
5
0
18 May 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery
Rushuai Yang
Chenjia Bai
Hongyi Guo
Siyuan Li
Bin Zhao
Zhen Wang
Peng Liu
Xuelong Li
SSL
9
16
0
08 May 2023
SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Qiang-qiang Wang
Jia-jia Zhang
Jing Xiao
X. Wang
6
0
0
16 Mar 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
9
0
0
15 Mar 2023
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
13
10
0
08 Nov 2022
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
53
12
0
02 Oct 2022
A Policy Resonance Approach to Solve the Problem of Responsibility Diffusion in Multiagent Reinforcement Learning
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Xiaolin Ai
Wanmai Yuan
14
0
0
16 Aug 2022
Image Augmentation Based Momentum Memory Intrinsic Reward for Sparse Reward Visual Scenes
Zheng Fang
Biao Zhao
Guizhong Liu
11
2
0
19 May 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei-ping Xu
Haichao Zhang
OffRL
50
31
0
28 Jan 2022
Curious Explorer: a provable exploration strategy in Policy Learning
M. Miani
Maurizio Parton
M. Romito
24
0
0
29 Jun 2021
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
126
350
0
16 Oct 2019
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
245
9,042
0
06 Jun 2015