Deep Reinforcement Learning: An Overview

25 January 2017
Yuxi Li
    OffRL
    VLM
arXiv:1701.07274

Papers citing "Deep Reinforcement Learning: An Overview"

50 / 418 papers shown
Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition
Zeki Doruk Erden
Donia Gasmi
Boi Faltings
CLL
21
0
0
13 May 2025
Adaptive Bias Generalized Rollout Policy Adaptation on the Flexible Job-Shop Scheduling Problem
Lotfi Kobrosly
Marc-Emmanuel Coupvent des Graviers
Christophe Guettier
Tristan Cazenave
18
0
0
13 May 2025
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
Chris Child
Lam Ngo
44
0
0
29 Apr 2025
ARMOR: Adaptive Meshing with Reinforcement Optimization for Real-time 3D Monitoring in Unexposed Scenes
Yizhe Zhang
Jianping Li
Xin Zhao
Fuxun Liang
Z. Dong
Bisheng Yang
AI4CE
24
0
0
28 Apr 2025
Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks
Tran Thuy Nga Truong
Jooyong Kim
19
0
0
24 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
22
0
0
19 Apr 2025
Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems
Xian Chen
R. Qu
Jing Dong
Ruibin Bai
Yaochu Jin
OffRL
22
0
0
10 Apr 2025
Anomaly Detection in Time Series Data Using Reinforcement Learning, Variational Autoencoder, and Active Learning
Bahareh Golchin
Banafsheh Rekabdar
AI4TS
25
0
0
03 Apr 2025
Minimum Description Length of a Spectrum Variational Autoencoder: A Theory
Canlin Zhang
Xiuwen Liu
38
0
0
01 Apr 2025
Reinforcement Learning for Active Matter
Wenjie Cai
Gongyi Wang
Yu Zhang
X. Qu
Zihan Huang
AI4CE
35
0
0
30 Mar 2025
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
66
0
0
10 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
57
0
0
27 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
81
1
0
13 Feb 2025
A transformer-based deep q learning approach for dynamic load balancing in software-defined networks
Evans Tetteh Owusu
Kwame Agyemang-Prempeh Agyekum
Marinah Benneh
Pius Ayorna
Justice Owusu Agyemang
George Nii Martey Colley
James Dzisi Gazde
33
0
0
28 Jan 2025
Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy
Chen Wang
Kaiyi Ji
Junyi Geng
Zhongqiang Ren
Taimeng Fu
...
Yi Du
Qihang Li
Y. Yang
Xiao Lin
Zhipeng Zhao
SSL
71
9
0
28 Jan 2025
Multi-Modality Collaborative Learning for Sentiment Analysis
Shanmin Wang
Chengguang Liu
Qingshan Liu
35
0
0
21 Jan 2025
Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance
Raúl Arranz
David Carramiñana
Gonzalo de Miguel
Juan A. Besada
Ana M. Bernardos
31
10
0
15 Jan 2025
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Yan Gu
Zhaoze Liu
Shuhong Dai
Cong Liu
Ying Wang
Shen Wang
Georgios Theodoropoulos
Long Cheng
28
0
0
03 Jan 2025
Amortized Bayesian Experimental Design for Decision-Making
Daolang Huang
Yujia Guo
Luigi Acerbi
Samuel Kaski
44
2
0
03 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
40
16
0
03 Jan 2025
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
59
0
0
28 Nov 2024
From Laws to Motivation: Guiding Exploration through Law-Based Reasoning and Rewards
Ziyu Chen
Zhiqing Xiao
Xinbei Jiang
Junbo Zhao
75
0
0
24 Nov 2024
TrojanRobot: Physical-World Backdoor Attacks Against VLM-based Robotic Manipulation
X. U. Wang
Hewen Pan
Hangtao Zhang
Minghui Li
Shengshan Hu
...
Peijin Guo
Yichen Wang
Wei Wan
Aishan Liu
L. Zhang
AAML
80
2
0
18 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
20
2
0
08 Nov 2024
Opportunities of Reinforcement Learning in South Africa's Just Transition
Claude Formanek
C. Tilbury
Jonathan P. Shock
62
0
0
06 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
33
0
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
21
0
0
03 Nov 2024
α-TCVAE: On the relationship between Disentanglement and Diversity
Cristian Meo
Louis Mahon
Anirudh Goyal
Justin Dauwels
DRL
59
8
0
01 Nov 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter
Yansong Li
Zeyu Dong
Ertai Luo
Yu Wu
Shuo Wu
Shuo Han
17
2
0
16 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
18
2
0
15 Oct 2024
Whole-Body Dynamic Throwing with Legged Manipulators
Humphrey Munn
Brendan Tidd
Peter Böhm
M. Gallagher
David Howard
37
1
0
08 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
32
0
0
06 Oct 2024
Distribution Guided Active Feature Acquisition
Yang Li
Junier Oliva
17
0
0
04 Oct 2024
AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking
Michelle S. Lam
Fred Hohman
Dominik Moritz
Jeffrey P. Bigham
Kenneth Holstein
Mary Beth Kery
23
1
0
26 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
31
2
0
25 Sep 2024
Fair Reinforcement Learning Algorithm for PV Active Control in LV Distribution Networks
Maurizio Vassallo
A. Benzerga
Alireza Bahmanyar
Damien Ernst
23
2
0
09 Sep 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
35
0
0
13 Aug 2024
Faster Model Predictive Control via Self-Supervised Initialization Learning
Zhaoxin Li
Letian Chen
Rohan R. Paleja
S. Nageshrao
Matthew C. Gombolay
71
1
0
06 Aug 2024
Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Xinhan Di
Jiahao Lu
Yunming Liang
Junjie Zheng
Yihua Wang
Chaofan Ding
ALM
31
1
0
01 Aug 2024
Ontology-driven Reinforcement Learning for Personalized Student Support
Ryan Hare
Ying Tang
26
1
0
14 Jul 2024
An Open-source Hardware/Software Architecture and Supporting Simulation Environment to Perform Human FPV Flight Demonstrations for Unmanned Aerial Vehicle Autonomy
Haosong Xiao
Prajit KrisshnaKumar
Jagadeswara P K V Pothuri
Puru Soni
Eric Butcher
Souma Chowdhury
25
0
0
08 Jul 2024
Pseudo-Labeling by Multi-Policy Viewfinder Network for Image Cropping
Zhiyu Pan
Kewei Wang
Yizheng Wu
Liwen Xiao
Jiahao Cui
Zhicheng Wang
Zhiguo Cao
21
0
0
02 Jul 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
22
0
0
30 Jun 2024
Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs
Shiyu Zhang
Haoyang Song
Qixin Wang
Yu Pei
29
0
0
28 Jun 2024
LiCS: Navigation using Learned-imitation on Cluttered Space
J. J. Damanik
Jae-Won Jung
Chala Adane Deresa
Han-Lim Choi
37
4
0
21 Jun 2024
Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce
Yuan Wang
Zhiyu Li
Changshuo Zhang
Sirui Chen
Xiao Zhang
Jun Xu
Quan Lin
25
1
0
20 Jun 2024
Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Bahador Beigomi
Zheng H. Zhu
16
0
0
10 Jun 2024
Online Policy Distillation with Decision-Attention
Xinqiang Yu
Chuanguang Yang
Chengqing Yu
Libo Huang
Zhulin An
Yongjun Xu
OffRL
44
0
0
08 Jun 2024
Prototypical Reward Network for Data-Efficient RLHF
Jinghan Zhang
Xiting Wang
Yiqiao Jin
Changyu Chen
Xinhao Zhang
Kunpeng Liu
ALM
26
18
0
06 Jun 2024
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Philip Anastassiou
Jiawei Chen
J. Chen
Yuanzhe Chen
Zhuo Chen
...
Wenjie Zhang
Y. Zhang
Zilin Zhao
Dejian Zhong
Xiaobin Zhuang
44
74
0
04 Jun 2024