Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1701.07274
Cited By
Deep Reinforcement Learning: An Overview
25 January 2017
Yuxi Li
OffRL
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning: An Overview"
50 / 418 papers shown
Title
Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition
Zeki Doruk Erden
Donia Gasmi
Boi Faltings
CLL
21
0
0
13 May 2025
Adaptive Bias Generalized Rollout Policy Adaptation on the Flexible Job-Shop Scheduling Problem
Lotfi Kobrosly
Marc-Emmanuel Coupvent des Graviers
Christophe Guettier
Tristan Cazenave
18
0
0
13 May 2025
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
Chris Child
Lam Ngo
44
0
0
29 Apr 2025
ARMOR: Adaptive Meshing with Reinforcement Optimization for Real-time 3D Monitoring in Unexposed Scenes
Yizhe Zhang
Jianping Li
Xin Zhao
Fuxun Liang
Z. Dong
Bisheng Yang
AI4CE
24
0
0
28 Apr 2025
Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks
Tran Thuy Nga Truong
Jooyong Kim
19
0
0
24 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
22
0
0
19 Apr 2025
Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems
Xian Chen
R. Qu
Jing Dong
Ruibin Bai
Yaochu Jin
OffRL
22
0
0
10 Apr 2025
Anomaly Detection in Time Series Data Using Reinforcement Learning, Variational Autoencoder, and Active Learning
Bahareh Golchin
Banafsheh Rekabdar
AI4TS
25
0
0
03 Apr 2025
Minimum Description Length of a Spectrum Variational Autoencoder: A Theory
Canlin Zhang
Xiuwen Liu
38
0
0
01 Apr 2025
Reinforcement Learning for Active Matter
Wenjie Cai
Gongyi Wang
Yu Zhang
X. Qu
Zihan Huang
AI4CE
35
0
0
30 Mar 2025
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
66
0
0
10 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
57
0
0
27 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
81
1
0
13 Feb 2025
A transformer-based deep q learning approach for dynamic load balancing in software-defined networks
Evans Tetteh Owusu
Kwame Agyemang-Prempeh Agyekum
Marinah Benneh
Pius Ayorna
Justice Owusu Agyemang
George Nii Martey Colley
James Dzisi Gazde
33
0
0
28 Jan 2025
Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy
Chen Wang
Kaiyi Ji
Junyi Geng
Zhongqiang Ren
Taimeng Fu
...
Yi Du
Qihang Li
Y. Yang
Xiao Lin
Zhipeng Zhao
SSL
71
9
0
28 Jan 2025
Multi-Modality Collaborative Learning for Sentiment Analysis
Shanmin Wang
Chengguang Liu
Qingshan Liu
35
0
0
21 Jan 2025
Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance
Raúl Arranz
David Carramiñana
Gonzalo de Miguel
Juan A. Besada
Ana M. Bernardos
31
10
0
15 Jan 2025
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Yan Gu
Zhaoze Liu
Shuhong Dai
Cong Liu
Ying Wang
Shen Wang
Georgios Theodoropoulos
Long Cheng
28
0
0
03 Jan 2025
Amortized Bayesian Experimental Design for Decision-Making
Daolang Huang
Yujia Guo
Luigi Acerbi
Samuel Kaski
44
2
0
03 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
40
16
0
03 Jan 2025
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
59
0
0
28 Nov 2024
From Laws to Motivation: Guiding Exploration through Law-Based Reasoning and Rewards
Ziyu Chen
Zhiqing Xiao
Xinbei Jiang
Junbo Zhao
75
0
0
24 Nov 2024
TrojanRobot: Physical-World Backdoor Attacks Against VLM-based Robotic Manipulation
X. U. Wang
Hewen Pan
Hangtao Zhang
Minghui Li
Shengshan Hu
...
Peijin Guo
Yichen Wang
Wei Wan
Aishan Liu
L. Zhang
AAML
80
2
0
18 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
20
2
0
08 Nov 2024
Opportunities of Reinforcement Learning in South Africa's Just Transition
Claude Formanek
C. Tilbury
Jonathan P. Shock
62
0
0
06 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
33
0
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
21
0
0
03 Nov 2024
α
α
α
-TCVAE: On the relationship between Disentanglement and Diversity
Cristian Meo
Louis Mahon
Anirudh Goyal
Justin Dauwels
DRL
59
8
0
01 Nov 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter
Yansong Li
Zeyu Dong
Ertai Luo
Yu Wu
Shuo Wu
Shuo Han
17
2
0
16 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
18
2
0
15 Oct 2024
Whole-Body Dynamic Throwing with Legged Manipulators
Humphrey Munn
Brendan Tidd
Peter Böhm
M. Gallagher
David Howard
37
1
0
08 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
32
0
0
06 Oct 2024
Distribution Guided Active Feature Acquisition
Yang Li
Junier Oliva
17
0
0
04 Oct 2024
AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking
Michelle S. Lam
Fred Hohman
Dominik Moritz
Jeffrey P. Bigham
Kenneth Holstein
Mary Beth Kery
23
1
0
26 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
31
2
0
25 Sep 2024
Fair Reinforcement Learning Algorithm for PV Active Control in LV Distribution Networks
Maurizio Vassallo
A. Benzerga
Alireza Bahmanyar
Damien Ernst
23
2
0
09 Sep 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
35
0
0
13 Aug 2024
Faster Model Predictive Control via Self-Supervised Initialization Learning
Zhaoxin Li
Letian Chen
Rohan R. Paleja
S. Nageshrao
Matthew C. Gombolay
Matthew Gombolay
71
1
0
06 Aug 2024
Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Xinhan Di
Jiahao Lu
Yunming Liang
Junjie Zheng
Yihua Wang
Chaofan Ding
ALM
31
1
0
01 Aug 2024
Ontology-driven Reinforcement Learning for Personalized Student Support
Ryan Hare
Ying Tang
26
1
0
14 Jul 2024
An Open-source Hardware/Software Architecture and Supporting Simulation Environment to Perform Human FPV Flight Demonstrations for Unmanned Aerial Vehicle Autonomy
Haosong Xiao
Prajit KrisshnaKumar
Jagadeswara P K V Pothuri
Puru Soni
Eric Butcher
Souma Chowdhury
25
0
0
08 Jul 2024
Pseudo-Labeling by Multi-Policy Viewfinder Network for Image Cropping
Zhiyu Pan
Kewei Wang
Yizheng Wu
Liwen Xiao
Jiahao Cui
Zhicheng Wang
Zhiguo Cao
21
0
0
02 Jul 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
22
0
0
30 Jun 2024
Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs
Shiyu Zhang
Haoyang Song
Qixin Wang
Yu Pei
29
0
0
28 Jun 2024
LiCS: Navigation using Learned-imitation on Cluttered Space
J. J. Damanik
Jae-Won Jung
Chala Adane Deresa
Han-Lim Choi
37
4
0
21 Jun 2024
Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce
Yuan Wang
Zhiyu Li
Changshuo Zhang
Sirui Chen
Xiao Zhang
Jun Xu
Quan Lin
25
1
0
20 Jun 2024
Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Bahador Beigomi
Zheng H. Zhu
16
0
0
10 Jun 2024
Online Policy Distillation with Decision-Attention
Xinqiang Yu
Chuanguang Yang
Chengqing Yu
Libo Huang
Zhulin An
Yongjun Xu
OffRL
44
0
0
08 Jun 2024
Prototypical Reward Network for Data-Efficient RLHF
Jinghan Zhang
Xiting Wang
Yiqiao Jin
Changyu Chen
Xinhao Zhang
Kunpeng Liu
ALM
26
18
0
06 Jun 2024
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Philip Anastassiou
Jiawei Chen
J. Chen
Yuanzhe Chen
Zhuo Chen
...
Wenjie Zhang
Y. Zhang
Zilin Zhao
Dejian Zhong
Xiaobin Zhuang
44
74
0
04 Jun 2024
1
2
3
4
5
6
7
8
9
Next