arXiv: 1701.07274
Deep Reinforcement Learning: An Overview

25 January 2017
Yuxi Li
    OffRL
    VLM

Papers citing "Deep Reinforcement Learning: An Overview"

50 / 418 papers shown
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
34
9
0
03 Jun 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
OffRL
24
4
0
30 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
67
41
0
23 May 2024
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning
Sean Vaskov
Wilko Schwarting
Chris Baker
17
1
0
19 May 2024
Python-Based Reinforcement Learning on Simulink Models
Georg Schafer
Max Schirl
Jakob Rehrl
Stefan Huber
Simon Hirlaender
AI4CE
18
4
0
14 May 2024
PhilHumans: Benchmarking Machine Learning for Personal Health
Vadim Liventsev
Vivek Kumar
Allmin Pradhap Singh Susaiyah
Zixiu "Alex" Wu
Ivan Rodin
...
Milan Petkovic
Diego Reforgiato Recupero
Ehud Reiter
Daniele Riboni
Raymond Sterling
AI4MH
LM&MA
34
0
0
04 May 2024
Research and application of artificial intelligence based webshell detection model: A literature review
Mingrui Ma
Lansheng Han
Chunjie Zhou
71
2
0
28 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
29
5
0
22 Apr 2024
Physics-based reward driven image analysis in microscopy
Kamyar Barakati
Hui Yuan
Amit Goyal
Sergei V. Kalinin
19
2
0
22 Apr 2024
Cooperative Sentiment Agents for Multimodal Sentiment Analysis
Shan Wang
Hui Shuai
Qingshan Liu
Fei Wang
LLMAG
29
1
0
19 Apr 2024
Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation
Hanlin Tian
Kethan Reddy
Yuxiang Feng
Mohammed Quddus
Y. Demiris
Panagiotis Angeloudis
30
10
0
12 Apr 2024
Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning
Yanjie Li
Weijun Li
Lina Yu
Min Wu
Jingyi Liu
Wenqiang Li
Meilan Hao
Shu Wei
Yusong Deng
27
2
0
09 Apr 2024
Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Hao Ma
M. Zeilinger
Michael Muehlebach
27
0
0
08 Apr 2024
From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries
Ergon Cugler de Moraes Silva
OffRL
34
0
0
27 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies
N. Botteghi
Urban Fasel
AI4CE
22
6
0
22 Mar 2024
Levels of AI Agents: from Rules to Large Language Models
Yu Huang
AI4CE
ELM
LM&Ro
38
2
0
06 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation
Di Zhang
Moyang Wang
Joseph D Mango
Xiang Li
Xianrui Xu
32
1
0
06 Mar 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods
Zheyu Zhang
AAML
16
0
0
23 Feb 2024
SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction
Jie Xu
Hanbo Zhang
Xinghang Li
Huaping Liu
Xuguang Lan
Tao Kong
LM&Ro
30
3
0
19 Feb 2024
Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics
Wang Jia
Hang Xu
AI4CE
18
4
0
18 Feb 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
Yaniv Cohen
Tomer Gafni
Ronen Greenberg
Kobi Cohen
11
5
0
17 Feb 2024
Agents Need Not Know Their Purpose
Paulo Garcia
11
0
0
15 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
21
2
0
14 Feb 2024
Steady-State Error Compensation for Reinforcement Learning with Quadratic Rewards
Liyao Wang
Zishun Zheng
Yuan Lin
8
0
0
14 Feb 2024
ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies
Jasmina Gajcin
Ivana Dusparic
CML
OffRL
20
2
0
09 Feb 2024
Circuit Partitioning for Multi-Core Quantum Architectures with Deep Reinforcement Learning
Arnau Pastor
Pau Escofet
Sahar Ben Rached
Eduard Alarcón
Pere Barlet-Ros
S. Abadal
GNN
24
4
0
31 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots
Suning Huang
Boyuan Chen
Huazhe Xu
Vincent Sitzmann
37
3
0
24 Jan 2024
Machine Learning on Dynamic Graphs: A Survey on Applications
Sanaz Hasanzadeh Fard
AI4CE
11
3
0
16 Jan 2024
Learning Crowd Behaviors in Navigation with Attention-based Spatial-Temporal Graphs
Yanying Zhou
Jochen Garcke
GNN
40
3
0
11 Jan 2024
On Safety and Liveness Filtering Using Hamilton-Jacobi Reachability Analysis
Javier Borquez
Kaustav Chakraborty
Hao Wang
Somil Bansal
6
7
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
27
0
0
23 Dec 2023
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System
Ruining Zhang
H. Han
Maolong Lv
Qisong Yang
Jian Cheng
OffRL
13
2
0
16 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
73
5
0
13 Dec 2023
Evolving Reservoirs for Meta Reinforcement Learning
Corentin Léger
Gautier Hamon
Eleni Nisioti
X. Hinaut
Clément Moulin-Frier
21
1
0
09 Dec 2023
Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels
Xiangyu Gao
Yaping Sun
Dongyu Wei
Xiaodong Xu
Hao Chen
Hao Yin
Shuguang Cui
21
2
0
30 Nov 2023
Two-step dynamic obstacle avoidance
Fabian Hart
Martin Waltz
Ostap Okhrin
24
3
0
28 Nov 2023
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Zhiyuan Zhao
Bin Wang
Linke Ouyang
Xiao-wen Dong
Jiaqi Wang
Conghui He
MLLM
VLM
32
105
0
28 Nov 2023
Adinkra Symbol Recognition using Classical Machine Learning and Deep Learning
Michael Adjeisah
K. Asamoah
Martha Asamoah Yeboah
Raji Rafiu King
Godwin Ferguson Achaab
Kingsley Adjei
29
0
0
27 Nov 2023
FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts
Yichen Gong
Delong Ran
Jinyuan Liu
Conglei Wang
Tianshuo Cong
Anyu Wang
Sisi Duan
Xiaoyun Wang
MLLM
129
117
0
09 Nov 2023
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation
Ruomeng Ding
Chaoyun Zhang
Lu Wang
Yong Xu
Ming-Jie Ma
Wei Zhang
Si Qin
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
LRM
33
59
0
07 Nov 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
21
1
0
11 Oct 2023
Algebras of actions in an agent's representations of the world
Alexander Dean
Eduardo Alonso
Esther Mondragón
23
0
0
02 Oct 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
30
20
0
23 Sep 2023
Trip Planning for Autonomous Vehicles with Wireless Data Transfer Needs Using Reinforcement Learning
Yousef AlSaqabi
Bhaskar Krishnamachari
12
2
0
21 Sep 2023
Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing
Hadar Szostak
Kobi Cohen
15
3
0
14 Sep 2023
A Review on Robot Manipulation Methods in Human-Robot Interactions
Haoxu Zhang
P. Kebria
Shady M. K. Mohamed
Samson Yu
Saeid Nahavandi
16
0
0
09 Sep 2023
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
Ophir M. Carmel
Guy Katz
15
0
0
06 Sep 2023
Hawkeye: Change-targeted Testing for Android Apps based on Deep Reinforcement Learning
Chao Peng
Zhengwei Lv
Jiarong Fu
Jiayuan Liang
Zhao Zhang
Ajitha Rajan
Ping Yang
11
0
0
04 Sep 2023
AlphaZero Gomoku
Wen-Chieh Liang
Chao Yu
Brian Whiteaker
Inyoung Huh
Hua Shao
Youzhi Liang
10
2
0
04 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAI
OffRL
19
16
0
02 Sep 2023