Deep Reinforcement Learning: An Overview

25 January 2017

Papers citing "Deep Reinforcement Learning: An Overview"

50 / 418 papers shown

Title
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach Yuxuan Chen Rongpeng Li Xiaoxue Yu Zhifeng Zhao Honggang Zhang 34 9 0 03 Jun 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning Tianyu Chen Zhendong Wang Mingyuan Zhou OffRL 24 4 0 30 May 2024
A Survey on Vision-Language-Action Models for Embodied AI Yueen Ma Zixing Song Yuzheng Zhuang Jianye Hao Irwin King LM&Ro 67 41 0 23 May 2024
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning Sean Vaskov Wilko Schwarting Chris Baker 17 1 0 19 May 2024
Python-Based Reinforcement Learning on Simulink Models Georg Schafer Max Schirl Jakob Rehrl Stefan Huber Simon Hirlaender AI4CE 18 4 0 14 May 2024
PhilHumans: Benchmarking Machine Learning for Personal Health Vadim Liventsev Vivek Kumar Allmin Pradhap Singh Susaiyah Zixiu "Alex" Wu Ivan Rodin ... Milan Petkovic Diego Reforgiato Recupero Ehud Reiter Daniele Riboni Raymond Sterling AI4MH LM&MA 34 0 0 04 May 2024
Research and application of artificial intelligence based webshell detection model: A literature review Mingrui Ma Lansheng Han Chunjie Zhou 71 2 0 28 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories Ning Yang Shuo Chen Haijun Zhang Randall Berry OffRL 29 5 0 22 Apr 2024
Physics-based reward driven image analysis in microscopy Kamyar Barakati Hui Yuan Amit Goyal Sergei V. Kalinin 19 2 0 22 Apr 2024
Cooperative Sentiment Agents for Multimodal Sentiment Analysis Shan Wang Hui Shuai Qingshan Liu Fei Wang LLMAG 29 1 0 19 Apr 2024
Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation Hanlin Tian Kethan Reddy Yuxiang Feng Mohammed Quddus Y. Demiris Panagiotis Angeloudis 30 10 0 12 Apr 2024
Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning Yanjie Li Weijun Li Lina Yu Min Wu Jingyi Liu Wenqiang Li Meilan Hao Shu Wei Yusong Deng 27 2 0 09 Apr 2024
Stochastic Online Optimization for Cyber-Physical and Robotic Systems Hao Ma M. Zeilinger Michael Muehlebach 27 0 0 08 Apr 2024
From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries Ergon Cugler de Moraes Silva OffRL 34 0 0 27 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies N. Botteghi Urban Fasel AI4CE 22 6 0 22 Mar 2024
Levels of AI Agents: from Rules to Large Language Models Yu Huang AI4CE ELM LM&Ro 38 2 0 06 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation Di Zhang Moyang Wang Joseph D Mango Xiang Li Xianrui Xu 32 1 0 06 Mar 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods Zheyu Zhang AAML 16 0 0 23 Feb 2024
SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction Jie Xu Hanbo Zhang Xinghang Li Huaping Liu Xuguang Lan Tao Kong LM&Ro 30 3 0 19 Feb 2024
Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics Wang Jia Hang Xu AI4CE 18 4 0 18 Feb 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks Yaniv Cohen Tomer Gafni Ronen Greenberg Kobi Cohen 11 5 0 17 Feb 2024
Agents Need Not Know Their Purpose Paulo Garcia 11 0 0 15 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning Michael Lanier Ying Xu Nathan Jacobs Chongjie Zhang Yevgeniy Vorobeychik 21 2 0 14 Feb 2024
Steady-State Error Compensation for Reinforcement Learning with Quadratic Rewards Liyao Wang Zishun Zheng Yuan Lin 8 0 0 14 Feb 2024
ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies Jasmina Gajcin Ivana Dusparic CML OffRL 20 2 0 09 Feb 2024
Circuit Partitioning for Multi-Core Quantum Architectures with Deep Reinforcement Learning Arnau Pastor Pau Escofet Sahar Ben Rached Eduard Alarcón Pere Barlet-Ros S. Abadal GNN 24 4 0 31 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots Suning Huang Boyuan Chen Huazhe Xu Vincent Sitzmann 37 3 0 24 Jan 2024
Machine Learning on Dynamic Graphs: A Survey on Applications Sanaz Hasanzadeh Fard AI4CE 11 3 0 16 Jan 2024
Learning Crowd Behaviors in Navigation with Attention-based Spatial-Temporal Graphs Yanying Zhou Jochen Garcke GNN 40 3 0 11 Jan 2024
On Safety and Liveness Filtering Using Hamilton-Jacobi Reachability Analysis Javier Borquez Kaustav Chakraborty Hao Wang Somil Bansal 6 7 0 23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training Xinyan Chen Jiaxin Ge Tianjun Zhang Jiaming Liu Shanghang Zhang VLM EGVM 27 0 0 23 Dec 2023
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System Ruining Zhang H. Han Maolong Lv Qisong Yang Jian Cheng OffRL 13 2 0 16 Dec 2023
An Invitation to Deep Reinforcement Learning Bernhard Jaeger Andreas Geiger OffRL OOD 73 5 0 13 Dec 2023
Evolving Reservoirs for Meta Reinforcement Learning Corentin Léger Gautier Hamon Eleni Nisioti X. Hinaut Clément Moulin-Frier 21 1 0 09 Dec 2023
Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels Xiangyu Gao Yaping Sun Dongyu Wei Xiaodong Xu Hao Chen Hao Yin Shuguang Cui 21 2 0 30 Nov 2023
Two-step dynamic obstacle avoidance Fabian Hart Martin Waltz Ostap Okhrin 24 3 0 28 Nov 2023
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization Zhiyuan Zhao Bin Wang Linke Ouyang Xiao-wen Dong Jiaqi Wang Conghui He MLLM VLM 32 105 0 28 Nov 2023
Adinkra Symbol Recognition using Classical Machine Learning and Deep Learning Michael Adjeisah K. Asamoah Martha Asamoah Yeboah Raji Rafiu King Godwin Ferguson Achaab Kingsley Adjei 29 0 0 27 Nov 2023
FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts Yichen Gong Delong Ran Jinyuan Liu Conglei Wang Tianshuo Cong Anyu Wang Sisi Duan Xiaoyun Wang MLLM 129 117 0 09 Nov 2023
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation Ruomeng Ding Chaoyun Zhang Lu Wang Yong Xu Ming-Jie Ma Wei Zhang Si Qin Saravan Rajmohan Qingwei Lin Dongmei Zhang LRM 33 59 0 07 Nov 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey Gregory Palmer Chris Parry Daniel J.B. Harrold Chris Willis AI4CE 21 1 0 11 Oct 2023
Algebras of actions in an agent's representations of the world Alexander Dean Eduardo Alonso Esther Mondragón 23 0 0 02 Oct 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents Pengyu Zhao Zijian Jin Ning Cheng LLMAG 30 20 0 23 Sep 2023
Trip Planning for Autonomous Vehicles with Wireless Data Transfer Needs Using Reinforcement Learning Yousef AlSaqabi Bhaskar Krishnamachari 12 2 0 21 Sep 2023
Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing Hadar Szostak Kobi Cohen 15 3 0 14 Sep 2023
A Review on Robot Manipulation Methods in Human-Robot Interactions Haoxu Zhang P. Kebria Shady M. K. Mohamed Samson Yu Saeid Nahavandi 16 0 0 09 Sep 2023
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models Ophir M. Carmel Guy Katz 15 0 0 06 Sep 2023
Hawkeye: Change-targeted Testing for Android Apps based on Deep Reinforcement Learning Chao Peng Zhengwei Lv Jiarong Fu Jiayuan Liang Zhao Zhang Ajitha Rajan Ping Yang 11 0 0 04 Sep 2023
AlphaZero Gomoku Wen-Chieh Liang Chao Yu Brian Whiteaker Inyoung Huh Hua Shao Youzhi Liang 10 2 0 04 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey Kamal Acharya Waleed Raza Carlos Dourado Alvaro Velasquez Houbing Song NAI OffRL 19 16 0 02 Sep 2023