Deep Reinforcement Learning: An Overview

25 January 2017
Yuxi Li
    OffRL
    VLM

Papers citing "Deep Reinforcement Learning: An Overview"

50 / 418 papers shown
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
Dor Livne
Kobi Cohen
14
50
0
14 Jan 2020
Sparse Black-box Video Attack with Reinforcement Learning
Xingxing Wei
Huanqian Yan
Bo-wen Li
AAML
15
49
0
11 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
12
188
0
23 Dec 2019
A Survey of Deep Learning Applications to Autonomous Vehicle Control
Sampo Kuutti
Richard Bowden
Yaochu Jin
P. Barber
Saber Fallah
6
506
0
23 Dec 2019
Mix and Match: Markov Chains & Mixing Times for Matching in Rideshare
Michael J. Curry
John P. Dickerson
Karthik Abinav Sankararaman
A. Srinivasan
Yuhao Wan
Pan Xu
11
6
0
30 Nov 2019
An Autonomous Spectrum Management Scheme for Unmanned Aerial Vehicle Networks in Disaster Relief Operations
Alireza Shamsoshoara
Fatemeh Afghah
Abolfazl Razi
Sajad Mousavi
Jonathan D. Ashdown
Kurt Turk
11
47
0
26 Nov 2019
Word Sense Disambiguation using Knowledge-based Word Similarity
Sunjae Kwon
Dongsuk Oh
Youngjoong Ko
6
0
0
11 Nov 2019
Response to NITRD, NCO, NSF Request for Information on "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan"
J. Amundson
J. Annis
Camille Avestruz
D. Bowring
J. Caldeira
...
N. Tran
S. Trivedi
L. Trouille
W. L. K. Wu
C. Bom
7
11
0
05 Nov 2019
Quantum enhancements for deep reinforcement learning in large spaces
Sofiene Jerbi
Lea M. Trenkwalder
Hendrik Poulsen Nautrup
H. Briegel
Vedran Dunjko
14
5
0
28 Oct 2019
Convergent Policy Optimization for Safe Reinforcement Learning
Ming Yu
Zhuoran Yang
Mladen Kolar
Zhaoran Wang
8
91
0
26 Oct 2019
Autonomous Industrial Management via Reinforcement Learning: Self-Learning Agents for Decision-Making -- A Review
L. E. Leal
Magnus Westerlund
A. Chapman
6
0
0
20 Oct 2019
RLCard: A Toolkit for Reinforcement Learning in Card Games
Daochen Zha
Kwei-Herng Lai
Yuanpu Cao
Songyi Huang
Ruzhe Wei
Junyu Guo
Xia Hu
OffRL
10
58
0
10 Oct 2019
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction
Pan Xu
F. Gao
Quanquan Gu
23
83
0
18 Sep 2019
Learning Index Selection with Structured Action Spaces
Jeremy Welborn
Michael Schaarschmidt
Eiko Yoneki
9
10
0
16 Sep 2019
Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning
Liheng Chen
Hongyi Guo
Yali Du
Fei Fang
Haifeng Zhang
Yaoming Zhu
Ming Zhou
Weinan Zhang
Qing Wang
Yong Yu
8
8
0
10 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
6
236
0
29 Aug 2019
Ensemble-Based Deep Reinforcement Learning for Chatbots
Heriberto Cuayáhuitl
Donghyeon Lee
Seonghan Ryu
Yongjin Cho
Sungja Choi
Satish Reddy Indurthi
Seunghak Yu
Hyungtak Choi
Inchul Hwang
J. Kim
OffRL
6
70
0
27 Aug 2019
Reinforcement Learning in Healthcare: A Survey
Chao Yu
Jiming Liu
S. Nemati
LM&MA
OffRL
6
545
0
22 Aug 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
38
408
0
11 Aug 2019
Attention Control with Metric Learning Alignment for Image Set-based Recognition
Xiaofeng Liu
A. Marques
J. You
G. Giannakis
CVBM
6
10
0
05 Aug 2019
Multi-Point Bandit Algorithms for Nonstationary Online Nonconvex Optimization
Abhishek Roy
Krishnakumar Balasubramanian
Saeed Ghadimi
P. Mohapatra
OffRL
17
15
0
31 Jul 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation
Jason Zhang
Zhang
Junming Yin
Dongwon Lee
Linhong Zhu
22
2
0
26 Jul 2019
Action Guidance with MCTS for Deep Reinforcement Learning
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
15
18
0
25 Jul 2019
Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
Xiaofei Wang
Yiwen Han
Victor C. M. Leung
Dusit Niyato
Xueqiang Yan
Xu Chen
6
974
0
19 Jul 2019
Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration
Qisheng Wang
Qichao Wang
13
1
0
18 Jul 2019
An Inductive Synthesis Framework for Verifiable Reinforcement Learning
He Zhu
Zikang Xiong
Stephen Magill
Suresh Jagannathan
17
93
0
16 Jul 2019
Dependency-aware Attention Control for Unconstrained Face Recognition with Image Sets
Xiaofeng Liu
B. Kumar
Chao Yang
Qingming Tang
J. You
CVBM
13
42
0
05 Jul 2019
A Survey of Multi-Access Edge Computing in 5G and Beyond: Fundamentals, Technology Integration, and State-of-the-Art
Viet Quoc Pham
Fang Fang
H. Vu
Md. Jalil Piran
Mai Le
L. Le
W. Hwang
Z. Ding
11
595
0
20 Jun 2019
A Survey of Optimization Methods from a Machine Learning Perspective
Shiliang Sun
Zehui Cao
Han Zhu
Jing Zhao
14
548
0
17 Jun 2019
RL4health: Crowdsourcing Reinforcement Learning for Knee Replacement Pathway Optimization
Hao Lu
Mengdi Wang
OffRL
8
4
0
24 May 2019
CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
Huichu Zhang
Siyuan Feng
Chang-rui Liu
Yaoyao Ding
Yichen Zhu
Zihan Zhou
Weinan Zhang
Yong Yu
Haiming Jin
Z. Li
AI4TS
AI4CE
11
266
0
13 May 2019
Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations
Aditya Modi
Debadeepta Dey
Alekh Agarwal
Adith Swaminathan
Besmira Nushi
Sean Andrist
Eric Horvitz
OffRL
LRM
9
1
0
12 May 2019
Autonomous Voltage Control for Grid Operation Using Deep Reinforcement Learning
R. Diao
Zhiwei Wang
Di Shi
Qianyun Chang
Jiajun Duan
Xiaohu Zhang
AI4CE
9
87
0
24 Apr 2019
A Solution for Dynamic Spectrum Management in Mission-Critical UAV Networks
Alireza Shamsoshoara
M. Khaledi
Fatemeh Afghah
Abolfazl Razi
Jonathan D. Ashdown
K. Turck
6
26
0
16 Apr 2019
Safer Deep RL with Shallow MCTS: A Case Study in Pommerman
Bilal Kartal
Pablo Hernandez-Leal
Chao Gao
Matthew E. Taylor
OffRL
12
7
0
10 Apr 2019
Reducing catastrophic forgetting when evolving neural networks
Joseph Early
9
2
0
05 Apr 2019
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
12
4
0
03 Apr 2019
Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight
Katie Kang
Suneel Belkhale
G. Kahn
Pieter Abbeel
Sergey Levine
11
122
0
11 Feb 2019
Safe, Efficient, and Comfortable Velocity Control based on Reinforcement Learning for Autonomous Driving
Meixin Zhu
Yinhai Wang
Ziyuan Pu
Jingyun Hu
X. Wang
Ruimin Ke
16
325
0
29 Jan 2019
Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication
E. Pesce
Giovanni Montana
13
70
0
12 Jan 2019
Human-Like Autonomous Car-Following Model with Deep Reinforcement Learning
Meixin Zhu
X. Wang
Yinhai Wang
11
402
0
03 Jan 2019
Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
Thanh Thi Nguyen
Ngoc Duy Nguyen
S. Nahavandi
11
762
0
31 Dec 2018
AME Blockchain: An Architecture Design for Closed-Loop Fluid Economy Token System
Lanny Z. N. Yuan
Huaibing Jian
Peng Liu
Pengxin Zhu
ShanYang Fu
12
0
0
18 Dec 2018
Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis
Côme Huré
H. Pham
Achref Bachouch
N. Langrené
8
64
0
11 Dec 2018
Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research
Peter Henderson
Emma Brunskill
AI4CE
24
3
0
03 Dec 2018
AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity
Yibo Zeng
Fei Feng
W. Yin
11
3
0
03 Dec 2018
A Study on Dialogue Reward Prediction for Open-Ended Conversational Agents
Heriberto Cuayáhuitl
Seonghan Ryu
Donghyeon Lee
J. Kim
8
6
0
02 Dec 2018
Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
OffRL
17
9
0
30 Nov 2018
Distributed traffic light control at uncoupled intersections with real-world topology by deep reinforcement learning
Mark Schutera
Niklas Goby
S. Smolarek
Markus Reischl
11
7
0
27 Nov 2018
A Novel Learning-based Global Path Planning Algorithm for Planetary Rovers
Jiang Zhang
Yuanqing Xia
Ganghui Shen
17
35
0
23 Nov 2018