Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.06339
Cited By
Deep Reinforcement Learning
15 October 2018
Yuxi Li
VLM
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning"
27 / 27 papers shown
Title
Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains
Manon Flageat
Félix Chalumeau
Antoine Cully
23
26
0
24 Oct 2022
Policy Gradients for Probabilistic Constrained Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
13
6
0
02 Oct 2022
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
D. Mguni
Aivar Sootla
Juliusz Ziomek
Oliver Slumbers
Zipeng Dai
Kun Shao
J. Wang
26
6
0
31 May 2022
RACE: A Reinforcement Learning Framework for Improved Adaptive Control of NoC Channel Buffers
Kamil Khan
S. Pasricha
R. Kim
24
2
0
26 May 2022
Quantile-Based Policy Optimization for Reinforcement Learning
Jinyang Jiang
Jiaqiao Hu
Yijie Peng
8
7
0
27 Jan 2022
Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies
Yu Huang
Yue Chen
3DPC
29
81
0
10 Jun 2020
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
32
666
0
21 Sep 2018
Bayesian Model-Agnostic Meta-Learning
Taesup Kim
Jaesik Yoon
Ousmane Amadou Dia
Sungwoong Kim
Yoshua Bengio
Sungjin Ahn
UQCV
BDL
191
498
0
11 Jun 2018
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn
Kelvin Xu
Sergey Levine
BDL
165
666
0
07 Jun 2018
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Xue Bin Peng
Pieter Abbeel
Sergey Levine
M. van de Panne
AI4CE
150
495
0
08 Apr 2018
Iterative Visual Reasoning Beyond Convolutions
Xinlei Chen
Li-Jia Li
Li Fei-Fei
Abhinav Gupta
LRM
GNN
24
212
0
29 Mar 2018
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car
Michael Bechtel
Elise McEllhiney
Minje Kim
H. Yun
11
103
0
19 Dec 2017
Building machines that adapt and compute like brains
Brenden Lake
J. Tenenbaum
AI4CE
FedML
NAI
AILaw
243
893
0
11 Nov 2017
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
118
927
0
07 Jul 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
243
11,659
0
09 Mar 2017
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip H. S. Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
109
594
0
28 Feb 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
254
1,895
0
10 Jan 2017
An Alternative Softmax Operator for Reinforcement Learning
Kavosh Asadi
Michael L. Littman
6
10
0
16 Dec 2016
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
258
1,398
0
01 Dec 2016
Dialogue Learning With Human-In-The-Loop
Jiwei Li
Alexander H. Miller
S. Chopra
MarcÁurelio Ranzato
Jason Weston
OffRL
216
134
0
29 Nov 2016
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
264
5,319
0
05 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,724
0
26 Sep 2016
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri
Jing He
Kaheer Suleman
49
117
0
30 Jun 2016
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
198
1,325
0
05 Jun 2016
Learning Representations for Counterfactual Inference
Fredrik D. Johansson
Uri Shalit
David Sontag
CML
OOD
BDL
207
718
0
12 May 2016
Q-learning with censored data
Y. Goldberg
Michael R. Kosorok
OffRL
57
135
0
30 May 2012
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
155
221
0
22 May 2012
1