Proximal Policy Optimization Algorithms

20 July 2017

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,422 papers shown

A DIRT-T Approach to Unsupervised Domain Adaptation

187

653

23 Feb 2018

Verifying Controllers Against Adversarial Examples with Bayesian Optimization

163

23 Feb 2018

Structured Control Nets for Deep Reinforcement Learning

Mario Srouji

Jian Zhang

Ruslan Salakhutdinov

154

22 Feb 2018

Clipped Action Policy Gradient

Yasuhiro Fujita

S. Maeda

OffRL

126

21 Feb 2018

Learning to Play with Intrinsically-Motivated Self-Aware Agents

Li Fei-Fei

189

122

21 Feb 2018

Fourier Policy Gradients

M. Fellows

K. Ciosek

Shimon Whiteson

164

19 Feb 2018

Learning High-level Representations from Demonstrations

Garrett Andersen

Peter Vrancx

Haitham Bou-Ammar

125

19 Feb 2018

GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms

Cédric Colas

Olivier Sigaud

Pierre-Yves Oudeyer

302

165

14 Feb 2018

Evolved Policy Gradients

Pieter Abbeel

400

235

13 Feb 2018

Diversity-Driven Exploration Strategy for Deep Reinforcement Learning

Chun-Yi Lee

220

138

13 Feb 2018

Learning Robust and Adaptive Real-World Continuous Control Using Simulation and Transfer Learning

M. Ferguson

K. Law

13 Feb 2018

Hierarchical Learning for Modular Robots

112

12 Feb 2018

Towards self-adaptable robots: from programming to training machines

12 Feb 2018

VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control

Wolfram Burgard

509

132

01 Feb 2018

Learning Symmetric and Low-energy Locomotion

Wenhao Yu

Greg Turk

Chenxi Liu

357

207

24 Jan 2018

An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients

Jiaming Song

Yuhuai Wu

17 Jan 2018

Model-Based Action Exploration for Learning Dynamic Motion Skills

Glen Berseth

M. van de Panne

108

11 Jan 2018

Expected Policy Gradients for Reinforcement Learning

K. Ciosek

Shimon Whiteson

296

10 Jan 2018

Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes

Henryk Michalewski

200

09 Jan 2018

Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations

Xingyu Wang

Diego Klabjan

114

07 Jan 2018

Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning

263

119

04 Jan 2018

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Tuomas Haarnoja

Aurick Zhou

Pieter Abbeel

Sergey Levine

2.5K

10,124

04 Jan 2018

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

293

29 Dec 2017

Boosting the Actor with Dual Critic

International Conference on Learning Representations (ICLR), 2017

152

29 Dec 2017

RLlib: Abstractions for Distributed Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2017

305

178

26 Dec 2017

Safe Policy Improvement with Baseline Bootstrapping

Romain Laroche

P. Trichelair

Rémi Tachet des Combes

OffRL

384

214

19 Dec 2017

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

Jeff Clune

438

735

18 Dec 2017

Ray: A Distributed Framework for Emerging AI Applications

...

490

1,496

16 Dec 2017

Bayesian Policy Gradients via Alpha Divergence Dropout Inference

120

06 Dec 2017

A Deeper Look at Experience Replay

Shangtong Zhang

R. Sutton

OffRL VLM

370

306

04 Dec 2017

Progressive Neural Architecture Search

Li Fei-Fei

499

2,097

02 Dec 2017

Time Limits in Reinforcement Learning

288

175

01 Dec 2017

Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control

Shangtong Zhang

Osmar R. Zaiane

145

30 Nov 2017

Learnings Options End-to-End for Continuous Action Tasks

Martin Klissarov

Pierre-Luc Bacon

J. Harb

Doina Precup

194

30 Nov 2017

Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing

M. Plessen

29 Nov 2017

Cascade Attribute Learning Network

Zhuo Xu

Haonan Chang

Masayoshi Tomizuka

24 Nov 2017

Action Branching Architectures for Deep Reinforcement Learning

Arash Tavakoli

Fabio Pardo

Petar Kormushev

255

299

24 Nov 2017

Run, skeleton, run: skeletal model in a physics-based simulation

203

18 Nov 2017

Worm-level Control through Search-based Reinforcement Learning

Mathias Lechner

Radu Grosu

Ramin M. Hasani

09 Nov 2017

Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?

496

07 Nov 2017

Policy Optimization by Genetic Distillation

Tanmay Gangwani

Jian-wei Peng

189

03 Nov 2017

Automata-Guided Hierarchical Reinforcement Learning for Skill Composition

Xiao Li

Yao Ma

C. Belta

31 Oct 2017

Action-depedent Control Variates for Policy Optimization via Stein's Identity

277

30 Oct 2017

Transfer Learning to Learn with Multitask Neural Model Search

Catherine Wong

Andrea Gesmundo

30 Oct 2017

Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning

Sergio Valcarcel Macua

Aleksi Tukiainen

D. Hernández

David Baldazo

Enrique Munoz de Cote

S. Zazo

345

28 Oct 2017

Meta Learning Shared Hierarchies

International Conference on Learning Representations (ICLR), 2017

Pieter Abbeel

200

372

26 Oct 2017

Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation

Pieter Abbeel

418

724

12 Oct 2017

Emergent Complexity via Multi-Agent Competition

295

418

10 Oct 2017

Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

Pieter Abbeel

219

367

10 Oct 2017

Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge

370

08 Oct 2017