v1v2 (latest)

Proximal Policy Optimization Algorithms

20 July 2017

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,424 papers shown

Verifiable Reinforcement Learning via Policy Extraction

332

371

22 May 2018

Evolution-Guided Policy Gradient in Reinforcement Learning

Shauharda Khadka

Kagan Tumer

294

271

21 May 2018

Constrained Policy Improvement for Safe and Efficient Reinforcement Learning

221

20 May 2018

Unsupervised Video Object Segmentation for Deep Reinforcement Learning

207

20 May 2018

Deep Dynamical Modeling and Control of Unsteady Fluid Flows

Jeremy Morton

F. Witherden

A. Jameson

Mykel J. Kochenderfer

AI4CE

215

180

18 May 2018

Learning Time-Sensitive Strategies in Space Fortress

Akshat Agarwal

Ryan Hope

Katia Sycara

203

17 May 2018

Task Agnostic Robust Learning on Corrupt Outputs by Correlation-Guided Mixture Density Networks

254

16 May 2018

257

13 May 2018

Policy Optimization with Second-Order Advantage Information

Jiajin Li

Baoxiang Wang

152

09 May 2018

Reward Estimation for Variance Reduction in Deep Reinforcement Learning

Joshua Romoff

Peter Henderson

Alexandre Piché

Vincent François-Lavet

Joelle Pineau

325

09 May 2018

Deep Reinforcement Learning for Playing 2.5D Fighting Games

05 May 2018

Decoupling Dynamics and Reward for Transfer Learning

215

27 Apr 2018

Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments

206

27 Apr 2018

Sim-to-Real: Learning Agile Locomotion For Quadruped Robots

Jie Tan

Tingnan Zhang

386

900

27 Apr 2018

Distributed Distributional Deterministic Policy GradientsInternational Conference on Learning Representations (ICLR), 2018

David Budden

Dan Horgan

261

520

23 Apr 2018

Vehicle Communication Strategies for Simulated Highway Driving

Cinjon Resnick

I. Kulikov

Dong Wang

Jason Weston

161

19 Apr 2018

An Adaptive Clipping Approach for Proximal Policy Optimization

Gang Chen

Yiming Peng

Mengjie Zhang

112

17 Apr 2018

On Learning Intrinsic Rewards for Policy Gradient Methods

Zeyu Zheng

Junhyuk Oh

Satinder Singh

278

224

17 Apr 2018

Rafiki: Machine Learning as an Analytics Service System

248

122

17 Apr 2018

Intrinsically motivated reinforcement learning for human-robot interaction in the real-world

105

14 Apr 2018

Reinforcement Learning for UAV Attitude Control

135

438

11 Apr 2018

Gotta Learn Fast: A New Benchmark for Generalization in RL

187

183

10 Apr 2018

Latent Space Policies for Hierarchical Reinforcement Learning

Pieter Abbeel

189

202

09 Apr 2018

DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills

Pieter Abbeel

470

557

08 Apr 2018

Structured Evolution with Compact Architectures for Scalable Policy Optimization

270

158

06 Apr 2018

Information Maximizing Exploration with a Latent Dynamics Model

Trevor Barron

Oliver Obst

H. B. Amor

04 Apr 2018

Renewal Monte Carlo: Renewal theory based reinforcement learning

Jayakumar Subramanian

Aditya Mahajan

03 Apr 2018

StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning

Youssef Attia El Hili

Yuanheng Zhu

Dongbin Zhao

221

182

03 Apr 2018

Universal Planning Networks

Pieter Abbeel

183

146

02 Apr 2018

Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments

L. Kidzinski

Sharada Mohanty

Carmichael F. Ong

Zhewei Huang

Shuchang Zhou

...

188

02 Apr 2018

Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning

167

31 Mar 2018

Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system

Aravind Rajeswaran

146

28 Mar 2018

Long short-term memory and learning-to-learn in networks of spiking neurons

507

543

26 Mar 2018

Neuronal Circuit Policies

Mathias Lechner

Ramin M. Hasani

Radu Grosu

22 Mar 2018

Automated Curriculum Learning by Rewarding Temporally Rare Events

Niels Justesen

S. Risi

OffRL

146

19 Mar 2018

Simple random search provides a competitive approach to reinforcement learning

Horia Mania

Aurelia Guy

Benjamin Recht

201

330

19 Mar 2018

Feedback Control For Cassie With Deep Reinforcement LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2018

243

199

15 Mar 2018

Learning to Explore with Meta-Policy GradientInternational Conference on Machine Learning (ICML), 2018

149

13 Mar 2018

Policy Search in Continuous Action Domains: an OverviewNeural Networks (NN), 2018

Olivier Sigaud

F. Stulp

310

13 Mar 2018

Deep Learning in Mobile and Wireless Networking: A SurveyIEEE Communications Surveys and Tutorials (COMST), 2018

Chaoyun Zhang

P. Patras

Hamed Haddadi

357

1,428

12 Mar 2018

Accelerated Methods for Deep Reinforcement Learning

Adam Stooke

Pieter Abbeel

OffRL OnRL

149

141

07 Mar 2018

Transfer Learning with Neural AutoML

263

118

07 Mar 2018

Discontinuity-Sensitive Optimal Control Learning by Mixture of Experts

Gao Tang

Kris K. Hauser

07 Mar 2018

Smoothed Action Value Functions for Learning Gaussian Policies

258

06 Mar 2018

Some Considerations on Learning to Explore via Meta-Reinforcement Learning

Pieter Abbeel

192

122

03 Mar 2018

Deep Reinforcement Learning for Join Order Enumeration

Ryan Marcus

Olga Papaemmanouil

284

252

28 Feb 2018

Model-Ensemble Trust-Region Policy OptimizationInternational Conference on Learning Representations (ICLR), 2018

Pieter Abbeel

289

474

28 Feb 2018

Computational Theories of Curiosity-Driven Learning

Pierre-Yves Oudeyer

198

28 Feb 2018

The Mirage of Action-Dependent Baselines in Reinforcement LearningInternational Conference on Machine Learning (ICML), 2018

296

137

27 Feb 2018

Reinforcement and Imitation Learning for Diverse Visuomotor Skills

...

377

334

26 Feb 2018