ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 7,402 papers shown
Title
DREAM: Deep Regret minimization with Advantage baselines and Model-free
  learning
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
53
53
0
18 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
19
28
0
18 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
43
74
0
16 Jun 2020
Solving the Order Batching and Sequencing Problem using Deep
  Reinforcement Learning
Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning
Bram Cals
Yingqian Zhang
R. Dijkman
Claudy van Dorst
OffRL
25
29
0
16 Jun 2020
Semantic Curiosity for Active Visual Learning
Semantic Curiosity for Active Visual Learning
Devendra Singh Chaplot
Helen Jiang
Saurabh Gupta
Abhinav Gupta
ObjD
30
72
0
16 Jun 2020
Robot Perception enables Complex Navigation Behavior via Self-Supervised
  Learning
Robot Perception enables Complex Navigation Behavior via Self-Supervised Learning
Marvin Chancán
Michael Milford
SSL
26
0
0
16 Jun 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
James Ferlez
Mahmoud M. Elnaggar
Yasser Shoukry
C. Fleming
AAML
62
33
0
16 Jun 2020
Designing high-fidelity multi-qubit gates for semiconductor quantum dots
  through deep reinforcement learning
Designing high-fidelity multi-qubit gates for semiconductor quantum dots through deep reinforcement learning
Sahar Daraeizadeh
S. Premaratne
A. Matsuura
22
5
0
15 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
38
83
0
15 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
31
222
0
14 Jun 2020
Reinforcement Learning with Supervision from Noisy Demonstrations
Reinforcement Learning with Supervision from Noisy Demonstrations
Kun-Peng Ning
Sheng-Jun Huang
14
7
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
32
18
0
14 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
38
24
0
12 Jun 2020
Residual Force Control for Agile Human Behavior Imitation and Extended
  Motion Synthesis
Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis
Ye Yuan
Kris Kitani
54
75
0
12 Jun 2020
Deep Reinforcement and InfoMax Learning
Deep Reinforcement and InfoMax Learning
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
35
109
0
12 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
SAMBA: Safe Model-Based & Active Reinforcement Learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
28
44
0
12 Jun 2020
Does Unsupervised Architecture Representation Learning Help Neural
  Architecture Search?
Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?
Shen Yan
Yu Zheng
Wei Ao
Xiao Zeng
Mi Zhang
SSL
AI4CE
40
101
0
12 Jun 2020
Avoiding Side Effects in Complex Environments
Avoiding Side Effects in Complex Environments
Alexander Matt Turner
Neale Ratzlaff
Prasad Tadepalli
35
34
0
11 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
215
0
10 Jun 2020
Learning to Play Table Tennis From Scratch using Muscular Robots
Learning to Play Table Tennis From Scratch using Muscular Robots
Le Chen
Simon Guist
Roberto Calandra
V. Berenz
Bernhard Schölkopf
Jan Peters
21
88
0
10 Jun 2020
AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP
  Investigation in the Recommender System
AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System
Pengyu Zhao
Kecheng Xiao
Yuanxing Zhang
Kaigui Bian
Wei Yan
26
16
0
10 Jun 2020
Searching Learning Strategy with Reinforcement Learning for 3D Medical
  Image Segmentation
Searching Learning Strategy with Reinforcement Learning for 3D Medical Image Segmentation
Dong Yang
H. Roth
Ziyue Xu
Fausto Milletari
Ling Zhang
Daguang Xu
24
46
0
10 Jun 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement
  Learning
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
32
86
0
10 Jun 2020
Stealing Deep Reinforcement Learning Models for Fun and Profit
Stealing Deep Reinforcement Learning Models for Fun and Profit
Kangjie Chen
Shangwei Guo
Tianwei Zhang
Xiaofei Xie
Yang Liu
MLAU
MIACV
OffRL
31
45
0
09 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization
  without Compounding Errors
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
22
4
0
08 Jun 2020
Reinforcement Learning Under Moral Uncertainty
Reinforcement Learning Under Moral Uncertainty
Adrien Ecoffet
Joel Lehman
34
32
0
08 Jun 2020
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
48
7
0
08 Jun 2020
Randomized Policy Learning for Continuous State and Action MDPs
Randomized Policy Learning for Continuous State and Action MDPs
Hiteshi Sharma
Rahul Jain
31
1
0
08 Jun 2020
Deep Reinforcement Learning for Human-Like Driving Policies in Collision
  Avoidance Tasks of Self-Driving Cars
Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars
Ran Emuna
A. Borowsky
Armin Biess
63
24
0
07 Jun 2020
State Action Separable Reinforcement Learning
State Action Separable Reinforcement Learning
Ziyao Zhang
Liang Ma
K. Leung
Konstantinos Poularakis
Mudhakar Srivatsa
36
2
0
05 Jun 2020
TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive
  Midpoint Adaptations in Cartesian Space
TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive Midpoint Adaptations in Cartesian Space
Jonas C. Kiemel
Pascal Meissner
Torsten Kröger
25
6
0
05 Jun 2020
Model-Based Generalization Under Parameter Uncertainty Using Path
  Integral Control
Model-Based Generalization Under Parameter Uncertainty Using Path Integral Control
Ian Abraham
Ankur Handa
Nathan D. Ratliff
Kendall Lowrey
Todd Murphey
Dieter Fox
35
39
0
04 Jun 2020
Visual Transfer for Reinforcement Learning via Wasserstein Domain
  Confusion
Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion
Josh Roy
George Konidaris
22
16
0
04 Jun 2020
Constrained Reinforcement Learning for Dynamic Optimization under
  Uncertainty
Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
Panagiotis Petsagkourakis
I. O. Sandoval
E. Bradford
Dongda Zhang
Ehecatl Antonio del Rio Chanona
20
11
0
04 Jun 2020
Combining Reinforcement Learning and Constraint Programming for
  Combinatorial Optimization
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Quentin Cappart
Thierry Moisan
Louis-Martin Rousseau
Isabeau Prémont-Schwarz
A. Ciré
33
138
0
02 Jun 2020
Cross-Domain Imitation Learning with a Dual Structure
Sungho Choi
Seungyul Han
Woojun Kim
Y. Sung
30
2
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
67
225
0
01 Jun 2020
Predicting Goal-directed Human Attention Using Inverse Reinforcement
  Learning
Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning
Zhibo Yang
Lihan Huang
Yupei Chen
Zijun Wei
Seoyoung Ahn
G. Zelinsky
Dimitris Samaras
Minh Hoai
42
96
0
28 May 2020
A reinforcement learning approach to rare trajectory sampling
A reinforcement learning approach to rare trajectory sampling
Dominic C. Rose
Jamie F. Mair
J. P. Garrahan
29
51
0
26 May 2020
Efficient Use of heuristics for accelerating XCS-based Policy Learning
  in Markov Games
Efficient Use of heuristics for accelerating XCS-based Policy Learning in Markov Games
Hao Chen
Chang Wang
Jian Huang
Jianxing Gong
46
5
0
26 May 2020
Neural Topological SLAM for Visual Navigation
Neural Topological SLAM for Visual Navigation
Devendra Singh Chaplot
Ruslan Salakhutdinov
Abhinav Gupta
Saurabh Gupta
52
291
0
25 May 2020
Gradient Monitored Reinforcement Learning
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
50
11
0
25 May 2020
LEAF: Latent Exploration Along the Frontier
LEAF: Latent Exploration Along the Frontier
Homanga Bharadhwaj
Animesh Garg
Florian Shkurti
39
1
0
21 May 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
38
13
0
21 May 2020
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive
  Locomotion Controller of a Hexapod Robot
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot
M. Schilling
Kai Konen
F. Ohl
Timo Korthals
41
18
0
21 May 2020
Learning natural locomotion behaviors for humanoid robots using human knowledge
Chuanyu Yang
Kai Yuan
Shuai Heng
Taku Komura
Zhibin Li
29
38
0
20 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR
  Control in Active Distribution Networks
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
19
96
0
20 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
47
83
0
20 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular
  Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
43
83
0
18 May 2020
Optimizing for the Future in Non-Stationary MDPs
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
23
65
0
17 May 2020
Previous
123...135136137...147148149
Next