ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXivPDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 3,103 papers shown
Title
Scalable and Incremental Learning of Gaussian Mixture Models
Scalable and Incremental Learning of Gaussian Mixture Models
R. Pinto
P. Engel
29
10
0
14 Jan 2017
A K-fold Method for Baseline Estimation in Policy Gradient Algorithms
A K-fold Method for Baseline Estimation in Policy Gradient Algorithms
N. Kota
Abhishek Mishra
Sunil Srinivasa
Xi
Xi Chen
Pieter Abbeel
OffRL
26
0
0
03 Jan 2017
Deep Reinforcement Learning with Successor Features for Navigation
  across Similar Environments
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
22
294
0
16 Dec 2016
Reinforcement Learning With Temporal Logic Rewards
Reinforcement Learning With Temporal Logic Rewards
Xiao Li
C. Vasile
C. Belta
31
214
0
11 Dec 2016
Model-based Adversarial Imitation Learning
Model-based Adversarial Imitation Learning
Nir Baram
Oron Anschel
Shie Mannor
GAN
25
43
0
07 Dec 2016
Generalizing Skills with Semi-Supervised Reinforcement Learning
Generalizing Skills with Semi-Supervised Reinforcement Learning
Chelsea Finn
Tianhe Yu
Justin Fu
Pieter Abbeel
Sergey Levine
OffRL
SSL
35
68
0
01 Dec 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a
  GPU
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
28
258
0
18 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
60
762
0
15 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models
  with KL-control
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
41
169
0
09 Nov 2016
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
RL2^22: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
35
1,010
0
09 Nov 2016
Recursive Regression with Neural Networks: Approximating the HJI PDE
  Solution
Recursive Regression with Neural Networks: Approximating the HJI PDE Solution
Vicencc Rubies-Royo
Claire Tomlin
20
20
0
08 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
47
344
0
07 Nov 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
27
459
0
06 Nov 2016
Combining policy gradient and Q-learning
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
30
139
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
35
755
0
03 Nov 2016
Deep Learning Approximation for Stochastic Control Problems
Deep Learning Approximation for Stochastic Control Problems
Jiequn Han
E. Weinan
BDL
32
193
0
02 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for
  Robotics
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
32
11
0
01 Nov 2016
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
44
532
0
13 Oct 2016
Reset-free Trial-and-Error Learning for Robot Damage Recovery
Reset-free Trial-and-Error Learning for Robot Damage Recovery
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
Jean-Baptiste Mouret
13
102
0
13 Oct 2016
Transfer from Simulation to Real World through Learning Deep Inverse
  Dynamics Model
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model
Paul Christiano
Zain Shah
Igor Mordatch
Jonas Schneider
T. Blackwell
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
PINN
32
249
0
11 Oct 2016
Connecting Generative Adversarial Networks and Actor-Critic Methods
Connecting Generative Adversarial Networks and Actor-Critic Methods
David Pfau
Oriol Vinyals
OffRL
AI4CE
30
186
0
06 Oct 2016
EPOpt: Learning Robust Neural Network Policies Using Model Ensembles
EPOpt: Learning Robust Neural Network Policies Using Model Ensembles
Aravind Rajeswaran
Sarvjeet Ghotra
Balaraman Ravindran
Sergey Levine
35
349
0
05 Oct 2016
Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning
  with Stochastic Initial States
Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States
William H. Montgomery
Anurag Ajay
Chelsea Finn
Pieter Abbeel
Sergey Levine
OnRL
30
37
0
04 Oct 2016
Collective Robot Reinforcement Learning with Distributed Asynchronous
  Guided Policy Search
Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search
Ali Yahya
A. Li
Mrinal Kalakrishnan
Yevgen Chebotar
Sergey Levine
OffRL
26
155
0
03 Oct 2016
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous
  Off-Policy Updates
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
OffRL
SSL
56
1,473
0
03 Oct 2016
Path Integral Guided Policy Search
Path Integral Guided Policy Search
Yevgen Chebotar
Mrinal Kalakrishnan
Ali Yahya
A. Li
S. Schaal
Sergey Levine
38
149
0
03 Oct 2016
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Marvin Zhang
Xinyang Geng
J. Bruce
Ken Caluwaerts
Massimo Vespignani
Vytas SunSpiral
Pieter Abbeel
Sergey Levine
28
92
0
28 Sep 2016
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot
  Transfer
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer
Coline Devin
Abhishek Gupta
Trevor Darrell
Pieter Abbeel
Sergey Levine
OffRL
30
396
0
22 Sep 2016
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot
  Interaction
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction
Ali Ghadirzadeh
Judith Butepage
A. Maki
Danica Kragic
Mårten Björkman
31
50
0
27 Jul 2016
Guided Policy Search as Approximate Mirror Descent
Guided Policy Search as Approximate Mirror Descent
William H. Montgomery
Sergey Levine
27
125
0
15 Jul 2016
Model-Free Trajectory-based Policy Optimization with Monotonic
  Improvement
Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
R. Akrour
A. Abdolmaleki
Hany Abdulsamad
Jan Peters
Gerhard Neumann
18
49
0
29 Jun 2016
Strategic Attentive Writer for Learning Macro-Actions
Strategic Attentive Writer for Learning Macro-Actions
Alexander
A. Vezhnevets
Volodymyr Mnih
J. Agapiou
Simon Osindero
Alex Graves
Oriol Vinyals
Koray Kavukcuoglu
18
172
0
15 Jun 2016
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
60
3,080
0
10 Jun 2016
Continuously Learning Neural Dialogue Management
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
41
122
0
08 Jun 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
131
5,042
0
05 Jun 2016
VIME: Variational Information Maximizing Exploration
VIME: Variational Information Maximizing Exploration
Rein Houthooft
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
27
78
0
31 May 2016
Predicting Personal Traits from Facial Images using Convolutional Neural
  Networks Augmented with Facial Landmark Information
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
Yoad Lewenberg
Valliappa Chockalingam
Satinder Singh
Honglak Lee
CVBM
22
304
0
29 May 2016
Model-Free Imitation Learning with Policy Optimization
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
24
149
0
26 May 2016
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in
  ATARI Games
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Xiaoxiao Guo
Satinder Singh
Richard L. Lewis
Honglak Lee
30
55
0
24 Apr 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
44
1,689
0
22 Apr 2016
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks
  with Delayed Rewards
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards
S. Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
36
40
0
21 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
42
1,008
0
02 Mar 2016
PLATO: Policy Learning using Adaptive Trajectory Optimization
PLATO: Policy Learning using Adaptive Trajectory Optimization
G. Kahn
Tianhao Zhang
Sergey Levine
Pieter Abbeel
32
136
0
02 Mar 2016
Easy Monotonic Policy Iteration
Easy Monotonic Policy Iteration
Joshua Achiam
OffRL
37
0
0
29 Feb 2016
A review on locomotion robophysics: the study of movement at the
  intersection of robotics, soft matter and dynamical systems
A review on locomotion robophysics: the study of movement at the intersection of robotics, soft matter and dynamical systems
J. Aguilar
Tingnan Zhang
Feifei Qian
Mark Kingsbury
Benjamin W. McInroe
...
Matthew Travers
Ross L. Hatton
Howie Choset
P. Umbanhowar
Daniel I. Goldman
17
237
0
12 Feb 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
41
650
0
09 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
112
8,792
0
04 Feb 2016
Memory-based control with recurrent neural networks
Memory-based control with recurrent neural networks
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver
35
301
0
14 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement
  Learning
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
21
113
0
04 Dec 2015
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints
Eric Tzeng
Coline Devin
Judy Hoffman
Chelsea Finn
Pieter Abbeel
Sergey Levine
Kate Saenko
Trevor Darrell
OOD
31
139
0
23 Nov 2015
Previous
123...616263
Next