ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms
v1v2 (latest)

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,424 papers shown
qgym: A Gym for Training and Benchmarking RL-Based Quantum Compilation
qgym: A Gym for Training and Benchmarking RL-Based Quantum CompilationInternational Conference on Quantum Computing and Engineering (QCE), 2023
S. V. D. Linde
Willem de Kok
T. Bontekoe
Sebastian Feld
150
18
0
01 Aug 2023
Target Search and Navigation in Heterogeneous Robot Systems with Deep
  Reinforcement Learning
Target Search and Navigation in Heterogeneous Robot Systems with Deep Reinforcement LearningMachine Intelligence Research (MIR), 2023
Yuxiang Chen
Jiaping Xiao
129
10
0
01 Aug 2023
Pixel to policy: DQN Encoders for within & cross-game reinforcement
  learning
Pixel to policy: DQN Encoders for within & cross-game reinforcement learning
Ashrya Agrawal
Priyansh Shah
S. Prakash
OffRL
32
0
0
01 Aug 2023
Deep Reinforcement Learning-Based Battery Conditioning Hierarchical V2G
  Coordination for Multi-Stakeholder Benefits
Deep Reinforcement Learning-Based Battery Conditioning Hierarchical V2G Coordination for Multi-Stakeholder Benefits
Yubao Zhang
Xinyu Chen
Yi Gu
Zhicheng Li
Wu Kai
78
0
0
01 Aug 2023
Formally Explaining Neural Networks within Reactive Systems
Formally Explaining Neural Networks within Reactive SystemsFormal Methods in Computer-Aided Design (FMCAD), 2023
Shahaf Bassan
Guy Amir
Davide Corsi
Idan Refaeli
Guy Katz
AAML
384
22
0
31 Jul 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark
  and Case Study for Robotics Manipulation
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Yuheng Huang
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
223
32
0
31 Jul 2023
Reinforcement Learning for Generative AI: State of the Art,
  Opportunities and Open Research Challenges
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research ChallengesJournal of Artificial Intelligence Research (JAIR), 2023
Giorgio Franceschelli
Mirco Musolesi
AI4CE
650
30
0
31 Jul 2023
Learning to Model the World with Language
Learning to Model the World with LanguageInternational Conference on Machine Learning (ICML), 2023
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&RoSyDa
290
70
0
31 Jul 2023
Discovering Adaptable Symbolic Algorithms from Scratch
Discovering Adaptable Symbolic Algorithms from ScratchIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Stephen Kelly
Daniel S. Park
Xingyou Song
Mitchell McIntire
Pranav Nashikkar
...
W. Banzhaf
Kalyanmoy Deb
Vishnu Boddeti
Jie Tan
Esteban Real
199
6
0
31 Jul 2023
Learning whom to trust in navigation: dynamically switching between
  classical and neural planning
Learning whom to trust in navigation: dynamically switching between classical and neural planningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Sombit Dey
Assem Sadek
G. Monaci
Boris Chidlovskii
Christian Wolf
238
6
0
31 Jul 2023
Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration
Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration
Malte Mosbach
Sven Behnke
215
2
0
31 Jul 2023
Rating-based Reinforcement Learning
Rating-based Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Devin White
Mingkang Wu
Ellen R. Novoseller
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
ALM
237
13
0
30 Jul 2023
Do LLMs Possess a Personality? Making the MBTI Test an Amazing
  Evaluation for Large Language Models
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models
Keyu Pan
Yawen Zeng
LLMAG
214
57
0
30 Jul 2023
MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving
  at Unsignalized Intersections
MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving at Unsignalized Intersections
Jiaqi Liu
Peng Hang
Xiao Qi
Jianqiang Wang
Jian Sun
174
60
0
30 Jul 2023
Coordination of Bounded Rational Drones through Informed Prior Policy
Coordination of Bounded Rational Drones through Informed Prior PolicyIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Durgakant Pushp
Junhong Xu
Lantao Liu
136
1
0
28 Jul 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Benchmarking Offline Reinforcement Learning on Real-Robot HardwareInternational Conference on Learning Representations (ICLR), 2023
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
303
36
0
28 Jul 2023
TrackAgent: 6D Object Tracking via Reinforcement Learning
TrackAgent: 6D Object Tracking via Reinforcement LearningInternational Conference on Virtual Storytelling (ICVS), 2023
Konstantin Röhrl
Dominik Bauer
T. Patten
Markus Vincze
3DPC
121
0
0
28 Jul 2023
Learning to Open Doors with an Aerial Manipulator
Learning to Open Doors with an Aerial ManipulatorIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Eugenio Cuniato
Ismail Geles
Weixuan Zhang
Olov Andersson
M. Tognon
Roland Siegwart
171
7
0
28 Jul 2023
Curiosity-Driven Reinforcement Learning based Low-Level Flight Control
Curiosity-Driven Reinforcement Learning based Low-Level Flight Control
Amir Ramezani Dooraki
Alexandros Iosifidis
99
0
0
28 Jul 2023
Thinker: Learning to Plan and Act
Thinker: Learning to Plan and ActNeural Information Processing Systems (NeurIPS), 2023
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAGOffRLLRM
294
12
0
27 Jul 2023
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal
  Adversarial Masks
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial MasksAsia-Pacific Computer Systems Architecture Conference (ACSA), 2023
Buse G. A. Tekgul
Nadarajah Asokan
AAML
235
4
0
27 Jul 2023
An Ensemble Method of Deep Reinforcement Learning for Automated
  Cryptocurrency Trading
An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency TradingInternational Conference on Blockchain (ICB), 2023
Shuyang Wang
Diego Klabjan
189
3
0
27 Jul 2023
Evaluation of Safety Constraints in Autonomous Navigation with Deep
  Reinforcement Learning
Evaluation of Safety Constraints in Autonomous Navigation with Deep Reinforcement Learning
Brian Angulo
G. Gorbov
Aleksandr I. Panov
Konstantin Yakovlev
203
0
0
27 Jul 2023
MorphoLander: Reinforcement Learning Based Landing of a Group of Drones
  on the Adaptive Morphogenetic UAV
MorphoLander: Reinforcement Learning Based Landing of a Group of Drones on the Adaptive Morphogenetic UAVIEEE International Conference on Systems, Man and Cybernetics (SMC), 2023
Sausar Karaf
A. Fedoseev
Mikhail Martynov
Zhanibek Darush
Aleksei Shcherbak
Dzmitry Tsetserukou
146
9
0
26 Jul 2023
Reinforced Potential Field for Multi-Robot Motion Planning in Cluttered
  Environments
Reinforced Potential Field for Multi-Robot Motion Planning in Cluttered EnvironmentsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Dengyu Zhang
Xinyu Zhang
Zheng Zhang
Bo Zhu
Qingrui Zhang
228
4
0
26 Jul 2023
Deep Reinforcement Learning for Robust Goal-Based Wealth Management
Deep Reinforcement Learning for Robust Goal-Based Wealth ManagementArtificial Intelligence Applications and Innovations (AIAI), 2023
Tessa Bauman
Bruno Gašperov
Stjepan Begušić
Z. Kostanjčar
AIFin
81
1
0
25 Jul 2023
Submodular Reinforcement Learning
Submodular Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Manish Prajapat
Mojmír Mutný
Melanie Zeilinger
Andreas Krause
OffRL
274
22
0
25 Jul 2023
Reinforcement Learning -based Adaptation and Scheduling Methods for
  Multi-source DASH
Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASHComputer Science and Information Systems (COMSIS), 2023
Nghia T. Nguyen
Long Luu
Phuong Vo
Sang Nguyen
Cuong T. Do
Ngoc-Thanh Nguyen
AI4TS
178
2
0
25 Jul 2023
Counterfactual Explanation Policies in RL
Counterfactual Explanation Policies in RL
Shripad Deshmukh
R Srivatsan
Supriti Vijay
Jayakumar Subramanian
Chirag Agarwal
OffRL
228
0
0
25 Jul 2023
RLCD: Reinforcement Learning from Contrastive Distillation for Language
  Model Alignment
RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment
Kevin Kaichuang Yang
Dan Klein
Asli Celikyilmaz
Nanyun Peng
Yuandong Tian
ALM
457
38
0
24 Jul 2023
RRAML: Reinforced Retrieval Augmented Machine Learning
RRAML: Reinforced Retrieval Augmented Machine Learning
Andrea Bacciu
Florin Cocunasu
F. Siciliano
Fabrizio Silvestri
Nicola Tonellotto
Giovanni Trappolini
RALM
346
9
0
24 Jul 2023
Policy Gradient Optimal Correlation Search for Variance Reduction in
  Monte Carlo simulation and Maximum Optimal Transport
Policy Gradient Optimal Correlation Search for Variance Reduction in Monte Carlo simulation and Maximum Optimal Transport
Pierre Bras
Gilles Pagès
178
1
0
24 Jul 2023
SafeSteps: Learning Safer Footstep Planning Policies for Legged Robots via Model-Based Priors
SafeSteps: Learning Safer Footstep Planning Policies for Legged Robots via Model-Based PriorsIEEE-RAS International Conference on Humanoid Robots (Humanoids), 2023
Shafeef Omar
Lorenzo Amatucci
Victor Barasuol
Giulio Turrisi
Claudio Semini
342
6
0
24 Jul 2023
On the Effectiveness of Offline RL for Dialogue Response Generation
On the Effectiveness of Offline RL for Dialogue Response GenerationInternational Conference on Machine Learning (ICML), 2023
Paloma Sodhi
Felix Wu
Ethan R. Elenberg
Kilian Q. Weinberger
Ryan T. McDonald
OffRL
203
5
0
23 Jul 2023
Using Reinforcement Learning for the Three-Dimensional Loading
  Capacitated Vehicle Routing Problem
Using Reinforcement Learning for the Three-Dimensional Loading Capacitated Vehicle Routing Problem
Stefan Schoepf
Stephen Mak
J. Senoner
Liming Xu
Netland Torbjorn
Alexandra Brintrup
122
0
0
22 Jul 2023
Online Container Scheduling for Low-Latency IoT Services in Edge Cluster
  Upgrade: A Reinforcement Learning Approach
Online Container Scheduling for Low-Latency IoT Services in Edge Cluster Upgrade: A Reinforcement Learning ApproachInternational Conference on Innovative Computing and Cloud Computing (ICCC), 2023
Hanshuai Cui
Zhiqing Tang
Jiong Lou
Weijia Jia
54
8
0
22 Jul 2023
Active Control of Flow over Rotating Cylinder by Multiple Jets using
  Deep Reinforcement Learning
Active Control of Flow over Rotating Cylinder by Multiple Jets using Deep Reinforcement Learning
Kamyar Dobakhti
J. Ghazanfarian
AI4CE
278
0
0
22 Jul 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
On-Robot Bayesian Reinforcement Learning for POMDPsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
212
2
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
279
2
0
21 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement
  Learning
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
369
3
0
21 Jul 2023
An Analysis of Multi-Agent Reinforcement Learning for Decentralized
  Inventory Control Systems
An Analysis of Multi-Agent Reinforcement Learning for Decentralized Inventory Control SystemsComputers and Chemical Engineering (Comput. Chem. Eng.), 2023
Marwan Mousa
Damien van de Berg
Niki Kotecha
Ehecatl Antonio del Rio Chanona
M. Mowbray
176
24
0
21 Jul 2023
Bridging the Reality Gap of Reinforcement Learning based Traffic Signal
  Control using Domain Randomization and Meta Learning
Bridging the Reality Gap of Reinforcement Learning based Traffic Signal Control using Domain Randomization and Meta Learning
Arthur Muller
M. Sabatelli
162
5
0
21 Jul 2023
A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of
  Embodied AI
A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of Embodied AI
Fang Gao
XueTao Li
Jun Yu
Shaung Feng
215
4
0
21 Jul 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from
  Human-in-the-Loop Feedback
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
226
15
0
20 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&RoOffRL
306
6
0
20 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Reparameterized Policy Learning for Multimodal Trajectory OptimizationInternational Conference on Machine Learning (ICML), 2023
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
193
18
0
20 Jul 2023
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Ashish Singh
Ashutosh Singh
Prateek R. Agarwal
Zixuan Huang
Arpita Singh
...
Ryan Rossi
Puneet Mathur
Erik Learned-Miller
Franck Dernoncourt
Ryan Rossi
311
8
0
20 Jul 2023
Technical Challenges of Deploying Reinforcement Learning Agents for Game
  Testing in AAA Games
Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games
Jonas Gillberg
Joakim Bergdahl
Alessandro Sestini
Andy Eakins
Linus Gisslén
OffRL
264
19
0
19 Jul 2023
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Kanghoon Lee
Jiachen Li
David Isele
Jinkyoo Park
K. Fujimura
Mykel J. Kochenderfer
199
8
0
19 Jul 2023
Benchmarking Potential Based Rewards for Learning Humanoid Locomotion
Benchmarking Potential Based Rewards for Learning Humanoid LocomotionIEEE International Conference on Robotics and Automation (ICRA), 2023
Seungmin Jeon
Steve Heim
Charles Khazoom
Sangbae Kim
OffRL
154
25
0
19 Jul 2023
Previous
123...132133134...227228229
Next
Page 133 of 229
Pageof 229