ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms
v1v2 (latest)

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,421 papers shown
Task-Driven Graph Attention for Hierarchical Relational Object
  Navigation
Task-Driven Graph Attention for Hierarchical Relational Object NavigationIEEE International Conference on Robotics and Automation (ICRA), 2023
Michael Lingelbach
Chengshu Li
Minjune Hwang
Andrey Kurenkov
Alan Lou
Roberto Martín-Martín
Ruohan Zhang
Li Fei-Fei
Jiajun Wu
240
10
0
23 Jun 2023
Creating Valid Adversarial Examples of Malware
Creating Valid Adversarial Examples of MalwareJournal of Computer Virology and Hacking Techniques (JCVHT), 2023
M. Kozák
M. Jureček
Mark Stamp
Fabio Di Troia
AAML
152
18
0
23 Jun 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
Correcting discount-factor mismatch in on-policy policy gradient methodsInternational Conference on Machine Learning (ICML), 2023
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
135
9
0
23 Jun 2023
Transferable Curricula through Difficulty Conditioned Generators
Transferable Curricula through Difficulty Conditioned GeneratorsInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Sidney Tio
Pradeep Varakantham
175
4
0
22 Jun 2023
MP3: Movement Primitive-Based (Re-)Planning Policy
MP3: Movement Primitive-Based (Re-)Planning Policy
Fabian Otto
Hongyi Zhou
Onur Celik
Ge Li
Rudolf Lioutikov
Gerhard Neumann
283
9
0
22 Jun 2023
Robust Recovery Motion Control for Quadrupedal Robots via Learned
  Terrain Imagination
Robust Recovery Motion Control for Quadrupedal Robots via Learned Terrain Imagination
Aswin Nahrendra
Mi-Suk Oh
Byeong-Uk Yu
Hyungtae Lim
Hyun Myung
158
4
0
22 Jun 2023
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large
  Foundation Models
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Shizhe Diao
Boyao Wang
Hanze Dong
Kashun Shum
Jipeng Zhang
Wei Xiong
Tong Zhang
ALM
300
76
0
21 Jun 2023
Introspective Action Advising for Interpretable Transfer Learning
Introspective Action Advising for Interpretable Transfer Learning
Joseph Campbell
Yue (Sophie) Guo
Fiona Xie
Simon Stepputtis
Katia Sycara
244
3
0
21 Jun 2023
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario
  Simulation and Modeling
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and ModelingNeural Information Processing Systems (NeurIPS), 2023
Quanyi Li
Zhenghao Peng
Lan Feng
Zhizheng Liu
Chenda Duan
Wen-An Mo
Bolei Zhou
518
70
0
21 Jun 2023
Tailstorm: A Secure and Fair Blockchain for Cash Transactions
Tailstorm: A Secure and Fair Blockchain for Cash TransactionsConference on Advances in Financial Technologies (AFT), 2023
Patrik Keller
Ben Glickenhaus
G. Bissias
G. Griffith
199
6
0
21 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for
  Search Engine Marketing Optimization
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
386
1
0
21 Jun 2023
Efficient Dynamics Modeling in Interactive Environments with Koopman
  Theory
Efficient Dynamics Modeling in Interactive Environments with Koopman TheoryInternational Conference on Learning Representations (ICLR), 2023
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sai Rajeswar
K. Siddiqi
Siamak Ravanbakhsh
286
9
0
20 Jun 2023
Reinforcement Learning-based Virtual Fixtures for Teleoperation of
  Hydraulic Construction Machine
Reinforcement Learning-based Virtual Fixtures for Teleoperation of Hydraulic Construction MachineIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Hyung-Joo Lee
S. Brell-Çokcan
106
6
0
20 Jun 2023
Learning to Generate Better Than Your LLM
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
272
54
0
20 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy
  Guided Reinforcement Learning
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement LearningACM Multimedia (ACM MM), 2023
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Tao Gui
EGVM
348
12
0
20 Jun 2023
Multi-Fidelity Active Learning with GFlowNets
Multi-Fidelity Active Learning with GFlowNets
Alex Hernandez-Garcia
Nikita Saxena
Moksh Jain
Cheng-Hao Liu
Yoshua Bengio
AI4CE
179
17
0
20 Jun 2023
IMP-MARL: a Suite of Environments for Large-scale Infrastructure
  Management Planning via MARL
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARLNeural Information Processing Systems (NeurIPS), 2023
Pascal Leroy
P. G. Morato
J. Pisane
A. Kolios
D. Ernst
OffRL
266
13
0
20 Jun 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs
  for Fact-aware Language Modeling
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language ModelingIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
358
142
0
20 Jun 2023
Multi-user Reset Controller for Redirected Walking Using Reinforcement
  Learning
Multi-user Reset Controller for Redirected Walking Using Reinforcement Learning
Ho Jung Lee
Sang-Bin Jeon
Yong-Hun Cho
In-Kwon Lee
96
3
0
20 Jun 2023
Deep Reinforcement Learning for Privacy-Preserving Task Offloading in
  Integrated Satellite-Terrestrial Networks
Deep Reinforcement Learning for Privacy-Preserving Task Offloading in Integrated Satellite-Terrestrial NetworksIEEE Transactions on Mobile Computing (IEEE TMC), 2023
Wen-Bo Lan
Kongyang Chen
Yikai Li
Jiannong Cao
Yuvraj Sahni
42
21
0
20 Jun 2023
Cooperative Multi-Agent Learning for Navigation via Structured State
  Abstraction
Cooperative Multi-Agent Learning for Navigation via Structured State AbstractionIEEE Transactions on Communications (IEEE Trans. Commun.), 2023
Mohamed K. Abdel-Aziz
Mohammed S. Elbamby
S. Samarakoon
M. Bennis
229
8
0
20 Jun 2023
Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation
Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation
Jumman Hossain
145
12
0
20 Jun 2023
Sim-to-real transfer of active suspension control using deep
  reinforcement learning
Sim-to-real transfer of active suspension control using deep reinforcement learning
Viktor Wiberg
Erik Wallin
Arvid Fälldin
Tobias Semberg
Morgan Rossander
E. Wadbro
Martin Servin
344
15
0
19 Jun 2023
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning
Nikunj Gupta
Somjit Nath
Samira Ebrahimi Kahou
191
2
0
19 Jun 2023
Deep Reinforcement Learning for ESG financial portfolio management
Deep Reinforcement Learning for ESG financial portfolio management
E.C. Garrido-Merchán
Sol Mora-Figueroa-Cruz-Guzmán
Maria Coronado Vaca
AIFin
155
5
0
19 Jun 2023
LARG, Language-based Automatic Reward and Goal Generation
LARG, Language-based Automatic Reward and Goal Generation
Julien Perez
Denys Proux
Claude Roux
Michael Niemaz
LM&Ro
153
1
0
19 Jun 2023
AdaStop: adaptive statistical testing for sound comparisons of Deep RL
  agents
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents
Timothée Mathieu
R. D. Vecchia
Alena Shilova
M. Centa
Hector Kohler
Odalric-Ambrym Maillard
Philippe Preux
167
2
0
19 Jun 2023
Practical First-Order Bayesian Optimization Algorithms
Practical First-Order Bayesian Optimization Algorithms
Utkarsh Prakash
Aryan Chollera
Kushagra Khatwani
P. K. J.
Tejas Bodas
158
5
0
19 Jun 2023
Collaborative Optimization of Multi-microgrids System with Shared Energy
  Storage Based on Multi-agent Stochastic Game and Reinforcement Learning
Collaborative Optimization of Multi-microgrids System with Shared Energy Storage Based on Multi-agent Stochastic Game and Reinforcement Learning
Yijia Wang
Yangliu Cui
Yang Li
Yang Xu
91
42
0
19 Jun 2023
Integrating Tick-level Data and Periodical Signal for High-frequency
  Market Making
Integrating Tick-level Data and Periodical Signal for High-frequency Market Making
Jiafa He
Cong Zheng
Can Yang
AIFin
127
1
0
19 Jun 2023
Deep Reinforcement Learning with Task-Adaptive Retrieval via
  Hypernetwork
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
Yonggang Jin
Chenxu Wang
Tianyu Zheng
Liuyu Xiang
Yao-Chun Yang
Junge Zhang
Jie Fu
Zhaofeng He
3DH
287
0
0
19 Jun 2023
Acceleration in Policy Optimization
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
329
0
0
18 Jun 2023
LAGOON: Language-Guided Motion Control
LAGOON: Language-Guided Motion ControlIEEE International Conference on Robotics and Automation (ICRA), 2023
Shusheng Xu
Huaijie Wang
Jiaxuan Gao
Yutao Ouyang
Chao Yu
Yi Wu
LM&Ro
303
2
0
18 Jun 2023
Variational Sequential Optimal Experimental Design using Reinforcement
  Learning
Variational Sequential Optimal Experimental Design using Reinforcement LearningComputer Methods in Applied Mechanics and Engineering (CMAME), 2023
Wanggang Shen
Jiayuan Dong
Xun Huan
172
11
0
17 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High
  Dimensions
The RL Perceptron: Generalisation Dynamics of Policy Learning in High DimensionsPhysical Review X (PRX), 2023
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
445
7
0
17 Jun 2023
Empowering NLG: Offline Reinforcement Learning for Informal
  Summarization in Online Domains
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
169
0
0
17 Jun 2023
Snowman: A Million-scale Chinese Commonsense Knowledge Graph Distilled
  from Foundation Model
Snowman: A Million-scale Chinese Commonsense Knowledge Graph Distilled from Foundation Model
Jiaan Wang
Jianfeng Qu
Yunlong Liang
Zhixu Li
An Liu
Guanfeng Liu
Xin Zheng
235
2
0
17 Jun 2023
Genes in Intelligent Agents
Genes in Intelligent Agents
Fu Feng
Jing Wang
Xu Yang
Xin Geng
AI4CE
143
10
0
17 Jun 2023
Active Policy Improvement from Multiple Black-box Oracles
Active Policy Improvement from Multiple Black-box OraclesInternational Conference on Machine Learning (ICML), 2023
Xuefeng Liu
Takuma Yoneda
Simon Mahns
Matthew R. Walter
Yuxin Chen
375
12
0
17 Jun 2023
ALP: Action-Aware Embodied Learning for Perception
ALP: Action-Aware Embodied Learning for Perception
Xinran Liang
Anthony Han
Wilson Yan
Aditi Raghunathan
Pieter Abbeel
VLM
261
6
0
16 Jun 2023
SLACK: Stable Learning of Augmentations with Cold-start and KL
  regularization
SLACK: Stable Learning of Augmentations with Cold-start and KL regularizationComputer Vision and Pattern Recognition (CVPR), 2023
Juliette Marrie
Michael Arbel
Diane Larlus
Julien Mairal
OffRL
155
6
0
16 Jun 2023
Fairness in Preference-based Reinforcement Learning
Fairness in Preference-based Reinforcement Learning
Umer Siddique
Abhinav Sinha
Yongcan Cao
208
7
0
16 Jun 2023
Actor-Critic Model Predictive Control
Actor-Critic Model Predictive ControlIEEE International Conference on Robotics and Automation (ICRA), 2023
Angel Romero
Yunlong Song
Davide Scaramuzza
509
11
0
16 Jun 2023
Unlocking the Potential of User Feedback: Leveraging Large Language
  Model as User Simulator to Enhance Dialogue System
Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue SystemInternational Conference on Information and Knowledge Management (CIKM), 2023
Zhiyuan Hu
Yue Feng
Anh Tuan Luu
Bryan Hooi
Aldo Lipani
311
49
0
16 Jun 2023
Mimicking Better by Matching the Approximate Action Distribution
Mimicking Better by Matching the Approximate Action DistributionInternational Conference on Machine Learning (ICML), 2023
Joao A. Candido Ramos
Lionel Blondé
Naoya Takeishi
Alexandros Kalousis
215
4
0
16 Jun 2023
Meta Generative Flow Networks with Personalization for Task-Specific
  Adaptation
Meta Generative Flow Networks with Personalization for Task-Specific AdaptationInformation Sciences (Inf. Sci.), 2023
Xinyuan Ji
Xu Zhang
Wei Xi
Haozhi Wang
Olga Gadyatskaya
Yinchuan Li
178
1
0
16 Jun 2023
Semi-Offline Reinforcement Learning for Optimized Text Generation
Semi-Offline Reinforcement Learning for Optimized Text GenerationInternational Conference on Machine Learning (ICML), 2023
Changyu Chen
Xiting Wang
Yiqiao Jin
Victor Ye Dong
Li Dong
Jie Cao
Yi Liu
Rui Yan
OffRL
215
17
0
16 Jun 2023
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal
  Control and Carbon Emission Reduction
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction
Cheng Ruei Tang
J. Hsieh
Shin-You Teng
97
0
0
16 Jun 2023
DeepMPR: Enhancing Opportunistic Routing in Wireless Networks through
  Multi-Agent Deep Reinforcement Learning
DeepMPR: Enhancing Opportunistic Routing in Wireless Networks through Multi-Agent Deep Reinforcement Learning
Saeed Kaviani
Bo Ryu
Ejaz Ahmed
Deokseong Kim
Jae H. Kim
Carrie Spiker
Blake Harnden
153
3
0
16 Jun 2023
CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller
CAJun: Continuous Adaptive Jumping using a Learned Centroidal ControllerConference on Robot Learning (CoRL), 2023
Yuxiang Yang
Guanya Shi
Xiang Meng
Wenhao Yu
Tingnan Zhang
Jie Tan
Byron Boots
221
33
0
16 Jun 2023
Previous
123...135136137...227228229
Next