Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1707.06347
Cited By
v1
v2 (latest)
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 11,421 papers shown
Task-Driven Graph Attention for Hierarchical Relational Object Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2023
Michael Lingelbach
Chengshu Li
Minjune Hwang
Andrey Kurenkov
Alan Lou
Roberto Martín-Martín
Ruohan Zhang
Li Fei-Fei
Jiajun Wu
240
10
0
23 Jun 2023
Creating Valid Adversarial Examples of Malware
Journal of Computer Virology and Hacking Techniques (JCVHT), 2023
M. Kozák
M. Jureček
Mark Stamp
Fabio Di Troia
AAML
152
18
0
23 Jun 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
International Conference on Machine Learning (ICML), 2023
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
135
9
0
23 Jun 2023
Transferable Curricula through Difficulty Conditioned Generators
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Sidney Tio
Pradeep Varakantham
175
4
0
22 Jun 2023
MP3: Movement Primitive-Based (Re-)Planning Policy
Fabian Otto
Hongyi Zhou
Onur Celik
Ge Li
Rudolf Lioutikov
Gerhard Neumann
283
9
0
22 Jun 2023
Robust Recovery Motion Control for Quadrupedal Robots via Learned Terrain Imagination
Aswin Nahrendra
Mi-Suk Oh
Byeong-Uk Yu
Hyungtae Lim
Hyun Myung
158
4
0
22 Jun 2023
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Shizhe Diao
Boyao Wang
Hanze Dong
Kashun Shum
Jipeng Zhang
Wei Xiong
Tong Zhang
ALM
300
76
0
21 Jun 2023
Introspective Action Advising for Interpretable Transfer Learning
Joseph Campbell
Yue (Sophie) Guo
Fiona Xie
Simon Stepputtis
Katia Sycara
244
3
0
21 Jun 2023
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling
Neural Information Processing Systems (NeurIPS), 2023
Quanyi Li
Zhenghao Peng
Lan Feng
Zhizheng Liu
Chenda Duan
Wen-An Mo
Bolei Zhou
518
70
0
21 Jun 2023
Tailstorm: A Secure and Fair Blockchain for Cash Transactions
Conference on Advances in Financial Technologies (AFT), 2023
Patrik Keller
Ben Glickenhaus
G. Bissias
G. Griffith
199
6
0
21 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
386
1
0
21 Jun 2023
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
International Conference on Learning Representations (ICLR), 2023
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sai Rajeswar
K. Siddiqi
Siamak Ravanbakhsh
286
9
0
20 Jun 2023
Reinforcement Learning-based Virtual Fixtures for Teleoperation of Hydraulic Construction Machine
IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Hyung-Joo Lee
S. Brell-Çokcan
106
6
0
20 Jun 2023
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
272
54
0
20 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
ACM Multimedia (ACM MM), 2023
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Tao Gui
EGVM
348
12
0
20 Jun 2023
Multi-Fidelity Active Learning with GFlowNets
Alex Hernandez-Garcia
Nikita Saxena
Moksh Jain
Cheng-Hao Liu
Yoshua Bengio
AI4CE
179
17
0
20 Jun 2023
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
Neural Information Processing Systems (NeurIPS), 2023
Pascal Leroy
P. G. Morato
J. Pisane
A. Kolios
D. Ernst
OffRL
266
13
0
20 Jun 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
358
142
0
20 Jun 2023
Multi-user Reset Controller for Redirected Walking Using Reinforcement Learning
Ho Jung Lee
Sang-Bin Jeon
Yong-Hun Cho
In-Kwon Lee
96
3
0
20 Jun 2023
Deep Reinforcement Learning for Privacy-Preserving Task Offloading in Integrated Satellite-Terrestrial Networks
IEEE Transactions on Mobile Computing (IEEE TMC), 2023
Wen-Bo Lan
Kongyang Chen
Yikai Li
Jiannong Cao
Yuvraj Sahni
42
21
0
20 Jun 2023
Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction
IEEE Transactions on Communications (IEEE Trans. Commun.), 2023
Mohamed K. Abdel-Aziz
Mohammed S. Elbamby
S. Samarakoon
M. Bennis
229
8
0
20 Jun 2023
Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation
Jumman Hossain
145
12
0
20 Jun 2023
Sim-to-real transfer of active suspension control using deep reinforcement learning
Viktor Wiberg
Erik Wallin
Arvid Fälldin
Tobias Semberg
Morgan Rossander
E. Wadbro
Martin Servin
344
15
0
19 Jun 2023
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning
Nikunj Gupta
Somjit Nath
Samira Ebrahimi Kahou
191
2
0
19 Jun 2023
Deep Reinforcement Learning for ESG financial portfolio management
E.C. Garrido-Merchán
Sol Mora-Figueroa-Cruz-Guzmán
Maria Coronado Vaca
AIFin
155
5
0
19 Jun 2023
LARG, Language-based Automatic Reward and Goal Generation
Julien Perez
Denys Proux
Claude Roux
Michael Niemaz
LM&Ro
153
1
0
19 Jun 2023
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents
Timothée Mathieu
R. D. Vecchia
Alena Shilova
M. Centa
Hector Kohler
Odalric-Ambrym Maillard
Philippe Preux
167
2
0
19 Jun 2023
Practical First-Order Bayesian Optimization Algorithms
Utkarsh Prakash
Aryan Chollera
Kushagra Khatwani
P. K. J.
Tejas Bodas
158
5
0
19 Jun 2023
Collaborative Optimization of Multi-microgrids System with Shared Energy Storage Based on Multi-agent Stochastic Game and Reinforcement Learning
Yijia Wang
Yangliu Cui
Yang Li
Yang Xu
91
42
0
19 Jun 2023
Integrating Tick-level Data and Periodical Signal for High-frequency Market Making
Jiafa He
Cong Zheng
Can Yang
AIFin
127
1
0
19 Jun 2023
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
Yonggang Jin
Chenxu Wang
Tianyu Zheng
Liuyu Xiang
Yao-Chun Yang
Junge Zhang
Jie Fu
Zhaofeng He
3DH
287
0
0
19 Jun 2023
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
329
0
0
18 Jun 2023
LAGOON: Language-Guided Motion Control
IEEE International Conference on Robotics and Automation (ICRA), 2023
Shusheng Xu
Huaijie Wang
Jiaxuan Gao
Yutao Ouyang
Chao Yu
Yi Wu
LM&Ro
303
2
0
18 Jun 2023
Variational Sequential Optimal Experimental Design using Reinforcement Learning
Computer Methods in Applied Mechanics and Engineering (CMAME), 2023
Wanggang Shen
Jiayuan Dong
Xun Huan
172
11
0
17 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Physical Review X (PRX), 2023
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
445
7
0
17 Jun 2023
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
169
0
0
17 Jun 2023
Snowman: A Million-scale Chinese Commonsense Knowledge Graph Distilled from Foundation Model
Jiaan Wang
Jianfeng Qu
Yunlong Liang
Zhixu Li
An Liu
Guanfeng Liu
Xin Zheng
235
2
0
17 Jun 2023
Genes in Intelligent Agents
Fu Feng
Jing Wang
Xu Yang
Xin Geng
AI4CE
143
10
0
17 Jun 2023
Active Policy Improvement from Multiple Black-box Oracles
International Conference on Machine Learning (ICML), 2023
Xuefeng Liu
Takuma Yoneda
Simon Mahns
Matthew R. Walter
Yuxin Chen
375
12
0
17 Jun 2023
ALP: Action-Aware Embodied Learning for Perception
Xinran Liang
Anthony Han
Wilson Yan
Aditi Raghunathan
Pieter Abbeel
VLM
261
6
0
16 Jun 2023
SLACK: Stable Learning of Augmentations with Cold-start and KL regularization
Computer Vision and Pattern Recognition (CVPR), 2023
Juliette Marrie
Michael Arbel
Diane Larlus
Julien Mairal
OffRL
155
6
0
16 Jun 2023
Fairness in Preference-based Reinforcement Learning
Umer Siddique
Abhinav Sinha
Yongcan Cao
208
7
0
16 Jun 2023
Actor-Critic Model Predictive Control
IEEE International Conference on Robotics and Automation (ICRA), 2023
Angel Romero
Yunlong Song
Davide Scaramuzza
509
11
0
16 Jun 2023
Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System
International Conference on Information and Knowledge Management (CIKM), 2023
Zhiyuan Hu
Yue Feng
Anh Tuan Luu
Bryan Hooi
Aldo Lipani
311
49
0
16 Jun 2023
Mimicking Better by Matching the Approximate Action Distribution
International Conference on Machine Learning (ICML), 2023
Joao A. Candido Ramos
Lionel Blondé
Naoya Takeishi
Alexandros Kalousis
215
4
0
16 Jun 2023
Meta Generative Flow Networks with Personalization for Task-Specific Adaptation
Information Sciences (Inf. Sci.), 2023
Xinyuan Ji
Xu Zhang
Wei Xi
Haozhi Wang
Olga Gadyatskaya
Yinchuan Li
178
1
0
16 Jun 2023
Semi-Offline Reinforcement Learning for Optimized Text Generation
International Conference on Machine Learning (ICML), 2023
Changyu Chen
Xiting Wang
Yiqiao Jin
Victor Ye Dong
Li Dong
Jie Cao
Yi Liu
Rui Yan
OffRL
215
17
0
16 Jun 2023
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction
Cheng Ruei Tang
J. Hsieh
Shin-You Teng
97
0
0
16 Jun 2023
DeepMPR: Enhancing Opportunistic Routing in Wireless Networks through Multi-Agent Deep Reinforcement Learning
Saeed Kaviani
Bo Ryu
Ejaz Ahmed
Deokseong Kim
Jae H. Kim
Carrie Spiker
Blake Harnden
153
3
0
16 Jun 2023
CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller
Conference on Robot Learning (CoRL), 2023
Yuxiang Yang
Guanya Shi
Xiang Meng
Wenhao Yu
Tingnan Zhang
Jie Tan
Byron Boots
221
33
0
16 Jun 2023
Previous
1
2
3
...
135
136
137
...
227
228
229
Next