v1v2 (latest)

Proximal Policy Optimization Algorithms

20 July 2017

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,424 papers shown

qgym: A Gym for Training and Benchmarking RL-Based Quantum CompilationInternational Conference on Quantum Computing and Engineering (QCE), 2023

150

01 Aug 2023

Target Search and Navigation in Heterogeneous Robot Systems with Deep Reinforcement LearningMachine Intelligence Research (MIR), 2023

Yuxiang Chen

Jiaping Xiao

129

01 Aug 2023

Pixel to policy: DQN Encoders for within & cross-game reinforcement learning

01 Aug 2023

Deep Reinforcement Learning-Based Battery Conditioning Hierarchical V2G Coordination for Multi-Stakeholder Benefits

01 Aug 2023

Formally Explaining Neural Networks within Reactive SystemsFormal Methods in Computer-Aided Design (FMCAD), 2023

384

31 Jul 2023

Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation

223

31 Jul 2023

Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research ChallengesJournal of Artificial Intelligence Research (JAIR), 2023

Giorgio Franceschelli

Mirco Musolesi

AI4CE

650

31 Jul 2023

Learning to Model the World with LanguageInternational Conference on Machine Learning (ICML), 2023

Pieter Abbeel

290

31 Jul 2023

Discovering Adaptable Symbolic Algorithms from ScratchIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

...

Jie Tan

199

31 Jul 2023

Learning whom to trust in navigation: dynamically switching between classical and neural planningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

238

31 Jul 2023

Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration

Malte Mosbach

Sven Behnke

215

31 Jul 2023

Rating-based Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023

Devin White

Mingkang Wu

Ellen R. Novoseller

Vernon J. Lawhern

Nicholas R. Waytowich

Yongcan Cao

ALM

237

30 Jul 2023

Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models

Keyu Pan

Yawen Zeng

LLMAG

214

30 Jul 2023

MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving at Unsignalized Intersections

Jianqiang Wang

174

30 Jul 2023

Coordination of Bounded Rational Drones through Informed Prior PolicyIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

Durgakant Pushp

Junhong Xu

Lantao Liu

136

28 Jul 2023

Benchmarking Offline Reinforcement Learning on Real-Robot HardwareInternational Conference on Learning Representations (ICLR), 2023

Stefan Bauer

303

28 Jul 2023

TrackAgent: 6D Object Tracking via Reinforcement LearningInternational Conference on Virtual Storytelling (ICVS), 2023

Markus Vincze

121

28 Jul 2023

Learning to Open Doors with an Aerial ManipulatorIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

Roland Siegwart

171

28 Jul 2023

Curiosity-Driven Reinforcement Learning based Low-Level Flight Control

Amir Ramezani Dooraki

Alexandros Iosifidis

28 Jul 2023

Thinker: Learning to Plan and ActNeural Information Processing Systems (NeurIPS), 2023

294

27 Jul 2023

FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial MasksAsia-Pacific Computer Systems Architecture Conference (ACSA), 2023

Buse G. A. Tekgul

Nadarajah Asokan

AAML

235

27 Jul 2023

An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency TradingInternational Conference on Blockchain (ICB), 2023

Shuyang Wang

Diego Klabjan

189

27 Jul 2023

Evaluation of Safety Constraints in Autonomous Navigation with Deep Reinforcement Learning

203

27 Jul 2023

MorphoLander: Reinforcement Learning Based Landing of a Group of Drones on the Adaptive Morphogenetic UAVIEEE International Conference on Systems, Man and Cybernetics (SMC), 2023

Dzmitry Tsetserukou

146

26 Jul 2023

Reinforced Potential Field for Multi-Robot Motion Planning in Cluttered EnvironmentsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

228

26 Jul 2023

Deep Reinforcement Learning for Robust Goal-Based Wealth ManagementArtificial Intelligence Applications and Innovations (AIAI), 2023

25 Jul 2023

Submodular Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

274

25 Jul 2023

Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASHComputer Science and Information Systems (COMSIS), 2023

178

25 Jul 2023

Counterfactual Explanation Policies in RL

Shripad Deshmukh

R Srivatsan

Supriti Vijay

Jayakumar Subramanian

Chirag Agarwal

OffRL

228

25 Jul 2023

RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment

457

24 Jul 2023

RRAML: Reinforced Retrieval Augmented Machine Learning

346

24 Jul 2023

Policy Gradient Optimal Correlation Search for Variance Reduction in Monte Carlo simulation and Maximum Optimal Transport

Pierre Bras

Gilles Pagès

178

24 Jul 2023

SafeSteps: Learning Safer Footstep Planning Policies for Legged Robots via Model-Based PriorsIEEE-RAS International Conference on Humanoid Robots (Humanoids), 2023

Shafeef Omar

Lorenzo Amatucci

Victor Barasuol

Giulio Turrisi

Claudio Semini

342

24 Jul 2023

On the Effectiveness of Offline RL for Dialogue Response GenerationInternational Conference on Machine Learning (ICML), 2023

203

23 Jul 2023

Using Reinforcement Learning for the Three-Dimensional Loading Capacitated Vehicle Routing Problem

122

22 Jul 2023

Online Container Scheduling for Low-Latency IoT Services in Edge Cluster Upgrade: A Reinforcement Learning ApproachInternational Conference on Innovative Computing and Cloud Computing (ICCC), 2023

Hanshuai Cui

Zhiqing Tang

Jiong Lou

Weijia Jia

22 Jul 2023

Active Control of Flow over Rotating Cylinder by Multiple Jets using Deep Reinforcement Learning

Kamyar Dobakhti

J. Ghazanfarian

AI4CE

278

22 Jul 2023

On-Robot Bayesian Reinforcement Learning for POMDPsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

212

22 Jul 2023

Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

279

21 Jul 2023

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

369

21 Jul 2023

An Analysis of Multi-Agent Reinforcement Learning for Decentralized Inventory Control SystemsComputers and Chemical Engineering (Comput. Chem. Eng.), 2023

Marwan Mousa

Damien van de Berg

Niki Kotecha

Ehecatl Antonio del Rio Chanona

M. Mowbray

176

21 Jul 2023

Bridging the Reality Gap of Reinforcement Learning based Traffic Signal Control using Domain Randomization and Meta Learning

Arthur Muller

M. Sabatelli

162

21 Jul 2023

A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of Embodied AI

215

21 Jul 2023

Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback

Tao Chen

Abhishek Gupta

226

20 Jul 2023

PASTA: Pretrained Action-State Transformer Agents

306

20 Jul 2023

Reparameterized Policy Learning for Multimodal Trajectory OptimizationInternational Conference on Machine Learning (ICML), 2023

Chuang Gan

193

20 Jul 2023

FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback

...

311

20 Jul 2023

Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games

264

19 Jul 2023

Robust Driving Policy Learning with Guided Meta Reinforcement Learning

Jinkyoo Park

Mykel J. Kochenderfer

199

19 Jul 2023

Benchmarking Potential Based Rewards for Learning Humanoid LocomotionIEEE International Conference on Robotics and Automation (ICRA), 2023

154

19 Jul 2023