v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017

Pieter Abbeel

Papers citing "Hindsight Experience Replay"

50 / 1,345 papers shown

Open-Ended Reinforcement Learning with Neural Reward FunctionsNeural Information Processing Systems (NeurIPS), 2022

Robert Meier

Asier Mujika

318

16 Feb 2022

End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints RepresentationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

232

12 Feb 2022

Online Decision TransformerInternational Conference on Machine Learning (ICML), 2022

403

250

11 Feb 2022

Help Me Explore: Minimal Social Interventions for Graph-Based Autotelic Agents

202

10 Feb 2022

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RLInternational Conference on Learning Representations (ICLR), 2022

379

09 Feb 2022

Approximating Gradients for Differentiable Quality Diversity in Reinforcement LearningAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2022

300

08 Feb 2022

Pre-Trained Language Models for Interactive Decision-MakingNeural Information Processing Systems (NeurIPS), 2022

Shuang Li

...

Antonio Torralba

478

315

03 Feb 2022

How to Leverage Unlabeled Data in Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022

550

03 Feb 2022

Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

David Brandfonbrener

Pieter Abbeel

248

112

31 Jan 2022

Contrastive Learning from DemonstrationsInternational Conference on Robotic Computing (IRC), 2022

André Rosa de Sousa Porfírio Correia

L. A. Alexandre

SSL

274

30 Jan 2022

The Challenges of Exploration for Offline Reinforcement Learning

Markus Wulfmeier

Michael Bloesch

Martin Riedmiller

232

27 Jan 2022

State-Conditioned Adversarial Subgoal GenerationAAAI Conference on Artificial Intelligence (AAAI), 2022

V. Wang

Joni Pajarinen

Tinghuai Wang

Joni-Kristian Kämäräinen

385

24 Jan 2022

Pearl: Parallel Evolutionary and Reinforcement Learning Library

Rohan Tangri

Danilo P. Mandic

A. Constantinides

176

24 Jan 2022

Goal-Conditioned Reinforcement Learning: Problems and SolutionsInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

Minghuan Liu

Menghui Zhu

Weinan Zhang

405

199

20 Jan 2022

Reinforcement Learning based Air Combat Maneuver Generation

Muhammed Murat Özbek

E. Koyuncu

14 Jan 2022

Automated Reinforcement Learning: An Overview

Yaoxin Wu

Wen Song

Yingqian Zhang

OffRL

464

13 Jan 2022

Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics

Swagat Kumar

Hayden Sampson

Ardhendu Behera

197

11 Jan 2022

Automated Reinforcement Learning (AutoRL): A Survey and Open ProblemsJournal of Artificial Intelligence Research (JAIR), 2022

...

Katharina Eggensperger

Marius Lindauer

AI4CE

480

134

11 Jan 2022

STIR

^2

: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasksAdaptive Agents and Multi-Agent Systems (AAMAS), 2022

Jesús Bujalance Martín

Fabien Moutarde

OffRL

220

11 Jan 2022

Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education ScenarioInternational Symposium on Medical Robotics (ISMR), 2022

413

02 Jan 2022

Multiagent Model-based Credit Assignment for Continuous ControlAdaptive Agents and Multi-Agent Systems (AAMAS), 2021

132

27 Dec 2021

Off Environment Evaluation Using Convex Risk MinimizationIEEE International Conference on Robotics and Automation (ICRA), 2021

Pulkit Katdare

Shuijing Liu

Katherine Driggs-Campbell

140

21 Dec 2021

Proving Theorems using Incremental Learning and Hindsight Experience ReplayInternational Conference on Machine Learning (ICML), 2021

Lei M. Zhang

300

20 Dec 2021

143

08 Dec 2021

CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation TasksIEEE Robotics and Automation Letters (RA-L), 2021

Wolfram Burgard

671

485

06 Dec 2021

Hierarchical Reinforcement Learning with Timed SubgoalsNeural Information Processing Systems (NeurIPS), 2021

Nico Gürtler

Le Chen

Georg Martius

355

06 Dec 2021

Flexible-Joint Manipulator Trajectory Tracking with Learned Two-Stage Model employing One-Step Future PredictionInternational Conference on Robotic Computing (IRC), 2021

D. Pavlichenko

Sven Behnke

164

06 Dec 2021

Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL

Charles Packer

Pieter Abbeel

Joseph E. Gonzalez

OffRL

284

02 Dec 2021

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

Markus Wulfmeier

243

01 Dec 2021

Learning Long-Term Reward Redistribution via Randomized Return DecompositionInternational Conference on Learning Representations (ICLR), 2021

381

26 Nov 2021

Adaptive Multi-Goal Exploration

Pierre Ménard

347

23 Nov 2021

Generalized Decision Transformer for Offline Hindsight Information MatchingInternational Conference on Learning Representations (ICLR), 2021

Hiroki Furuta

Y. Matsuo

S. Gu

OffRL

362

124

19 Nov 2021

Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning

228

18 Nov 2021

Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in RoboticsNeural Information Processing Systems (NeurIPS), 2021

172

15 Nov 2021

Improving Experience Replay through Modeling of Similar Transitions' Sets

Daniel Eugênio Neves

João Pedro Oliveira Batisteli

Eduardo Felipe Lopes

Lucila Ishitani

Zenilton K. G. Patrocínio

OffRL

128

12 Nov 2021

One model Packs Thousands of Items with Recurrent Conditional Query LearningKnowledge-Based Systems (KBS), 2021

231

12 Nov 2021

Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot ManipulationConference on Robot Learning (CoRL), 2021

163

11 Nov 2021

Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments

167

07 Nov 2021

Automatic Goal Generation using Dynamical Distance Learning

Bharat Prakash

Nicholas R. Waytowich

T. Mohsenin

Tim Oates

129

07 Nov 2021

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon ReasoningInternational Conference on Learning Representations (ICLR), 2021

297

04 Nov 2021

Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning

Wenlong Huang

Igor Mordatch

Pieter Abbeel

Deepak Pathak

324

04 Nov 2021

Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement LearningAmerican Control Conference (ACC), 2021

Sindre Benjamin Remman

Inga Strümke

A. Lekkas

CML

193

04 Nov 2021

Autonomous Attack Mitigation for Industrial Control Systems

Mykel J. Kochenderfer

AAML

169

03 Nov 2021

Discovering and Exploiting Sparse Rewards in a Learned Behavior SpaceEvolutionary Computation (Evol. Comput.), 2021

250

02 Nov 2021

Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience ReplayIEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2021

Baturay Saglam

155

02 Nov 2021

Robot Learning from Randomized Simulations: A ReviewFrontiers in Robotics and AI (Front. Robot. AI), 2021

Wenhao Yu

Jan Peters

401

124

01 Nov 2021

Adjacency constraint for efficient hierarchical reinforcement learningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

584

30 Oct 2021

Hindsight Goal Ranking on Replay Buffer for Sparse Reward EnvironmentIEEE Access (IEEE Access), 2021

Tung M. Luu

Chang D. Yoo

181

28 Oct 2021

Similarity-Aware Skill Reproduction based on Multi-Representational Learning from DemonstrationInternational Conference on Advanced Robotics (ICAR), 2021

Brendan Hertel

S. Ahmadzadeh

176

28 Oct 2021

Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling

Jesús Bujalance Martín

Raphael Chekroun

Fabien Moutarde

OffRL

209

27 Oct 2021