v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017

Pieter Abbeel

Papers citing "Hindsight Experience Replay"

50 / 1,339 papers shown

Direct Preference Optimization for Primitive-Enabled Hierarchical Reinforcement Learning

366

01 Nov 2024

Compositional Automata Embeddings for Goal-Conditioned Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

Beyazit Yalcinkaya

Niklas Lauffer

Marcell Vazquez-Chanlatte

Sanjit A. Seshia

AI4CE

485

31 Oct 2024

Maximum Entropy Hindsight Experience Replay

Douglas C. Crowder

Matthew L. Trappett

Darrien M. McKenzie

Frances S. Chance

101

31 Oct 2024

Efficient Diversity-based Experience Replay for Deep Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

452

27 Oct 2024

OGBench: Benchmarking Offline Goal-Conditioned RLInternational Conference on Learning Representations (ICLR), 2024

Sergey Levine

521

26 Oct 2024

SkiLD: Unsupervised Skill Discovery Guided by Factor InteractionsNeural Information Processing Systems (NeurIPS), 2024

Roberto Martín-Martín

262

24 Oct 2024

Safe Load Balancing in Software-Defined-NetworkingComputer Communications (Comput. Commun.), 2024

L. Dinh

Pham Tran Anh Quang

Jérémie Leguay

224

22 Oct 2024

Interpretable end-to-end Neurosymbolic Reinforcement Learning agents

429

18 Oct 2024

Novelty-based Sample Reuse for Continuous Robotics ControlIEEE International Conference on Robotics and Biomimetics (ROBIO), 2024

Ke Duan

Kai Yang

Houde Liu

Xueqian Wang

195

17 Oct 2024

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

156

16 Oct 2024

Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards

Grant C. Forbes

Leonardo Villalobos-Arias

Jianxun Wang

Arnav Jhala

David L. Roberts

255

16 Oct 2024

The State of Robot Motion Generation

325

16 Oct 2024

Zero-Shot Offline Imitation Learning via Optimal Transport

1.1K

11 Oct 2024

Effective Exploration Based on the Structural Information PrinciplesNeural Information Processing Systems (NeurIPS), 2024

Xianghua Zeng

Hao Peng

Angsheng Li

151

09 Oct 2024

Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task GenerationIEEE-RAS International Conference on Humanoid Robots (Humanoids), 2024

279

07 Oct 2024

ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control

337

07 Oct 2024

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Chang Liu

318

03 Oct 2024

Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning

Alicia Li

Nishanth Kumar

Tomás Lozano-Pérez

Leslie Kaelbling

OffRL

241

28 Sep 2024

VertiSelector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain

Tong Xu

Chenhui Pan

Xuesu Xiao

607

26 Sep 2024

Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at ScaleNeural Information Processing Systems (NeurIPS), 2024

246

24 Sep 2024

Autonomous Wheel Loader Navigation Using Goal-Conditioned Actor-Critic MPCIEEE International Conference on Robotics and Automation (ICRA), 2024

Aleksi Mäki-Penttilä

Naeim Ebrahimi Toulkani

Reza Ghabcheloo

410

24 Sep 2024

R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World ModelsIEEE International Conference on Robotics and Automation (ICRA), 2024

Viet Dung Nguyen

Zhizhuo Yang

Christopher L. Buckley

Alexander Ororbia

344

21 Sep 2024

Representing Positional Information in Generative World Models for Object Manipulation

242

18 Sep 2024

Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal GuidanceConference on Robot Learning (CoRL), 2024

Yang Yang

Hengtao Shen

OffRL

261

06 Sep 2024

Simplex-enabled Safe Continual Learning Machine

Marco Caccamo

291

05 Sep 2024

ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models

404

05 Sep 2024

Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning

Sotirios A. Tsaftaris

411

04 Sep 2024

A Tighter Convergence Proof of Reverse Experience Replay

Nan Jiang

Jinzhao Li

Yexiang Xue

151

30 Aug 2024

Safe Policy Exploration Improvement via Subgoals

159

25 Aug 2024

Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and AviationConference on Robot Learning (CoRL), 2024

Sergey Levine

379

21 Aug 2024

Online Behavior Modification for Expressive User Control of RL-Trained RobotsIEEE/ACM International Conference on Human-Robot Interaction (HRI), 2024

292

15 Aug 2024

How to Solve Contextual Goal-Oriented Problems with Offline Datasets?Neural Information Processing Systems (NeurIPS), 2024

372

14 Aug 2024

A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or SubgoalsInternational Conference on Learning Representations (ICLR), 2024

391

11 Aug 2024

Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human VideosIEEE Transactions on Automation Science and Engineering (T-ASE), 2024

358

10 Aug 2024

Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning

Martin Moder

Stephen Adhisaputra

Josef Pauli

243

07 Aug 2024

A Value Function Space Approach for Hierarchical Planning with Signal Temporal Logic TasksIEEE Control Systems Letters (L-CSS), 2024

277

04 Aug 2024

Jacta: A Versatile Planner for Learning Dexterous and Whole-body ManipulationConference on Robot Learning (CoRL), 2024

Jan Brüdigam

209

02 Aug 2024

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

130

30 Jul 2024

Autonomous Improvement of Instruction Following Skills via Foundation Models

252

30 Jul 2024

Gymnasium: A Standard Interface for Reinforcement Learning Environments

...

402

479

24 Jul 2024

WayEx: Waypoint Exploration using a Single Demonstration

Mara Levy

Nirat Saini

Abhinav Shrivastava

228

22 Jul 2024

Learning Goal-Conditioned Representations for Language Reward Models

178

18 Jul 2024

Variable-Agnostic Causal Exploration for Reinforcement Learning

243

17 Jul 2024

Investigating the Interplay of Prioritized Replay and Generalization

Parham Mohammad Panahi

Andrew Patterson

Martha White

Adam White

175

12 Jul 2024

TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations

Junik Bae

Kwanyoung Park

Youngwoon Lee

243

11 Jul 2024

Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search

326

08 Jul 2024

Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization

Liam Schramm

Abdeslam Boularias

214

07 Jul 2024

Embracing Massive Medical Data

Yu-Cheng Chou

Zongwei Zhou

Alan Yuille

CLL OOD

185

05 Jul 2024

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

162

05 Jul 2024

EAGERx: Graph-Based Framework for Sim2real Robot Learning

Laura Ferranti

179

05 Jul 2024