ResearchTrend.AI
Hindsight policy gradients


16 November 2017
Paulo E. Rauber
Avinash Ummadisingu
Filipe Wall Mutz
J. Schmidhuber

Papers citing "Hindsight policy gradients"

13 / 13 papers shown
Goal-Conditioned Supervised Learning with Sub-Goal Prediction
Tom Jurgenson
Aviv Tamar
24
1
0
17 May 2023
Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning
Simon Guist
Jan Schneider-Barnes
Alexander Dittrich
V. Berenz
Bernhard Schölkopf
Dieter Buchler
21
3
0
03 Mar 2023
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
16
18
0
02 Dec 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
T. Luu
Chang-Dong Yoo
10
7
0
28 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
30
92
0
14 Sep 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
66
643
0
03 Jun 2021
Learning One Representation to Optimize All Rewards
Ahmed Touati
Yann Ollivier
OffRL
21
60
0
14 Mar 2021
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Avi Singh
Eric Jang
A. Irpan
Daniel Kappler
Murtaza Dalal
Sergey Levine
Mohi Khansari
Chelsea Finn
45
35
0
25 Feb 2020
Hindsight Trust Region Policy Optimization
Hanbo Zhang
Site Bai
Xuguang Lan
David Hsu
Nanning Zheng
25
8
0
29 Jul 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
22
84
0
10 Jun 2019
Visual Reinforcement Learning with Imagined Goals
Ashvin Nair
Vitchyr H. Pong
Murtaza Dalal
Shikhar Bahl
Steven Lin
Sergey Levine
SSL
13
535
0
12 Jul 2018
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
17
154
0
06 Jul 2018