1705.06366
Automatic Goal Generation for Reinforcement Learning Agents
17 May 2017
Carlos Florensa
David Held
Xinyang Geng
Pieter Abbeel
Papers citing "Automatic Goal Generation for Reinforcement Learning Agents"
50 / 321 papers shown
Title
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Daniel Furelos-Blanco
Charles Pert
Frederik Kelbel
Alex F Spies
Alessandra Russo
Michael Dennis
68
0
0
16 Nov 2025
Distributionally Robust Self Paced Curriculum Reinforcement Learning
Anirudh Satheesh
Keenan Powell
Vaneet Aggarwal
OOD
OffRL
364
0
0
07 Nov 2025
Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning
Hampus Åström
Elin Anna Topp
Jacek Malec
OffRL
73
0
0
06 Nov 2025
Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
Georgios Tzannetos
Parameswaran Kamalaruban
Adish Singla
64
1
0
04 Nov 2025
Heterogeneous Adversarial Play in Interactive Environments
Manjie Xu
Xinyi Yang
Jiayu Zhan
Wei Liang
Chi Zhang
Yixin Zhu
105
0
0
21 Oct 2025
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Kathryn Wantlin
Chongyi Zheng
Benjamin Eysenbach
108
0
0
20 Oct 2025
BuilderBench -- A benchmark for generalist agents
Raj Ghugare
Catherine Ji
Kathryn Wantlin
Jin Schofield
Benjamin Eysenbach
72
0
0
07 Oct 2025
General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks
Fahim Shahriar
Cheryl Wang
Alireza Azimi
Gautham Vasan
Hany Hamed Elanwar
A. Rupam Mahmood
Colin Bellinger
52
0
0
06 Oct 2025
STAIR: Addressing Stage Misalignment through Temporal-Aligned Preference Reinforcement Learning
Yao Luan
Ni Mu
Yiqin Yang
Bo Xu
Qing-Shan Jia
69
0
0
28 Sep 2025
Co-Evolving Complexity: An Adversarial Framework for Automatic MARL Curricula
Brennen Hill
64
0
0
03 Sep 2025
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
Ang Li
Zhihang Yuan
Yang Zhang
Shouda Liu
Yisen Wang
100
3
0
29 Aug 2025
cMALC-D: Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending
Anirudh Satheesh
Keenan Powell
Hua Wei
76
0
0
28 Aug 2025
TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design
Geonwoo Cho
Jaegyun Im
Jihwan Lee
Hojun Yi
Sejin Kim
Sundong Kim
126
0
0
24 Jun 2025
Policy Search, Retrieval, and Composition via Task Similarity in Collaborative Agentic Systems
Saptarshi Nath
Christos Peridis
Eseoghene Benjamin
Hengrong Du
Soheil Kolouri
Peter Kinnell
Zexin Li
Cong Liu
Shirin Dora
Andrea Soltoggio
198
0
0
05 Jun 2025
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
Yifan Sun
Jingyan Shen
Yibin Wang
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
Huan Zhang
293
9
0
05 Jun 2025
Deep learning image burst stacking to reconstruct high-resolution ground-based solar observations
Christoph Schirninger
Robert Jarolim
Astrid M. Veronig
Christoph Kuckein
267
2
0
05 Jun 2025
ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real
Youwei Yu
Junhong Xu
Lantao Liu
260
2
0
02 Jun 2025
Normalizing Flows are Capable Models for RL
Raj Ghugare
Benjamin Eysenbach
OffRL
AI4CE
306
4
0
29 May 2025
Prior Reinforce: Mastering Agile Tasks with Limited Trials
Yihang Hu
Pingyue Sheng
Shengjie Wang
Yang Gao
Yang Gao
196
0
0
28 May 2025
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
V. Wang
Tinghuai Wang
Joni Pajarinen
BDL
120
1
0
27 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
273
4
0
26 May 2025
Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning
Nicolas Castanet
Olivier Sigaud
Sylvain Lamprier
OffRL
320
0
0
23 May 2025
Self-Evolving Curriculum for LLM Reasoning
Xiaoyin Chen
Jiarui Lu
Minsu Kim
Dinghuai Zhang
Jian Tang
Alexandre Piché
Nicolas Angelard-Gontier
Yoshua Bengio
Ehsan Kamalloo
ReLM
LRM
486
23
0
20 May 2025
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution
International Conference on Intelligent Computing (ICIC), 2025
Yufei Lin
Chengwei Ye
Ning Yang
Kangsheng Wang
Linuo Xu
Shuyan Liu
Zeyu Zhang
211
2
0
08 May 2025
MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning
Itamar Mishani
Yorai Shaoul
Maxim Likhachev
190
2
0
23 Apr 2025
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Fikrican Özgür
René Zurbrugg
Suryansh Kumar
233
0
0
15 Apr 2025
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Taiwei Shi
Yiyang Wu
Linxin Song
Wanrong Zhu
Jieyu Zhao
LRM
331
48
0
07 Apr 2025
MultiClear: Multimodal Soft Exoskeleton Glove for Transparent Object Grasping Assistance
Towards Autonomous Robotic Systems (TAROS), 2025
Chen Hu
Timothy Neate
Shan Luo
Letizia Gionfrida
198
40
0
04 Apr 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
190
1
0
02 Apr 2025
Causally Aligned Curriculum Learning
International Conference on Learning Representations (ICLR), 2025
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
234
6
0
21 Mar 2025
Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning
Linji Wang
Tong Xu
Yuanjie Lu
Xuesu Xiao
253
1
0
19 Mar 2025
ForceGrip: Reference-Free Curriculum Learning for Realistic Grip Force Control in VR Hand Manipulation
DongHeun Han
Byungmin Kim
RoUn Lee
KyeongMin Kim
Hyoseok Hwang
HyeongYeop Kang
461
0
0
11 Mar 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
359
2
0
28 Jan 2025
Learn2Mix: Training Neural Networks Using Adaptive Data Integration
Shyam Venkatasubramanian
Vahid Tarokh
314
2
0
21 Dec 2024
Eurekaverse: Environment Curriculum Generation via Large Language Models
Conference on Robot Learning (CoRL), 2024
William Liang
Sam Wang
Hung-Ju Wang
Osbert Bastani
Dinesh Jayaraman
Yecheng Jason Ma
SyDa
262
4
0
04 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Wensen Mao
He Zhu
211
5
0
03 Nov 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
333
0
0
03 Nov 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
International Conference on Learning Representations (ICLR), 2025
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
351
18
0
30 Oct 2024
GUIDEd Agents: Enhancing Navigation Policies through Task-Specific Uncertainty Abstraction in Localization-Limited Environments
Gokul Puthumanaillam
Paulo Padrao
Jose Fuentes
Leonardo Bobadilla
Melkior Ornik
261
3
0
19 Oct 2024
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Andrew Levy
A. Allievi
George Konidaris
203
1
0
15 Oct 2024
AFlow: Automating Agentic Workflow Generation
International Conference on Learning Representations (ICLR), 2025
Jiayi Zhang
Jinyu Xiang
Zhaoyang Yu
Xinbing Liang
Xionghui Chen
...
Jinlin Wang
Bingnan Zheng
Bang Liu
Yuyu Luo
Chenglin Wu
AIFin
AI4CE
275
6
0
14 Oct 2024
Words as Beacons: Guiding RL Agents with High-Level Language Prompts
Unai Ruiz-Gonzalez
Alain Andres
Pedro G. Bascoy
Javier Del Ser
167
2
0
11 Oct 2024
Goal-Conditioned Terminal Value Estimation for Real-time and Multi-task Model Predictive Control
Mitsuki Morita
Satoshi Yamamori
Satoshi Yagi
Norikazu Sugimoto
Jun Morimoto
190
0
0
07 Oct 2024
Bridging SFT and DPO for Diffusion Model Alignment with Self-Sampling Preference Optimization
Daoan Zhang
Guangchen Lan
Dong-Jun Han
Wenlin Yao
Xiaoman Pan
...
Mingxiao Li
Pengcheng Chen
Yu Dong
Christopher G. Brinton
Jiebo Luo
EGVM
280
8
0
07 Oct 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Conference on Robot Learning (CoRL), 2024
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
204
0
0
06 Sep 2024
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery
Neural Information Processing Systems (NeurIPS), 2024
Alexander Rutherford
Michael Beukman
Timon Willi
Bruno Lacerda
Nick Hawes
Jakob Foerster
229
19
0
27 Aug 2024
Online Optimization of Curriculum Learning Schedules using Evolutionary Optimization
Mohit Jiwatode
Leon Schlecht
Alexander Dockhorn
176
0
0
12 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
International Conference on Learning Representations (ICLR), 2025
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
347
8
0
11 Aug 2024
REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability
Shuang Ao
Simon Khan
Haris Aziz
Flora D. Salim
436
0
0
20 Jun 2024
Learning telic-controllable state representations
Nadav Amir
Stas Tiomkin
249
1
0
20 Jun 2024