v1v2v3v4 (latest)

Behavior From the Void: Unsupervised Active Pre-Training

Neural Information Processing Systems (NeurIPS), 2021

8 March 2021

Pieter Abbeel

Papers citing "Behavior From the Void: Unsupervised Active Pre-Training"

50 / 146 papers shown

Discover, Learn, and Reinforce: Scaling Vision-Language-Action Pretraining with Diverse RL-Generated Trajectories

257

24 Nov 2025

From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy

162

26 Oct 2025

Reference Grounded Skill Discovery

211

07 Oct 2025

Information-Theoretic Policy Pre-Training with Empowerment

171

07 Oct 2025

Embodied AI: From LLMs to World Models

461

24 Sep 2025

Learning Acrobatic Flight from Preferences

177

26 Aug 2025

Self-Questioning Language Models

523

05 Aug 2025

Provable Maximum Entropy Manifold Exploration via Diffusion Models

246

18 Jun 2025

Reward Models in Deep Reinforcement Learning: A SurveyInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

231

18 Jun 2025

Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025

343

12 Jun 2025

AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification

403

06 Jun 2025

Trajectory First: A Curriculum for Discovering Diverse Policies

Cornelius V. Braun

Sayantan Auddy

Marc Toussaint

374

02 Jun 2025

State-Covering Trajectory Stitching for Diffusion Planners

Kyowoon Lee

Jaesik Choi

OffRL

476

01 Jun 2025

Maximizing Confidence Alone Improves Reasoning

653

28 May 2025

DSADF: Thinking Fast and Slow for Decision Making

640

13 May 2025

Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story

526

02 May 2025

An Information-Geometric Approach to Artificial Curiosity

Alexander Nedergaard

Pablo A. Morales

284

08 Apr 2025

Intrinsically-Motivated Humans and Agents in Open-World Exploration

507

31 Mar 2025

Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

638

08 Mar 2025

Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025

369

06 Feb 2025

Episodic Novelty Through Temporal DistanceInternational Conference on Learning Representations (ICLR), 2025

...

401

28 Jan 2025

The impact of intrinsic rewards on exploration in Reinforcement Learning

Aya Kayal

Eduardo Pignatelli

Laura Toni

305

20 Jan 2025

SkiLD: Unsupervised Skill Discovery Guided by Factor InteractionsNeural Information Processing Systems (NeurIPS), 2024

Roberto Martín-Martín

354

24 Oct 2024

Learning Versatile Skills with Curriculum MaskingNeural Information Processing Systems (NeurIPS), 2024

403

23 Oct 2024

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

898

23 Oct 2024

Effective Exploration Based on the Structural Information PrinciplesNeural Information Processing Systems (NeurIPS), 2024

Xianghua Zeng

Hao Peng

Angsheng Li

184

09 Oct 2024

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Chang Liu

401

03 Oct 2024

Contrastive Abstraction for Reinforcement Learning

Vihang Patil

M. Hofmarcher

Elisabeth Rumetshofer

Sepp Hochreiter

OffRL SSL

338

01 Oct 2024

GFlowNet Pretraining with Inexpensive Rewards

256

15 Sep 2024

Unsupervised-to-Online Reinforcement Learning

Junsu Kim

Seohong Park

Sergey Levine

OnRL

301

27 Aug 2024

Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

Ric De Santi

Manish Prajapat

Andreas Krause

334

13 Jul 2024

Constrained Intrinsic Motivation for Reinforcement Learning

Xiang Zheng

Jie Zhang

Chao Shen

Cong Wang

322

12 Jul 2024

TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations

Junik Bae

Kwanyoung Park

Youngwoon Lee

291

11 Jul 2024

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

498

24 Jun 2024

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

358

18 Jun 2024

Exploration by Learning Diverse Skills through Successor State Measures

Paul-Antoine Le Tolguenec

Yann Besse

Florent Teichteil-Königsbuch

Dennis G. Wilson

Emmanuel Rachelson

382

14 Jun 2024

Deep Bayesian Active Learning for Preference Modeling in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024

Luckeciano C. Melo

P. Tigas

Alessandro Abate

Yarin Gal

275

14 Jun 2024

Language Guided Skill DiscoveryInternational Conference on Learning Representations (ICLR), 2024

Laura Smith

290

07 Jun 2024

Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning

453

04 Jun 2024

How to Explore with Belief: State Entropy Maximization in POMDPs

287

04 Jun 2024

Do's and Don'ts: Learning Desirable Skills with Instruction Videos

651

01 Jun 2024

Constrained Ensemble Exploration for Unsupervised Skill Discovery

Xuelong Li

492

25 May 2024

PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

Hang Su

Jun Zhu

394

23 May 2024

Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning

Xin Liu

Yaran Chen

Dong Zhao

305

20 May 2024

Decoupling Exploration and Exploitation for Unsupervised Pre-training with Successor FeaturesIEEE International Joint Conference on Neural Network (IJCNN), 2024

264

04 May 2024

Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features CriticsInternational Conference on Machine Learning (ICML), 2024

501

15 Mar 2024

RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences

Jie Cheng

Gang Xiong

Xingyuan Dai

Qinghai Miao

Yisheng Lv

Fei-Yue Wang

368

27 Feb 2024

Foundation Policies with Hilbert Representations

437

23 Feb 2024

SLIM: Skill Learning with Multiple Critics

330

01 Feb 2024

Behind the Myth of Exploration in Policy Gradients

Adrien Bolland

Gaspard Lambrechts

Damien Ernst

474

31 Jan 2024