Behavior From the Void: Unsupervised Active Pre-Training

Neural Information Processing Systems (NeurIPS), 2021
8 March 2021
Hao Liu
Pieter Abbeel
VLM, SSL
arXiv: 2103.04551

Papers citing "Behavior From the Void: Unsupervised Active Pre-Training"

50 / 146 papers shown
Discover, Learn, and Reinforce: Scaling Vision-Language-Action Pretraining with Diverse RL-Generated Trajectories
Rushuai Yang
Zhiyuan Feng
Tianxiang Zhang
Kaixin Wang
Chuheng Zhang
Li Zhao
Xiu Su
Yi-Ling Chen
Jiang Bian
OffRL
257
0
0
24 Nov 2025
From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy
Feng He
Guodong Tan
Qiankun Li
Jun Yu
Quan Wen
162
0
0
26 Oct 2025
Reference Grounded Skill Discovery
Seungeun Rho
Aaron Trinh
Danfei Xu
Sehoon Ha
211
0
0
07 Oct 2025
Information-Theoretic Policy Pre-Training with Empowerment
Moritz Schneider
Robert Krug
Narunas Vaskevicius
Luigi Palmieri
Michael Volpp
Joschka Boedecker
OffRL
171
1
0
07 Oct 2025
Embodied AI: From LLMs to World Models
Tongtong Feng
Xin Wang
Yu Jiang
Wenwu Zhu
LM&Ro
461
25
0
24 Sep 2025
Learning Acrobatic Flight from Preferences
Colin Merk
Ismail Geles
Jiaxu Xing
Angel Romero
Giorgia Ramponi
Davide Scaramuzza
177
0
0
26 Aug 2025
Self-Questioning Language Models
Lili Chen
Mihir Prabhudesai
Katerina Fragkiadaki
Hao Liu
Deepak Pathak
ReLM, SyDa, LRM
523
26
0
05 Aug 2025
Provable Maximum Entropy Manifold Exploration via Diffusion Models
Riccardo De Santi
Marin Vlastelica
Ya-Ping Hsieh
Zebang Shen
Niao He
Andreas Krause
DiffM
246
8
0
18 Jun 2025
Reward Models in Deep Reinforcement Learning: A Survey
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Rui Yu
Shenghua Wan
Yucen Wang
Chen-Xiao Gao
Le Gan
Zongzhang Zhang
De-Chuan Zhan
OffRL
231
18
0
18 Jun 2025
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Yucheng Yang
Tianyi Zhou
Qiang He
Lei Han
Mykola Pechenizkiy
Meng Fang
SSL
343
12
0
12 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
403
0
0
06 Jun 2025
Trajectory First: A Curriculum for Discovering Diverse Policies
Cornelius V. Braun
Sayantan Auddy
Marc Toussaint
374
1
0
02 Jun 2025
State-Covering Trajectory Stitching for Diffusion Planners
Kyowoon Lee
Jaesik Choi
OffRL
476
5
0
01 Jun 2025
Maximizing Confidence Alone Improves Reasoning
Mihir Prabhudesai
Lili Chen
Alex Ippoliti
Katerina Fragkiadaki
Hao Liu
Deepak Pathak
OOD, OffRL, ReLM, LRM
653
66
0
28 May 2025
DSADF: Thinking Fast and Slow for Decision Making
Alex Zhihao Dou
Dongfei Cui
Jun Yan
Wei Wang
Benteng Chen
Haoming Wang
Zeke Xie
Shufei Zhang
OffRL
640
5
0
13 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
526
3
0
02 May 2025
An Information-Geometric Approach to Artificial Curiosity
Alexander Nedergaard
Pablo A. Morales
284
1
0
08 Apr 2025
Intrinsically-Motivated Humans and Agents in Open-World Exploration
Aly Lidayan
Yuqing Du
Eliza Kosoy
Maria Rufova
Pieter Abbeel
Alison Gopnik
507
8
0
31 Mar 2025
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Mohit Pandey
G. Subbaraj
Artem Cherkasov
Martin Ester
Emmanuel Bengio
AI4CE
638
7
0
08 Mar 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
369
4
0
06 Feb 2025
Episodic Novelty Through Temporal Distance
International Conference on Learning Representations (ICLR), 2025
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
401
11
0
28 Jan 2025
The impact of intrinsic rewards on exploration in Reinforcement Learning
Aya Kayal
Eduardo Pignatelli
Laura Toni
305
8
0
20 Jan 2025
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Neural Information Processing Systems (NeurIPS), 2024
Zizhao Wang
Jiaheng Hu
Caleb Chuck
Stephen Chen
Roberto Martín-Martín
Amy Zhang
S. Niekum
Peter Stone
OffRL
354
12
0
24 Oct 2024
Learning Versatile Skills with Curriculum Masking
Neural Information Processing Systems (NeurIPS), 2024
Yao Tang
Zhihui Xie
Zichuan Lin
Deheng Ye
Shuai Li
OffRL
403
3
0
23 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL, OffRL, OnRL
898
8
0
23 Oct 2024
Effective Exploration Based on the Structural Information Principles
Neural Information Processing Systems (NeurIPS), 2024
Xianghua Zeng
Hao Peng
Angsheng Li
184
10
0
09 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
401
12
0
03 Oct 2024
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRL, SSL
338
5
0
01 Oct 2024
GFlowNet Pretraining with Inexpensive Rewards
Mohit Pandey
G. Subbaraj
Emmanuel Bengio
AI4CE
256
6
0
15 Sep 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
301
11
0
27 Aug 2024
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods
Ric De Santi
Manish Prajapat
Andreas Krause
334
13
0
13 Jul 2024
Constrained Intrinsic Motivation for Reinforcement Learning
Xiang Zheng
Jie Zhang
Chao Shen
Cong Wang
322
5
0
12 Jul 2024
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
291
12
0
11 Jul 2024
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang
Weitong Zhang
Dongruo Zhou
Q. Gu
498
6
0
24 Jun 2024
The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
358
7
0
18 Jun 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
382
1
0
14 Jun 2024
Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Luckeciano C. Melo
P. Tigas
Alessandro Abate
Yarin Gal
275
19
0
14 Jun 2024
Language Guided Skill Discovery
International Conference on Learning Representations (ICLR), 2024
Seungeun Rho
Laura Smith
Tianyu Li
Sergey Levine
Xue Bin Peng
Sehoon Ha
LM&Ro
290
15
0
07 Jun 2024
Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning
Jiaxu Wang
Ziyi Zhang
Qiang Zhang
Jia Li
Jingkai Sun
Mingyuan Sun
Junhao He
Zhanchen Zhu
3DGS
453
6
0
04 Jun 2024
How to Explore with Belief: State Entropy Maximization in POMDPs
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
287
6
0
04 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
651
6
0
01 Jun 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
492
9
0
25 May 2024
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Xuezhou Xu
Hang Su
Xingxing Zhang
Jun Zhu
394
14
0
23 May 2024
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
305
4
0
20 May 2024
Decoupling Exploration and Exploitation for Unsupervised Pre-training with Successor Features
IEEE International Joint Conference on Neural Networks (IJCNN), 2024
JaeYoon Kim
Junyu Xuan
Christy Jie Liang
F. Hussain
264
1
0
04 May 2024
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
International Conference on Machine Learning (ICML), 2024
Luca Grillotti
Maxence Faldor
Borja G. León
Antoine Cully
501
12
0
15 Mar 2024
RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences
Jie Cheng
Gang Xiong
Xingyuan Dai
Qinghai Miao
Yisheng Lv
Fei-Yue Wang
368
37
0
27 Feb 2024
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL, OffRL
437
68
0
23 Feb 2024
SLIM: Skill Learning with Multiple Critics
David Emukpere
Bingbing Wu
Julien Perez
J. Renders
330
3
0
01 Feb 2024
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
474
3
0
31 Jan 2024