Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.11972
Cited By
imitation: Clean Imitation Learning Implementations
22 November 2022
Adam Gleave
Mohammad Taufeeque
Juan Rocamonde
Erik Jenner
Steven H. Wang
Sam Toyer
M. Ernestus
Nora Belrose
Scott Emmons
Stuart J. Russell
MLAU
Re-assign community
ArXiv
PDF
HTML
Papers citing
"imitation: Clean Imitation Learning Implementations"
24 / 24 papers shown
Title
Whenever, Wherever: Towards Orchestrating Crowd Simulations with Spatio-Temporal Spawn Dynamics
T. Kreutz
M. Mühlhäuser
Alejandro Sánchez Guinea
37
0
0
20 Mar 2025
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies
William Solow
Sandhya Saisubramanian
Alan Fern
OffRL
66
0
0
26 Feb 2025
CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models
Zihao Sheng
Zilin Huang
Yansong Qu
Yue Leng
Sruthi Bhavanam
Sikai Chen
46
2
0
24 Feb 2025
X-IL: Exploring the Design Space of Imitation Learning Policies
Xiaogang Jia
Atalay Donat
Xi Huang
Xuan Zhao
Denis Blessing
...
Han A. Wang
Hanyi Zhang
Qian Wang
Rudolf Lioutikov
Gerhard Neumann
81
1
0
20 Feb 2025
Learning Transparent Reward Models via Unsupervised Feature Selection
Daulet Baimukashev
G. Alcan
K. Luck
Ville Kyrki
SSL
OffRL
36
0
0
24 Oct 2024
Diffusion Imitation from Observation
Bo-Ruei Huang
Chun-Kai Yang
Chun-Mao Lai
Dai-Jie Wu
Shao-Hua Sun
39
4
0
07 Oct 2024
A Graph-based Adversarial Imitation Learning Framework for Reliable & Realtime Fleet Scheduling in Urban Air Mobility
Prithvi Poddar
Steve Paul
Souma Chowdhury
AI4TS
30
0
0
16 Jul 2024
Preserving the Privacy of Reward Functions in MDPs through Deception
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
35
0
0
13 Jul 2024
"Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
OffRL
28
3
0
05 Jun 2024
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
Zelei Cheng
Xian Wu
Jiahao Yu
Sabrina Yang
Gang Wang
Xinyu Xing
OffRL
26
2
0
05 May 2024
Human-compatible driving partners through data-regularized self-play reinforcement learning
Daphne Cornelisse
Eugene Vinitsky
35
6
0
28 Mar 2024
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory
Tianjiao Luo
Tim Pearce
Huayu Chen
Jianfei Chen
Jun Zhu
19
2
0
26 Feb 2024
Synergistic Reinforcement and Imitation Learning for Vision-driven Autonomous Flight of UAV Along River
Zihan Wang
Jianwen Li
N. Mahmoudian
31
0
0
17 Jan 2024
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
Rohin Shah
21
6
0
05 Dec 2023
Dynamic value alignment through preference aggregation of multiple objectives
Marcin Korecki
Damian Dailisan
Cesare Carissimo
33
0
0
09 Oct 2023
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
Yannick Metz
David Lindner
Raphael Baur
Daniel A. Keim
Mennatallah El-Assady
AI4CE
32
10
0
08 Aug 2023
Residual Q-Learning: Offline and Online Policy Customization without Value
Chenran Li
Chen Tang
Haruki Nishimura
Jean-Pierre Mercat
M. Tomizuka
Wei Zhan
OffRL
25
6
0
15 Jun 2023
Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning
Bryan C. Brandt
P. Dasgupta
26
1
0
14 Apr 2023
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
...
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
J. Miller
Rohin Shah
30
16
0
23 Mar 2023
Safe Imitation Learning of Nonlinear Model Predictive Control for Flexible Robots
Shamil Mamedov
Rudolf Reiter
Seyed Mahdi Basiri Azad
Joschka Boedecker
Moritz Diehl
Jan Swevers
6
2
0
06 Dec 2022
Environment Design for Inverse Reinforcement Learning
Thomas Kleine Buening
Victor Villin
Christos Dimitrakakis
30
1
0
26 Oct 2022
Models of human preference for learning reward functions
W. B. Knox
Stephane Hatgis-Kessell
Serena Booth
S. Niekum
Peter Stone
A. Allievi
27
42
0
05 Jun 2022
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Fan Luo
Xingchen Cao
Rong-Jun Qin
Yang Yu
14
2
0
01 Jun 2022
Learning rewards for robotic ultrasound scanning using probabilistic temporal ranking
Michael G. Burke
Katie Lu
Daniel Angelov
Artūras Straižys
Craig Innes
Kartic Subr
S. Ramamoorthy
6
10
0
04 Feb 2020
1