Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.12142
Cited By
IQ-Learn: Inverse soft-Q Learning for Imitation
23 June 2021
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IQ-Learn: Inverse soft-Q Learning for Imitation"
50 / 134 papers shown
Title
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
57
0
0
04 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
25
0
0
17 Apr 2025
Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global Optimality
Ruijia Zhang
Siliang Zeng
Chenliang Li
Alfredo García
Mingyi Hong
56
0
0
22 Mar 2025
From Demonstrations to Rewards: Alignment Without Explicit Human Preferences
Siliang Zeng
Yao Liu
Huzefa Rangwala
George Karypis
Mingyi Hong
Rasool Fakoor
39
2
0
15 Mar 2025
Residual Policy Gradient: A Reward View of KL-regularized Objective
Pengcheng Wang
Xinghao Zhu
Yuxin Chen
Chenfeng Xu
M. Tomizuka
Chenran Li
36
0
0
14 Mar 2025
Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems
Mahfuz Ahmed Anik
Abdur Rahman
Azmine Toushik Wasi
Md Manjurul Ahsan
47
0
0
05 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
74
0
0
27 Feb 2025
RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning
Adib Karimi
Mohammad Mehdi Ebadzadeh
OOD
37
0
0
27 Feb 2025
Hierarchical Imitation Learning of Team Behavior from Heterogeneous Demonstrations
Sangwon Seo
Vaibhav Unhelkar
59
1
0
24 Feb 2025
Learning Strategy Representation for Imitation Learning in Multi-Agent Games
Shiqi Lei
Kanghon Lee
Linjing Li
Jinkyoo Park
OffRL
42
0
0
17 Feb 2025
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation
Minwoo Kim
Geunsik Bae
Jinwoo Lee
Woojae Shin
Changseung Kim
Myong-Yol Choi
Heejung Shin
H. Oh
74
0
0
04 Feb 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
48
0
0
30 Jan 2025
Inverse Reinforcement Learning with Switching Rewards and History Dependency for Characterizing Animal Behaviors
Jingyang Ke
Feiyang Wu
Jiyi Wang
Jeffrey Markowitz
Anqi Wu
73
0
0
22 Jan 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
35
2
0
17 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
67
0
0
04 Jan 2025
OMG-RL:Offline Model-based Guided Reward Learning for Heparin Treatment
Yooseok Lim
Sujee Lee
OffRL
137
0
0
03 Jan 2025
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
39
1
0
11 Nov 2024
The Role of Domain Randomization in Training Diffusion Policies for Whole-Body Humanoid Control
Oleg Kaidanov
Firas Al-Hafez
Yusuf Suvari
Boris Belousov
Jan Peters
27
0
0
02 Nov 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
25
1
0
01 Nov 2024
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction
Utsav Singh
Souradip Chakraborty
Wesley A. Suttle
Brian M. Sadler
Anit Kumar Sahu
Mubarak Shah
Vinay P. Namboodiri
Amrit Singh Bedi
36
1
0
01 Nov 2024
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
Weichao Zhou
Wenchao Li
26
0
0
31 Oct 2024
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
Shicheng Liu
Minghui Zhu
49
1
0
21 Oct 2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
37
3
0
17 Oct 2024
Reward-free World Models for Online Imitation Learning
Shangzhe Li
Zhiao Huang
H. Su
OffRL
63
1
0
17 Oct 2024
UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations
Huy Hoang
Tien Mai
Pradeep Varakantham
OffRL
22
0
0
10 Oct 2024
Diffusion Imitation from Observation
Bo-Ruei Huang
Chun-Kai Yang
Chun-Mao Lai
Dai-Jie Wu
Shao-Hua Sun
31
4
0
07 Oct 2024
Robust Offline Imitation Learning from Diverse Auxiliary Data
Udita Ghosh
Dripta S. Raychaudhuri
Jiachen Li
Konstantinos Karydis
A. Roy-Chowdhury
OffRL
24
0
0
04 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
22
2
0
03 Oct 2024
The unknotting number, hard unknot diagrams, and reinforcement learning
Taylor Applebaum
Sam Blackwell
Alex Davies
Thomas Edlich
András Juhász
Marc Lackenby
Nenad Tomašev
Daniel Zheng
16
3
0
13 Sep 2024
Imitating Language via Scalable Inverse Reinforcement Learning
Markus Wulfmeier
Michael Bloesch
Nino Vieillard
Arun Ahuja
Jorg Bornschein
...
Jost Tobias Springenberg
Nikola Momchev
Olivier Bachem
Matthieu Geist
Martin Riedmiller
34
9
0
02 Sep 2024
Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation
Woo Kyung Kim
Minjong Yoo
Honguk Woo
OffRL
28
0
0
22 Aug 2024
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
Ashutosh Nayyar
OffRL
19
0
0
17 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
36
5
0
29 Jul 2024
Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion
Henri-Jacques Geiss
Firas Al-Hafez
Andre Seyfarth
Jan Peters
Davide Tateo
18
2
0
16 Jul 2024
Preserving the Privacy of Reward Functions in MDPs through Deception
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
21
0
0
13 Jul 2024
Safety through feedback in Constrained RL
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
OffRL
35
1
0
28 Jun 2024
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
Yuxin Chen
Chen Tang
Chenran Li
Ran Tian
Peter Stone
M. Tomizuka
Wei Zhan
21
1
0
24 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
18
4
0
13 Jun 2024
RILe: Reinforced Imitation Learning
Mert Albaba
Sammy Christen
Christoph Gebhardt
Thomas Langarek
Otmar Hilliges
Otmar Hilliges
42
1
0
12 Jun 2024
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
Andreas Schlaginhaufen
Maryam Kamgarpour
OffRL
21
1
0
03 Jun 2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
32
8
0
25 May 2024
Inference of Utilities and Time Preference in Sequential Decision-Making
Haoyang Cao
Zhengqi Wu
Renyuan Xu
19
0
0
24 May 2024
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
Angeliki Kamoutsi
Peter Schmitt-Förster
Tobias Sutter
V. Cevher
John Lygeros
39
0
0
24 May 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
32
2
0
24 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
40
2
0
23 May 2024
Efficient Imitation Learning with Conservative World Models
Victor Kolev
Rafael Rafailov
Kyle Hatch
Jiajun Wu
Chelsea Finn
OffRL
27
5
0
21 May 2024
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
OffRL
33
1
0
20 May 2024
Imitation Learning in Discounted Linear MDPs without exploration assumptions
Luca Viano
Stratis Skoulakis
V. Cevher
30
3
0
03 May 2024
Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning
Qiyuan Liu
19
0
0
02 May 2024
IDIL: Imitation Learning of Intent-Driven Expert Behavior
Sangwon Seo
Vaibhav Unhelkar
23
3
0
25 Apr 2024
1
2
3
Next