ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.12142
  4. Cited By
IQ-Learn: Inverse soft-Q Learning for Imitation

IQ-Learn: Inverse soft-Q Learning for Imitation

23 June 2021
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
ArXivPDFHTML

Papers citing "IQ-Learn: Inverse soft-Q Learning for Imitation"

34 / 134 papers shown
Title
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the
  MineRL BASALT 2022 Competition
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
...
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
J. Miller
Rohin Shah
19
16
0
23 Mar 2023
How To Guide Your Learner: Imitation Learning with Active Adaptive
  Expert Involvement
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Xu-Hui Liu
Feng Xu
Xinyu Zhang
Tianyuan Liu
Shengyi Jiang
Rui Chen
Zongzhang Zhang
Yang Yu
32
11
0
03 Mar 2023
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning
Firas Al-Hafez
Davide Tateo
O. Arenz
Guoping Zhao
Jan Peters
13
22
0
01 Mar 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation
  Learning
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
14
19
0
16 Feb 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
24
13
0
15 Feb 2023
CLARE: Conservative Model-Based Reward Learning for Offline Inverse
  Reinforcement Learning
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning
Sheng Yue
Guan-Bo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Junkai Ren
Junshan Zhang
OffRL
23
20
0
09 Feb 2023
Visual Imitation Learning with Patch Rewards
Visual Imitation Learning with Patch Rewards
Minghuan Liu
Tairan He
Weinan Zhang
Shuicheng Yan
Zhongwen Xu
SSL
8
13
0
02 Feb 2023
Hierarchical Imitation Learning with Vector Quantized Models
Hierarchical Imitation Learning with Vector Quantized Models
Kalle Kujanpää
J. Pajarinen
Alexander Ilin
9
12
0
30 Jan 2023
Extreme Q-Learning: MaxEnt RL without Entropy
Extreme Q-Learning: MaxEnt RL without Entropy
Divyansh Garg
Joey Hejna
M. Geist
Stefano Ermon
OffRL
23
63
0
05 Jan 2023
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Daniel Shin
Anca Dragan
Daniel S. Brown
OffRL
6
53
0
03 Jan 2023
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
29
42
0
04 Oct 2022
Structural Estimation of Markov Decision Processes in High-Dimensional
  State Space with Finite-Time Guarantees
Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees
Siliang Zeng
Mingyi Hong
Alfredo García
OffRL
33
12
0
04 Oct 2022
Reinforcement Learning with Non-Exponential Discounting
Reinforcement Learning with Non-Exponential Discounting
M. Schultheis
Constantin Rothkopf
Heinz Koeppl
19
11
0
27 Sep 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization
  Perspective
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
13
1
0
26 Sep 2022
Proximal Point Imitation Learning
Proximal Point Imitation Learning
Luca Viano
Angeliki Kamoutsi
Gergely Neu
Igor Krawczuk
V. Cevher
20
14
0
22 Sep 2022
On the convex formulations of robust Markov decision processes
On the convex formulations of robust Markov decision processes
Julien Grand-Clément
Marek Petrik
44
10
0
21 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
80
0
0
09 Sep 2022
Basis for Intentions: Efficient Inverse Reinforcement Learning using
  Past Experience
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Marwa Abdulhai
Natasha Jaques
Sergey Levine
OffRL
9
5
0
09 Aug 2022
Lagrangian Method for Q-Function Learning (with Applications to Machine
  Translation)
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
Bojun Huang
11
1
0
22 Jul 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal
  Demonstrations
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
24
65
0
20 Jul 2022
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Shunyu Liu
Kaixuan Chen
Na Yu
Jie Song
Zunlei Feng
Mingli Song
31
1
0
05 Jul 2022
Target-absent Human Attention
Target-absent Human Attention
Zhibo Yang
Sounak Mondal
Seoyoung Ahn
G. Zelinsky
Minh Hoai
Dimitris Samaras
11
17
0
04 Jul 2022
Discriminator-Guided Model-Based Offline Imitation Learning
Discriminator-Guided Model-Based Offline Imitation Learning
Wenjia Zhang
Haoran Xu
Haoyi Niu
Peng Cheng
Ming Li
Heming Zhang
Guyue Zhou
Xianyuan Zhan
OffRL
8
16
0
01 Jul 2022
Learning Agile Skills via Adversarial Imitation of Rough Partial
  Demonstrations
Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations
Chenhao Li
Marin Vlastelica
Sebastian Blaes
Jonas Frey
F. Grimminger
Georg Martius
8
63
0
23 Jun 2022
Auto-Encoding Adversarial Imitation Learning
Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang
Rui Zhao
Ziming Zhang
Yang Gao
14
1
0
22 Jun 2022
Benchmarking Constraint Inference in Inverse Reinforcement Learning
Benchmarking Constraint Inference in Inverse Reinforcement Learning
Guiliang Liu
Yudong Luo
A. Gaurav
K. Rezaee
Pascal Poupart
28
22
0
20 Jun 2022
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Fan Luo
Xingchen Cao
Rong-Jun Qin
Yang Yu
14
2
0
01 Jun 2022
Retrospective on the 2021 BASALT Competition on Learning from Human
  Feedback
Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Rohin Shah
Steven H. Wang
Cody Wild
Stephanie Milani
Anssi Kanervisto
...
Alexander Fries
Alexandra Souly
Chan Jun Shern
Daniel del Castillo
Tom Lieberum
LLMAG
OffRL
11
10
0
14 Apr 2022
LISA: Learning Interpretable Skill Abstractions from Language
LISA: Learning Interpretable Skill Abstractions from Language
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro
OffRL
142
29
0
28 Feb 2022
LobsDICE: Offline Learning from Observation via Stationary Distribution
  Correction Estimation
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
Geon-hyeong Kim
Jongmin Lee
Youngsoo Jang
Hongseok Yang
Kyungmin Kim
OffRL
15
15
0
28 Feb 2022
Imitation Learning by State-Only Distribution Matching
Imitation Learning by State-Only Distribution Matching
Damian Boborzi
C. Straehle
Jens S. Buchner
Lars Mikelsons
OOD
OffRL
15
4
0
09 Feb 2022
A Ranking Game for Imitation Learning
A Ranking Game for Imitation Learning
Harshit S. Sikchi
Akanksha Saran
Wonjoon Goo
S. Niekum
OffRL
14
22
0
07 Feb 2022
A Critique of Strictly Batch Imitation Learning
A Critique of Strictly Batch Imitation Learning
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
OffRL
14
4
0
05 Oct 2021
Imitation Learning with Human Eye Gaze via Multi-Objective Prediction
Imitation Learning with Human Eye Gaze via Multi-Objective Prediction
Ravi Kumar Thakur
M. Sunbeam
Vinicius G. Goecks
Ellen R. Novoseller
R. Bera
Vernon J. Lawhern
Gregory M. Gremillion
J. Valasek
Nicholas R. Waytowich
11
5
0
25 Feb 2021
Previous
123