ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.11248
  4. Cited By
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning

Learning Robust Rewards with Adversarial Inverse Reinforcement Learning

30 October 2017
Justin Fu
Katie Z Luo
Sergey Levine
ArXivPDFHTML

Papers citing "Learning Robust Rewards with Adversarial Inverse Reinforcement Learning"

50 / 148 papers shown
Title
Recursive Deep Inverse Reinforcement Learning
Recursive Deep Inverse Reinforcement Learning
Paul Ghanem
Michael Potter
Owen Howell
Pau Closas
A. Ramezani
Deniz Erdogmus
Tales Imbiriba
20
0
0
17 Apr 2025
Imitation Learning of Correlated Policies in Stackelberg Games
Imitation Learning of Correlated Policies in Stackelberg Games
Kunag-Da Wang
Ping-Chun Hsieh
Wen-Chih Peng
43
0
0
11 Mar 2025
On the Effective Horizon of Inverse Reinforcement Learning
On the Effective Horizon of Inverse Reinforcement Learning
Yiqing Xu
Finale Doshi-Velez
David Hsu
46
0
0
21 Feb 2025
Conditional Prediction by Simulation for Automated Driving
Conditional Prediction by Simulation for Automated Driving
Fabian Konstantinidis
Moritz Sackmann
U. Hofmann
Christoph Stiller
82
0
0
05 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
37
0
0
22 Jan 2025
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep P. Chinchali
Ufuk Topcu
OffRL
92
0
0
02 Dec 2024
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Xingrui Yu
Zhenglin Wan
David Mark Bossens
Yueming Lyu
Qing-Wu Guo
Ivor W. Tsang
133
0
0
11 Nov 2024
Few-Shot Task Learning through Inverse Generative Modeling
Few-Shot Task Learning through Inverse Generative Modeling
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
49
1
0
07 Nov 2024
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
Eric Zhu
Mara Levy
M. Gwilliam
Abhinav Shrivastava
42
0
0
04 Nov 2024
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
Ondrej Biza
Thomas Weng
Lingfeng Sun
Karl Schmeckpeper
Tarik Kelestemur
Yecheng Jason Ma
Robert C. Platt
Jan Willem van de Meent
Lawson L. S. Wong
OffRL
43
0
0
25 Oct 2024
Learning Transparent Reward Models via Unsupervised Feature Selection
Learning Transparent Reward Models via Unsupervised Feature Selection
Daulet Baimukashev
G. Alcan
K. Luck
Ville Kyrki
SSL
OffRL
36
0
0
24 Oct 2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
42
3
0
17 Oct 2024
Learning Causally Invariant Reward Functions from Diverse Demonstrations
Learning Causally Invariant Reward Functions from Diverse Demonstrations
Ivan Ovinnikov
Eugene Bykovets
J. M. Buhmann
CML
33
0
0
12 Sep 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
71
1
0
22 Aug 2024
Affordance-Guided Reinforcement Learning via Visual Prompting
Affordance-Guided Reinforcement Learning via Visual Prompting
Olivia Y. Lee
Annie Xie
Kuan Fang
Karl Pertsch
Chelsea Finn
OffRL
LM&Ro
74
7
0
14 Jul 2024
A Generalized Apprenticeship Learning Framework for Modeling
  Heterogeneous Student Pedagogical Strategies
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Md Mirajul Islam
Xi Yang
J. Hostetter
Adittya Soukarjya Saha
Min Chi
26
1
0
04 Jun 2024
Data Efficient Behavior Cloning for Fine Manipulation via
  Continuity-based Corrective Labels
Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels
Abhay Deshpande
Liyiming Ke
Quinn Pfeifer
Abhishek Gupta
S. Srinivasa
47
1
0
29 May 2024
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis
  via Forward Dynamics Guided 4D Imitation
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Yunze Liu
Changxi Chen
Chenjing Ding
Li Yi
31
6
0
01 Apr 2024
Offline Imitation of Badminton Player Behavior via Experiential Contexts
  and Brownian Motion
Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion
Kuang-Da Wang
Wei-Yao Wang
Ping-Chun Hsieh
Wenjie Peng
OffRL
34
0
0
19 Mar 2024
Globally Stable Neural Imitation Policies
Globally Stable Neural Imitation Policies
Amin Abyaneh
Mariana Sosa Guzmán
Hsiu-Chin Lin
43
2
0
07 Mar 2024
ARMCHAIR: integrated inverse reinforcement learning and model predictive
  control for human-robot collaboration
ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration
Angelo Caregnato-Neto
Luciano Cavalcante Siebert
Arkady Zgonnikov
Marcos Ricardo Omena de Albuquerque Máximo
R. J. Afonso
32
2
0
29 Feb 2024
Transductive Reward Inference on Graph
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing-Wu Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
30
0
0
06 Feb 2024
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
Ryoma Furuyama
Daiki Kuyoshi
Satoshi Yamane
18
0
0
30 Jan 2024
Exploring Gradient Explosion in Generative Adversarial Imitation
  Learning: A Probabilistic Perspective
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
Wanying Wang
Yichen Zhu
Yirui Zhou
Chaomin Shen
Jian Tang
Zhiyuan Xu
Yaxin Peng
Yangchun Zhang
21
4
0
18 Dec 2023
Aligning Human Intent from Imperfect Demonstrations with
  Confidence-based Inverse soft-Q Learning
Aligning Human Intent from Imperfect Demonstrations with Confidence-based Inverse soft-Q Learning
Xizhou Bu
Wenjuan Li
Zhengxiong Liu
Zhiqiang Ma
Panfeng Huang
20
1
0
18 Dec 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
S. Nikolaidis
38
1
0
09 Nov 2023
Learning Reward for Physical Skills using Large Language Model
Learning Reward for Physical Skills using Large Language Model
Yuwei Zeng
Yiqing Xu
28
6
0
21 Oct 2023
CCIL: Continuity-based Data Augmentation for Corrective Imitation
  Learning
CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning
Liyiming Ke
Yunchu Zhang
Abhay Deshpande
S. Srinivasa
Abhishek Gupta
OffRL
21
12
0
19 Oct 2023
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement
  Learning with Sub-optimal Demonstrations
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations
Lu Li
Yuxin Pan
Ruobing Chen
Jie Liu
Zilin Wang
Yu Liu
Zhiheng Li
47
0
0
13 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline
  Reinforcement Learning
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
24
7
0
09 Oct 2023
All by Myself: Learning Individualized Competitive Behaviour with a
  Contrastive Reinforcement Learning optimization
All by Myself: Learning Individualized Competitive Behaviour with a Contrastive Reinforcement Learning optimization
Pablo V. A. Barros
A. Sciutti
SSL
25
3
0
02 Oct 2023
See to Touch: Learning Tactile Dexterity through Visual Incentives
See to Touch: Learning Tactile Dexterity through Visual Incentives
Irmak Güzey
Yinlong Dai
Ben Evans
Soumith Chintala
Lerrel Pinto
23
30
0
21 Sep 2023
Stylized Table Tennis Robots Skill Learning with Incomplete Human
  Demonstrations
Stylized Table Tennis Robots Skill Learning with Incomplete Human Demonstrations
Xiangpei Zhu
Zixuan Chen
Jianyu Chen
11
0
0
16 Sep 2023
Policy Contrastive Imitation Learning
Policy Contrastive Imitation Learning
Jialei Huang
Zhao-Heng Yin
Yingdong Hu
Yang Gao
27
3
0
06 Jul 2023
Curricular Subgoals for Inverse Reinforcement Learning
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
21
0
0
14 Jun 2023
Coherent Soft Imitation Learning
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
30
11
0
25 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement
  Learning
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
28
39
0
22 May 2023
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via
  Geometry-aware Curriculum and Iterative Generalist-Specialist Learning
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning
Weikang Wan
Haoran Geng
Yun-Hai Liu
Zikang Shan
Yaodong Yang
Li Yi
He-Nan Wang
44
94
0
02 Apr 2023
BC-IRL: Learning Generalizable Reward Functions from Demonstrations
BC-IRL: Learning Generalizable Reward Functions from Demonstrations
Andrew Szot
Amy Zhang
Dhruv Batra
Z. Kira
Franziska Meier
OOD
OffRL
31
8
0
28 Mar 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
43
10
0
03 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement
  Learning
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
Ahmed M. Ahmed
Rehaan Ahmad
Chelsea Finn
SSL
48
17
0
02 Mar 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
34
13
0
15 Feb 2023
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Yunke Wang
Bo Du
Chang Xu
23
8
0
13 Feb 2023
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
Allen Z. Ren
Hongkai Dai
Benjamin Burchfiel
Anirudha Majumdar
24
14
0
09 Feb 2023
DITTO: Offline Imitation Learning with World Models
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav
Ram Ramrakhya
Dhruv Batra
Erik Wijmans
Abhishek Das
OffRL
18
53
0
18 Jan 2023
On The Fragility of Learned Reward Functions
On The Fragility of Learned Reward Functions
Lev McKinney
Yawen Duan
David M. Krueger
Adam Gleave
28
20
0
09 Jan 2023
New Challenges in Reinforcement Learning: A Survey of Security and
  Privacy
New Challenges in Reinforcement Learning: A Survey of Security and Privacy
Yunjiao Lei
Dayong Ye
Sheng Shen
Yulei Sui
Tianqing Zhu
Wanlei Zhou
33
18
0
31 Dec 2022
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
David Zhang
Micah Carroll
Andreea Bobu
Anca Dragan
22
4
0
30 Nov 2022
123
Next