ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.03498
  4. Cited By
Identifiability in inverse reinforcement learning
v1v2v3 (latest)

Identifiability in inverse reinforcement learning

Neural Information Processing Systems (NeurIPS), 2021
7 June 2021
Haoyang Cao
Samuel N. Cohen
Lukasz Szpruch
ArXiv (abs)PDFHTML

Papers citing "Identifiability in inverse reinforcement learning"

41 / 41 papers shown
Blind Inverse Game Theory: Jointly Decoding Rewards and Rationality in Entropy-Regularized Competitive Games
Blind Inverse Game Theory: Jointly Decoding Rewards and Rationality in Entropy-Regularized Competitive Games
Hamza Virk
Sandro Amaglobeli
Zuhayr Syed
135
0
0
07 Nov 2025
Generalizing Behavior via Inverse Reinforcement Learning with Closed-Form Reward Centroids
Generalizing Behavior via Inverse Reinforcement Learning with Closed-Form Reward Centroids
Filippo Lazzati
Alberto Maria Metelli
142
0
0
15 Sep 2025
Inference of Altruism and Intrinsic Rewards in Multi-Agent Systems
Inference of Altruism and Intrinsic Rewards in Multi-Agent Systems
Victor Villin
Christos Dimitrakakis
FaML
396
0
0
09 Sep 2025
Icon$^{2}$: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation
Icon2^{2}2: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation
Qiyuan Chen
Hongsen Huang
Qian Shao
Jiahe Chen
Jintai Chen
H. Xu
Renjie Hua
Ren Chuan
Jian Wu
142
0
0
06 Sep 2025
Efficient Reward Identification In Max Entropy Reinforcement Learning with Sparsity and Rank Priors
Efficient Reward Identification In Max Entropy Reinforcement Learning with Sparsity and Rank Priors
Mohamad Louai Shehab
Alperen Tercan
Necmiye Ozay
107
0
0
10 Aug 2025
Model-Based Soft Maximization of Suitable Metrics of Long-Term Human Power
Model-Based Soft Maximization of Suitable Metrics of Long-Term Human Power
Jobst Heitzig
Ram Potham
181
0
0
31 Jul 2025
Where You Go is Who You Are: Behavioral Theory-Guided LLMs for Inverse Reinforcement Learning
Where You Go is Who You Are: Behavioral Theory-Guided LLMs for Inverse Reinforcement Learning
Yuran Sun
Susu Xu
Chenguang Wang
Xilei Zhao
199
0
0
22 May 2025
Agency Is Frame-Dependent
Agency Is Frame-Dependent
David Abel
André Barreto
Michael Bowling
Will Dabney
Shi Dong
...
Doina Precup
Jonathan Richens
Mark Rowland
Tom Schaul
Satinder Singh
424
5
0
06 Feb 2025
Robustness in the Face of Partial Identifiability in Reward Learning
Robustness in the Face of Partial Identifiability in Reward Learning
Filippo Lazzati
Alberto Maria Metelli
272
1
0
10 Jan 2025
Bootstrapped Reward Shaping
Bootstrapped Reward ShapingAAAI Conference on Artificial Intelligence (AAAI), 2025
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OffRL
279
6
0
02 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
289
0
0
31 Dec 2024
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task
  Alignment
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task AlignmentNeural Information Processing Systems (NeurIPS), 2024
Weichao Zhou
Wenchao Li
269
2
0
31 Oct 2024
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning
Jared Joselowitz
Ritam Majumdar
Arjun Jagota
Matthieu Bou
Nyal Patel
Satyapriya Krishna
Sonali Parbhoo
366
0
0
16 Oct 2024
Inverse Reinforcement Learning with Multiple Planning Horizons
Inverse Reinforcement Learning with Multiple Planning Horizons
Jiayu Yao
Weiwei Pan
Finale Doshi-Velez
Barbara E. Engelhardt
255
0
0
26 Sep 2024
Boosting Soft Q-Learning by Bounding
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
293
4
0
26 Jun 2024
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
Andreas Schlaginhaufen
Maryam Kamgarpour
OffRL
369
6
0
03 Jun 2024
Inference of Utilities and Time Preference in Sequential Decision-Making
Inference of Utilities and Time Preference in Sequential Decision-Making
Haoyang Cao
Zhengqi Wu
Renyuan Xu
235
1
0
24 May 2024
Randomized algorithms and PAC bounds for inverse reinforcement learning
  in continuous spaces
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
Angeliki Kamoutsi
Peter Schmitt-Förster
Tobias Sutter
Volkan Cevher
John Lygeros
252
0
0
24 May 2024
Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable
  AI Systems
Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems
David Dalrymple
Joar Skalse
Yoshua Bengio
Stuart J. Russell
Max Tegmark
...
Clark Barrett
Ding Zhao
Zhi-Xuan Tan
Jeannette Wing
Joshua Tenenbaum
401
106
0
10 May 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation,
  Transferable Reward Recovery and Algebraic Equilibrium Proof
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
377
0
0
21 Mar 2024
Quantifying the Sensitivity of Inverse Reinforcement Learning to
  Misspecification
Quantifying the Sensitivity of Inverse Reinforcement Learning to MisspecificationInternational Conference on Learning Representations (ICLR), 2024
Joar Skalse
Alessandro Abate
218
5
0
11 Mar 2024
Toward Computationally Efficient Inverse Reinforcement Learning via
  Reward Shaping
Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping
Lauren H. Cooke
Harvey Klyne
Edwin Zhang
Cassidy Laidlaw
Milind Tambe
Finale Doshi-Velez
412
2
0
15 Dec 2023
FoMo Rewards: Can we cast foundation models as reward functions?
FoMo Rewards: Can we cast foundation models as reward functions?
Ekdeep Singh Lubana
Johann Brehmer
P. D. Haan
Taco S. Cohen
OffRLLRM
302
5
0
06 Dec 2023
Eliciting Risk Aversion with Inverse Reinforcement Learning via Interactive Questioning
Eliciting Risk Aversion with Inverse Reinforcement Learning via Interactive Questioning
Ziteng Cheng
Anthony Coache
S. Jaimungal
227
1
0
16 Aug 2023
PAGAR: Taming Reward Misalignment in Inverse Reinforcement
  Learning-Based Imitation Learning with Protagonist Antagonist Guided
  Adversarial Reward
PAGAR: Taming Reward Misalignment in Inverse Reinforcement Learning-Based Imitation Learning with Protagonist Antagonist Guided Adversarial Reward
Weichao Zhou
Wenchao Li
344
0
0
02 Jun 2023
Identifiability and Generalizability in Constrained Inverse
  Reinforcement Learning
Identifiability and Generalizability in Constrained Inverse Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Andreas Schlaginhaufen
Maryam Kamgarpour
321
18
0
01 Jun 2023
Coherent Soft Imitation Learning
Coherent Soft Imitation LearningNeural Information Processing Systems (NeurIPS), 2023
Joe Watson
Sandy H. Huang
Nicholas Heess
311
20
0
25 May 2023
K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent
  State-Action Pairs
K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action PairsInternational Conference on Machine Learning (ICML), 2023
Andrea Coletta
Svitlana Vyetrenko
T. Balch
OffRL
371
9
0
23 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided
  Bounds on the Value Function
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
189
0
0
19 Feb 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
325
26
0
15 Feb 2023
On the Sensitivity of Reward Inference to Misspecified Human Models
On the Sensitivity of Reward Inference to Misspecified Human ModelsInternational Conference on Learning Representations (ICLR), 2022
Joey Hong
Kush S. Bhatia
Anca Dragan
196
28
0
09 Dec 2022
Misspecification in Inverse Reinforcement Learning
Misspecification in Inverse Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Joar Skalse
Alessandro Abate
320
29
0
06 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in
  Entropy-Regularized Reinforcement Learning
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
215
13
0
02 Dec 2022
Environment Design for Inverse Reinforcement Learning
Environment Design for Inverse Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Thomas Kleine Buening
Victor Villin
Christos Dimitrakakis
395
5
0
26 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time GuaranteesNeural Information Processing Systems (NeurIPS), 2022
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
420
51
0
04 Oct 2022
Identifiability and generalizability from multiple experts in Inverse
  Reinforcement Learning
Identifiability and generalizability from multiple experts in Inverse Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Paul Rolland
Luca Viano
Norman Schuerhoff
Boris Nikolov
Volkan Cevher
OffRL
344
19
0
22 Sep 2022
Calculus on MDPs: Potential Shaping as a Gradient
Calculus on MDPs: Potential Shaping as a Gradient
Erik Jenner
H. V. Hoof
Adam Gleave
239
4
0
20 Aug 2022
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Fan Luo
Xingchen Cao
Rong-Jun Qin
Yang Yu
371
4
0
01 Jun 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Adam Gleave
Sam Toyer
205
18
0
22 Mar 2022
Invariance in Policy Optimisation and Partial Identifiability in Reward
  Learning
Invariance in Policy Optimisation and Partial Identifiability in Reward LearningInternational Conference on Machine Learning (ICML), 2022
Joar Skalse
Matthew Farrugia-Roberts
Stuart J. Russell
Alessandro Abate
Adam Gleave
312
55
0
14 Mar 2022
Necessary and Sufficient Conditions for Inverse Reinforcement Learning
  of Bayesian Stopping Time Problems
Necessary and Sufficient Conditions for Inverse Reinforcement Learning of Bayesian Stopping Time Problems
Kunal Pattanayak
Vikram Krishnamurthy
OffRL
549
4
0
07 Jul 2020
1
Page 1 of 1