Imitation Learning by Reinforcement Learning

International Conference on Learning Representations (ICLR), 2021
10 August 2021
K. Ciosek

Papers citing "Imitation Learning by Reinforcement Learning"

16 / 16 papers shown
When Greedy Wins: Emergent Exploitation Bias in Meta-Bandit LLM Training
Sanxing Chen
Xiaoyin Chen
Yukun Huang
Roy Xie
Bhuwan Dhingra
129
2
0
29 Sep 2025
Social Cooperation in Conversational AI Agents
M. Çelikok
Saptarashmi Bandyopadhyay
R. Loftin
162
0
0
02 Jun 2025
Reinforcement Learning for Hanabi
Nina Cohen
Kordel K. France
71
0
0
31 May 2025
Imitation Learning of Correlated Policies in Stackelberg Games
Kuang-Da Wang
Ping-Chun Hsieh
Chao-Han Huck Yang
529
0
0
11 Mar 2025
Robot See, Robot Do: Imitation Reward for Noisy Financial Environments
BigData Congress [Services Society] (BSS), 2024
Sven Goluža
Tomislav Kovačević
Stjepan Begušić
Z. Kostanjčar
243
0
0
13 Nov 2024
On the Complexity of Learning to Cooperate with Populations of Socially Rational Agents
R. Loftin
Saptarashmi Bandyopadhyay
M. Çelikok
276
1
0
29 Jun 2024
Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning for Digital Twins
Eslam Eldeeb
Houssem Sifaou
Osvaldo Simeone
M. Shehab
Hirley Alves
OffRL
314
10
0
13 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
International Conference on Learning Representations (ICLR), 2024
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
358
16
0
06 Feb 2024
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Chia-Cheng Chiang
Li-Cheng Lan
Wei-Fang Sun
Chien Feng
Cho-Jui Hsieh
Chun-Yi Lee
441
0
0
01 Feb 2024
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Xiong-Hui Chen
Junyin Ye
Hang Zhao
Yi-Chen Li
Haoran Shi
...
Si-Hang Yang
Anqi Huang
Kai Xu
Zongzhang Zhang
Yang Yu
256
0
0
09 Oct 2023
MiniLLM: Knowledge Distillation of Large Language Models
International Conference on Learning Representations (ICLR), 2023
Yuxian Gu
Li Dong
Furu Wei
Shiyu Huang
ALM
732
94
0
14 Jun 2023
A Strong Baseline for Batch Imitation Learning
Matthew Smith
Lucas Maystre
Zhenwen Dai
K. Ciosek
OffRL
185
5
0
06 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
369
25
0
06 Feb 2023
Backward Curriculum Reinforcement Learning
IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2022
Kyungmin Ko
OnRL
331
0
0
29 Dec 2022
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Shunyu Liu
Kaixuan Chen
Na Yu
Mingli Song
Zunlei Feng
Weilong Dai
378
2
0
05 Jul 2022
Accelerated Continuous-Time Approximate Dynamic Programming via Data-Assisted Hybrid Control
IFAC-PapersOnLine, 2022
Daniel E. Ochoa
J. Poveda
132
4
0
27 Apr 2022