v1v2v3 (latest)

Robust Asymmetric Learning in POMDPs

International Conference on Machine Learning (ICML), 2020

31 December 2020

Papers citing "Robust Asymmetric Learning in POMDPs"

15 / 15 papers shown

To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable Reinforcement Learning

196

03 Oct 2025

Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access

357

30 Sep 2025

Multi-Agent Guided Policy Optimization

Yueheng Li

Guangming Xie

Zongqing Lu

269

24 Jul 2025

Sequential Decision Making with Expert Demonstrations under Unobserved HeterogeneityNeural Information Processing Systems (NeurIPS), 2024

Vahid Balazadeh Meresht

492

10 Apr 2024

Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains

462

09 Feb 2024

AgentMixer: Multi-Agent Correlated Policy FactorizationAAAI Conference on Artificial Intelligence (AAAI), 2024

329

16 Jan 2024

TGRL: An Algorithm for Teacher Guided Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023

303

06 Jul 2023

Informed POMDP: Leveraging Additional Information in Model-Based RL

Gaspard Lambrechts

Adrien Bolland

D. Ernst

381

20 Jun 2023

Learning in POMDPs is Sample-Efficient with Hindsight ObservabilityInternational Conference on Machine Learning (ICML), 2023

Jonathan Lee

Alekh Agarwal

Christoph Dann

Tong Zhang

364

31 Jan 2023

Leveraging Fully Observable Policies for Learning under Partial ObservabilityConference on Robot Learning (CoRL), 2022

318

03 Nov 2022

Improved Policy Optimization for Online Imitation Learning

325

29 Jul 2022

Hindsight Learning for MDPs with Exogenous InputsInternational Conference on Machine Learning (ICML), 2022

Sean R. Sinclair

Felipe Vieira Frujeri

332

13 Jul 2022

GridToPix: Training Embodied Agents with Minimal SupervisionIEEE International Conference on Computer Vision (ICCV), 2021

314

14 Apr 2021

Bridging the Imitation Gap by Adaptive InsubordinationNeural Information Processing Systems (NeurIPS), 2020

483

23 Jul 2020

Planning as Inference in Epidemiological ModelsFrontiers in Artificial Intelligence (FAI), 2020

Frank Wood

Andrew Warrington

Saeid Naderiparizi

Christian D. Weilbach

...

514

30 Mar 2020