Robust Asymmetric Learning in POMDPs

International Conference on Machine Learning (ICML), 2020
31 December 2020
Andrew Warrington
J. Lavington
Adam Scibior
Mark Schmidt
Frank Wood

Papers citing "Robust Asymmetric Learning in POMDPs"

15 / 15 papers shown
To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable Reinforcement Learning
Yuda Song
Dhruv Rohatgi
Aarti Singh
J. Andrew Bagnell
195
2
0
03 Oct 2025
Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access
Daniel Ebi
Gaspard Lambrechts
D. Ernst
Klemens Böhm
OffRL
356
0
0
30 Sep 2025
Multi-Agent Guided Policy Optimization
Yueheng Li
Guangming Xie
Zongqing Lu
262
2
0
24 Jul 2025
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Neural Information Processing Systems (NeurIPS), 2024
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Fahad Razak
Vasilis Syrgkanis
489
2
0
10 Apr 2024
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains
Feiyang Wu
Xavier Nal
Jaehwi Jang
Wei Zhu
Zhaoyuan Gu
Anqi Wu
Ye Zhao
458
0
0
09 Feb 2024
AgentMixer: Multi-Agent Correlated Policy Factorization
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zhiyuan Li
Wenshuai Zhao
Lijun Wu
Joni Pajarinen
OffRL
326
5
0
16 Jan 2024
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
296
22
0
06 Jul 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
379
13
0
20 Jun 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
International Conference on Machine Learning (ICML), 2023
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
360
25
0
31 Jan 2023
Leveraging Fully Observable Policies for Learning under Partial Observability
Conference on Robot Learning (CoRL), 2022
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert Platt
OffRL
311
28
0
03 Nov 2022
Improved Policy Optimization for Online Imitation Learning
J. Lavington
Sharan Vaswani
Mark Schmidt
OffRL
316
7
0
29 Jul 2022
Hindsight Learning for MDPs with Exogenous Inputs
International Conference on Machine Learning (ICML), 2022
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
331
31
0
13 Jul 2022
GridToPix: Training Embodied Agents with Minimal Supervision
IEEE International Conference on Computer Vision (ICCV), 2021
Unnat Jain
Iou-Jen Liu
Svetlana Lazebnik
Aniruddha Kembhavi
Luca Weihs
Alex Schwing
313
24
0
14 Apr 2021
Bridging the Imitation Gap by Adaptive Insubordination
Neural Information Processing Systems (NeurIPS), 2020
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
474
41
0
23 Jul 2020
Planning as Inference in Epidemiological Models
Frontiers in Artificial Intelligence (FAI), 2020
Frank Wood
Andrew Warrington
Saeid Naderiparizi
Christian D. Weilbach
Vaden Masrani
...
Adam Scibior
Boyan Beronov
John Grefenstette
Duncan Campbell
Alireza Nasseri
499
6
0
30 Mar 2020