Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
14 February 2024
Chenlu Ye
Jiafan He
Quanquan Gu
Tong Zhang

Papers citing "Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption"

8 / 8 papers shown
RobustVLA: Robustness-Aware Reinforcement Post-Training for Vision-Language-Action Models
Hongyin Zhang
Shuo Zhang
Junxi Jin
Qixin Zeng
Runze Li
Donglin Wang
VLM
499
4
0
03 Nov 2025
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Longxiang He
Deheng Ye
Junbo Tan
Xueqian Wang
Li Shen
OnRL
399
0
0
29 Sep 2025
ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning
Debamita Ghosh
George Atia
Yue Wang
OffRL, OOD
447
3
0
05 Aug 2025
Daunce: Data Attribution through Uncertainty Estimation
Xingyuan Pan
Chenlu Ye
Joseph Melkonian
Jiaqi W. Ma
Tong Zhang
TDI, UQCV
219
2
0
29 May 2025
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards
Chenlu Ye
Yujia Jin
Alekh Agarwal
Tong Zhang
494
1
0
04 Feb 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
International Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
445
51
0
31 Dec 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
702
19
0
07 Nov 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Neural Information Processing Systems (NeurIPS), 2024
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL, OOD
295
23
0
04 Apr 2024