Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.11944
Cited By
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
16 April 2025
Xuyang Chen
Guojian Wang
Keyu Yan
Lin Zhao
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning"
1 / 1 papers shown
Title
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
Xuyang Chen
Keyu Yan
Lin Zhao
OffRL
47
0
0
08 May 2025
1