Average-Reward Off-Policy Policy Evaluation with Function Approximation

International Conference on Machine Learning (ICML), 2021

8 January 2021

Yi Wan

Papers citing "Average-Reward Off-Policy Policy Evaluation with Function Approximation"

Hardware-Software Collaborative Computing of Photonic Spiking Reinforcement Learning for Robotic Continuous Control

29 Nov 2025

Towards Formalizing Reinforcement Learning Theory

Shangtong Zhang

120

05 Nov 2025

Non-iid hypothesis testing: from classical to quantum

07 Oct 2025

Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features

373

27 May 2025

Towards Optimal Offline Reinforcement Learning

319

15 Mar 2025

Average Reward Reinforcement Learning for Wireless Radio Resource Management

Asilomar Conference on Signals, Systems and Computers (ACSSC), 2024

Kun Yang

Jing Yang

Cong Shen

219

12 Jan 2025

Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes

Juan Sebastian Rojas

Chi-Guhn Lee

279

14 Oct 2024

RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning

International Conference on Machine Learning (ICML), 2024

Yukinari Hisaki

Isao Ono

166

04 Aug 2024

e-COP : Episodic Constrained Optimization of Policies

Rahul Jain

232

13 Jun 2024

Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning

803

22 Feb 2024

Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation

Neural Information Processing Systems (NeurIPS), 2023

Efstathia Soufleri

Jian Li

237

03 Oct 2023

Infer and Adapt: Bipedal Locomotion Reward Learning from Demonstrations via Inverse Reinforcement Learning

IEEE International Conference on Robotics and Automation (ICRA), 2023

Zhaoyuan Gu

230

28 Sep 2023

A new Gradient TD Algorithm with only One Step-size: Convergence Rate Analysis using L-λ Smoothness

Hengshuai Yao

285

29 Jul 2023

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces

International Conference on Machine Learning (ICML), 2023

359

02 Jun 2023

Model-Free Robust Average-Reward Reinforcement Learning

International Conference on Machine Learning (ICML), 2023

Yue Wang

Alvaro Velasquez

George Atia

Ashley Prater-Bennette

Shaofeng Zou

210

17 May 2023

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms

Neural Information Processing Systems (NeurIPS), 2023

241

02 Feb 2023

ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints

International Conference on Machine Learning (ICML), 2023

Akhil Agnihotri

R. Jain

Haipeng Luo

560

02 Feb 2023

Markovian Interference in Experiments

Neural Information Processing Systems (NeurIPS), 2022

Vivek F. Farias

163

06 Jun 2022

Stochastic first-order methods for average-reward Markov decision processes

Mathematics of Operations Research (MOR), 2022

Tianjiao Li

Feiyang Wu

Guanghui Lan

493

11 May 2022

Average-Reward Learning and Planning with Options

Yi Wan

A. Naik

R. Sutton

26 Oct 2021

Average-Reward Reinforcement Learning with Trust Region Methods

International Joint Conference on Artificial Intelligence (IJCAI), 2021

207

07 Jun 2021

Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings

Neural Information Processing Systems (NeurIPS), 2021

Ming Yin

Yu Wang

OffRL

277

13 May 2021

Breaking the Deadly Triad with a Target Network

International Conference on Machine Learning (ICML), 2021

734

21 Jan 2021