On Value Functions and the Agent-Environment Boundary

30 May 2019

Nan Jiang

OffRL

Papers citing "On Value Functions and the Agent-Environment Boundary"

17 / 17 papers shown

Real-World Reinforcement Learning of Active Perception Behaviors

225

01 Dec 2025

Selecting Belief-State Approximations in Simulators with Latent States

Nan Jiang

106

25 Nov 2025

Agency Is Frame-Dependent

...

398

06 Feb 2025

Three Dogmas of Reinforcement Learning

David Abel

Mark K. Ho

Anna Harutyunyan

357

15 Jul 2024

Neural Network Approximation for Pessimistic Offline Reinforcement Learning

Yuling Jiao

275

19 Dec 2023

Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability

Neural Information Processing Systems (NeurIPS), 2023

Hanlin Zhu

Amy Zhang

OffRL

295

07 Feb 2023

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

405

30 Jan 2023

Build generally reusable agent-environment interaction models

Jun Jin

Hongming Zhang

Jun Luo

136

13 Nov 2022

Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

International Conference on Learning Representations (ICLR), 2022

372

01 Nov 2022

Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward

IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022

243

13 Jun 2022

Jump-Start Reinforcement Learning

International Conference on Machine Learning (ICML), 2022

...

317

145

05 Apr 2022

Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning

International Conference on Machine Learning (ICML), 2021

184

25 Mar 2021

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021

753

314

22 Mar 2021

Towards Continual Reinforcement Learning: A Review and Perspectives

Journal of Artificial Intelligence Research (JAIR), 2020

559

378

25 Dec 2020

Batch Value-function Approximation with Only Realizability

International Conference on Machine Learning (ICML), 2020

Tengyang Xie

Nan Jiang

OffRL

643

128

11 Aug 2020

Bridging the Imitation Gap by Adaptive Insubordination

Neural Information Processing Systems (NeurIPS), 2020

338

23 Jul 2020

Minimax Weight and Q-Function Learning for Off-Policy Evaluation

International Conference on Machine Learning (ICML), 2019

428

195

28 Oct 2019