ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.13341
  4. Cited By
On Value Functions and the Agent-Environment Boundary
v1v2v3 (latest)

On Value Functions and the Agent-Environment Boundary

30 May 2019
Nan Jiang
    OffRL
ArXiv (abs)PDFHTML

Papers citing "On Value Functions and the Agent-Environment Boundary"

17 / 17 papers shown
Real-World Reinforcement Learning of Active Perception Behaviors
E. Hu
Jie Wang
Xingfang Yuan
Fiona Luo
Muyao Li
Gaspard Lambrechts
Oleh Rybkin
Dinesh Jayaraman
OffRL
225
0
0
01 Dec 2025
Selecting Belief-State Approximations in Simulators with Latent States
Selecting Belief-State Approximations in Simulators with Latent States
Nan Jiang
106
0
0
25 Nov 2025
Agency Is Frame-Dependent
Agency Is Frame-Dependent
David Abel
André Barreto
Michael Bowling
Will Dabney
Shi Dong
...
Doina Precup
Jonathan Richens
Mark Rowland
Tom Schaul
Satinder Singh
398
3
0
06 Feb 2025
Three Dogmas of Reinforcement Learning
Three Dogmas of Reinforcement Learning
David Abel
Mark K. Ho
Anna Harutyunyan
357
11
0
15 Jul 2024
Neural Network Approximation for Pessimistic Offline Reinforcement
  Learning
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu
Yuling Jiao
Li Shen
Haizhao Yang
Xiliang Lu
OffRL
275
1
0
19 Dec 2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with
  General Function Approximation and Single-Policy Concentrability
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy ConcentrabilityNeural Information Processing Systems (NeurIPS), 2023
Hanlin Zhu
Amy Zhang
OffRL
295
5
0
07 Feb 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline
  Reinforcement Learning
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
405
20
0
30 Jan 2023
Build generally reusable agent-environment interaction models
Build generally reusable agent-environment interaction models
Jun Jin
Hongming Zhang
Jun Luo
136
0
0
13 Nov 2022
Optimal Conservative Offline RL with General Function Approximation via
  Augmented Lagrangian
Optimal Conservative Offline RL with General Function Approximation via Augmented LagrangianInternational Conference on Learning Representations (ICLR), 2022
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
372
33
0
01 Nov 2022
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise
  Reward
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise RewardIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Tengyu Xu
Yue Wang
Shaofeng Zou
Yingbin Liang
OffRL
243
15
0
13 Jun 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRLOnRL
317
145
0
05 Apr 2022
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Risk Bounds and Rademacher Complexity in Batch Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Yaqi Duan
Chi Jin
Zhiyuan Li
OffRL
184
52
0
25 Mar 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale
  of Pessimism
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of PessimismIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
753
314
0
22 Mar 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and PerspectivesJournal of Artificial Intelligence Research (JAIR), 2020
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLLOffRL
559
378
0
25 Dec 2020
Batch Value-function Approximation with Only Realizability
Batch Value-function Approximation with Only RealizabilityInternational Conference on Machine Learning (ICML), 2020
Tengyang Xie
Nan Jiang
OffRL
643
128
0
11 Aug 2020
Bridging the Imitation Gap by Adaptive Insubordination
Bridging the Imitation Gap by Adaptive InsubordinationNeural Information Processing Systems (NeurIPS), 2020
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
338
41
0
23 Jul 2020
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Minimax Weight and Q-Function Learning for Off-Policy EvaluationInternational Conference on Machine Learning (ICML), 2019
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
428
195
0
28 Oct 2019
1
Page 1 of 1