ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.10701
  4. Cited By
Deep Reinforcement Learning amidst Lifelong Non-Stationarity

Deep Reinforcement Learning amidst Lifelong Non-Stationarity

18 June 2020
Annie Xie
James Harrison
Chelsea Finn
    CLL
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning amidst Lifelong Non-Stationarity"

18 / 18 papers shown
Title
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
34
47
0
06 Oct 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
48
0
0
04 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
42
6
0
24 Jan 2023
Building a Subspace of Policies for Scalable Continual Learning
Building a Subspace of Policies for Scalable Continual Learning
Jean-Baptiste Gaya
T. Doan
Lucas Page-Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
29
29
0
18 Nov 2022
Time-Varying Propensity Score to Bridge the Gap between the Past and
  Present
Time-Varying Propensity Score to Bridge the Gap between the Past and Present
Rasool Fakoor
Jonas W. Mueller
Zachary Chase Lipton
Pratik Chaudhari
Alexander J. Smola
OOD
AI4TS
32
3
0
04 Oct 2022
Dialogue Evaluation with Offline Reinforcement Learning
Dialogue Evaluation with Offline Reinforcement Learning
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Michael Heck
Shutong Feng
Milica Gavsić
OffRL
19
4
0
02 Sep 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Biwei Huang
Kun Zhang
Sara Magliacane
CML
OffRL
42
32
0
30 Mar 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma
Kelvin Xu
Nikhil Sardana
Abhishek Gupta
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
38
26
0
17 Dec 2021
Continual Learning In Environments With Polynomial Mixing Times
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
35
8
0
13 Dec 2021
Interactive Medical Image Segmentation with Self-Adaptive Confidence
  Calibration
Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration
Wenhao Li
Qisen Xu
Chuyun Shen
Bin Hu
Fengping Zhu
Yuxin Li
Bo Jin
Xiangfeng Wang
30
5
0
15 Nov 2021
Evaluating the progress of Deep Reinforcement Learning in the real
  world: aligning domain-agnostic and domain-specific research
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Universal Off-Policy Evaluation
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
32
52
0
26 Apr 2021
Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic
  Platforms
Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms
Ali Ghadirzadeh
Xi Chen
Petra Poklukar
Chelsea Finn
Mårten Björkman
Danica Kragic
BDL
31
41
0
05 Mar 2021
Learning Latent Representations to Influence Multi-Agent Interaction
Learning Latent Representations to Influence Multi-Agent Interaction
Annie Xie
Dylan P. Losey
R. Tolsma
Chelsea Finn
Dorsa Sadigh
DRL
15
132
0
12 Nov 2020
Towards Safe Policy Improvement for Non-Stationary MDPs
Towards Safe Policy Improvement for Non-Stationary MDPs
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
OffRL
71
33
0
23 Oct 2020
Optimizing for the Future in Non-Stationary MDPs
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
11
65
0
17 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
329
11,681
0
09 Mar 2017
Quickest Change Detection Approach to Optimal Control in Markov Decision
  Processes with Model Changes
Quickest Change Detection Approach to Optimal Control in Markov Decision Processes with Model Changes
T. Banerjee
Miao Liu
Jonathan P. How
19
26
0
21 Sep 2016
1