Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.10701
Cited By
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
18 June 2020
Annie Xie
James Harrison
Chelsea Finn
CLL
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning amidst Lifelong Non-Stationarity"
18 / 18 papers shown
Title
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
34
47
0
06 Oct 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
48
0
0
04 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
42
6
0
24 Jan 2023
Building a Subspace of Policies for Scalable Continual Learning
Jean-Baptiste Gaya
T. Doan
Lucas Page-Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
29
29
0
18 Nov 2022
Time-Varying Propensity Score to Bridge the Gap between the Past and Present
Rasool Fakoor
Jonas W. Mueller
Zachary Chase Lipton
Pratik Chaudhari
Alexander J. Smola
OOD
AI4TS
32
3
0
04 Oct 2022
Dialogue Evaluation with Offline Reinforcement Learning
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Michael Heck
Shutong Feng
Milica Gavsić
OffRL
19
4
0
02 Sep 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Biwei Huang
Kun Zhang
Sara Magliacane
CML
OffRL
42
32
0
30 Mar 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma
Kelvin Xu
Nikhil Sardana
Abhishek Gupta
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
38
26
0
17 Dec 2021
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
35
8
0
13 Dec 2021
Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration
Wenhao Li
Qisen Xu
Chuyun Shen
Bin Hu
Fengping Zhu
Yuxin Li
Bo Jin
Xiangfeng Wang
30
5
0
15 Nov 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
32
52
0
26 Apr 2021
Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms
Ali Ghadirzadeh
Xi Chen
Petra Poklukar
Chelsea Finn
Mårten Björkman
Danica Kragic
BDL
31
41
0
05 Mar 2021
Learning Latent Representations to Influence Multi-Agent Interaction
Annie Xie
Dylan P. Losey
R. Tolsma
Chelsea Finn
Dorsa Sadigh
DRL
15
132
0
12 Nov 2020
Towards Safe Policy Improvement for Non-Stationary MDPs
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
OffRL
71
33
0
23 Oct 2020
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
11
65
0
17 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
329
11,681
0
09 Mar 2017
Quickest Change Detection Approach to Optimal Control in Markov Decision Processes with Model Changes
T. Banerjee
Miao Liu
Jonathan P. How
19
26
0
21 Sep 2016
1