ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.00810
  4. Cited By
Offline Reinforcement Learning with Differential Privacy

Offline Reinforcement Learning with Differential Privacy

2 June 2022
Dan Qiao
Yu-Xiang Wang
    OffRL
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning with Differential Privacy"

15 / 15 papers shown
Title
Towards Optimal Differentially Private Regret Bounds in Linear MDPs
Towards Optimal Differentially Private Regret Bounds in Linear MDPs
Sharan Sahu
50
0
0
12 Apr 2025
Preserving Expert-Level Privacy in Offline Reinforcement Learning
Preserving Expert-Level Privacy in Offline Reinforcement Learning
Navodita Sharma
Vishnu Vinod
Abhradeep Thakurta
Alekh Agarwal
Borja Balle
Christoph Dann
A. Raghuveer
OffRL
72
0
0
18 Nov 2024
Offline Behavior Distillation
Offline Behavior Distillation
Shiye Lei
Sen Zhang
Dacheng Tao
OffRL
31
0
0
30 Oct 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization
  under Preference Drift
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son
William Bankes
Sayak Ray Chowdhury
Brooks Paige
Ilija Bogunovic
32
4
0
26 Jul 2024
Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization
  by Large Step Sizes
Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes
Dan Qiao
Kaiqi Zhang
Esha Singh
Daniel Soudry
Yu-Xiang Wang
NoLa
31
3
0
10 Jun 2024
Provably Robust DPO: Aligning Language Models with Noisy Feedback
Provably Robust DPO: Aligning Language Models with Noisy Feedback
Sayak Ray Chowdhury
Anush Kini
Nagarajan Natarajan
22
54
0
01 Mar 2024
Privately Aligning Language Models with Reinforcement Learning
Privately Aligning Language Models with Reinforcement Learning
Fan Wu
Huseyin A. Inan
A. Backurs
Varun Chandrasekaran
Janardhan Kulkarni
Robert Sim
17
6
0
25 Oct 2023
Offline Policy Evaluation for Reinforcement Learning with Adaptively
  Collected Data
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data
Sunil Madhow
Dan Xiao
Ming Yin
Yu-Xiang Wang
OffRL
15
0
0
24 Jun 2023
Differential Privacy in Cooperative Multiagent Planning
Differential Privacy in Cooperative Multiagent Planning
Bo Chen
C. Hawkins
Mustafa O. Karabag
Cyrus Neary
Matthew T. Hale
Ufuk Topcu
13
8
0
20 Jan 2023
Near-Optimal Differentially Private Reinforcement Learning
Near-Optimal Differentially Private Reinforcement Learning
Dan Qiao
Yu-Xiang Wang
17
13
0
09 Dec 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning
  with Linear Function Approximation
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
Dan Qiao
Yu-Xiang Wang
OffRL
61
13
0
03 Oct 2022
Doubly Fair Dynamic Pricing
Doubly Fair Dynamic Pricing
Jianyu Xu
Dan Qiao
Yu-Xiang Wang
11
8
0
23 Sep 2022
Privacy-Constrained Policies via Mutual Information Regularized Policy
  Gradients
Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
Chris Cundy
Rishi Desai
Stefano Ermon
OffRL
20
4
0
30 Dec 2020
When is Memorization of Irrelevant Training Data Necessary for
  High-Accuracy Learning?
When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?
Gavin Brown
Mark Bun
Vitaly Feldman
Adam D. Smith
Kunal Talwar
245
80
0
11 Dec 2020
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
104
194
0
07 Feb 2020
1