arXiv:2505.18407
KL-regularization Itself is Differentially Private in Bandits and RLHF

Versions: v1, v2 (latest)

23 May 2025

Kishan Panaganti

Papers citing "KL-regularization Itself is Differentially Private in Bandits and RLHF"

4 / 4 papers shown

Offline and Online KL-Regularized RLHF under Differential Privacy

Praneeth Vepakomma

Francesco Orabona

117

0

0

15 Oct 2025

Towards User-level Private Reinforcement Learning with Human Feedback

258

6

0

22 Feb 2025

Differentially Private Policy Gradient

275

2

0

31 Jan 2025

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

556

13

0

07 Nov 2024