ResearchTrend.AI
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity


30 May 2024
Yan Yang
Bin Gao
Ya-xiang Yuan

Papers citing "Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity"

7 / 7 papers shown
LancBiO: dynamic Lanczos-aided bilevel optimization via Krylov subspace
Bin Gao
Yan Yang
Ya-xiang Yuan
34
2
0
04 Apr 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
32
14
0
10 Feb 2024
A framework for bilevel optimization that enables stochastic and global variance reduction algorithms
Mathieu Dagréou
Pierre Ablin
Samuel Vaiter
Thomas Moreau
129
95
0
31 Jan 2022
Amortized Implicit Differentiation for Stochastic Bilevel Optimization
Michael Arbel
Julien Mairal
92
58
0
29 Nov 2021
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
84
135
0
30 Jan 2021
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Luca Franceschi
P. Frasconi
Saverio Salzo
Riccardo Grazzi
Massimiliano Pontil
96
714
0
13 Jun 2018
Forward and Reverse Gradient-Based Hyperparameter Optimization
Luca Franceschi
Michele Donini
P. Frasconi
Massimiliano Pontil
112
370
0
06 Mar 2017