Learning to Constrain Policy Optimization with Virtual Trust Region

20 April 2022
Hung Le
Thommen Karimpanal George
Majid Abdolshah
D. Nguyen
Kien Do
Sunil R. Gupta
Svetha Venkatesh

Papers citing "Learning to Constrain Policy Optimization with Virtual Trust Region"

3 / 3 papers shown
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
H. Le
Kien Do
D. Nguyen
Sunil Gupta
Svetha Venkatesh
30
0
0
14 Oct 2024
Multi-Reference Preference Optimization for Large Language Models
Hung Le
Quan Tran
D. Nguyen
Kien Do
Saloni Mittal
Kelechi Ogueji
Svetha Venkatesh
55
0
0
26 May 2024
Beyond Surprise: Improving Exploration Through Surprise Novelty
Hung Le
Kien Do
D. Nguyen
Svetha Venkatesh
14
2
0
09 Aug 2023