ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.09080
  4. Cited By
Average-Reward Soft Actor-Critic
v1v2 (latest)

Average-Reward Soft Actor-Critic

15 January 2025
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
    OOD
ArXiv (abs)PDFHTMLGithub

Papers citing "Average-Reward Soft Actor-Critic"

1 / 1 papers shown
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Abdullah Vanlioglu
381
12
0
28 Mar 2025
1
Page 1 of 1