ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.04784
  4. Cited By
Post-training Large Language Models for Diverse High-Quality Responses
v1v2 (latest)

Post-training Large Language Models for Diverse High-Quality Responses

5 September 2025
Yilei Chen
Souradip Chakraborty
Lorenz Wolf
I. Paschalidis
Aldo Pacchiano
ArXiv (abs)PDFHTML

Papers citing "Post-training Large Language Models for Diverse High-Quality Responses"

2 / 2 papers shown
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls
Dylan J. Foster
A. Krishnamurthy
Jordan T. Ash
140
1
0
13 Oct 2025
Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
Haoran He
Yuxiao Ye
Qingpeng Cai
Chen-Hao Hu
Binxing Jiao
Daxin Jiang
Ling Pan
OffRLLRM
115
1
0
29 Sep 2025
1