arXiv:2509.04784
Post-training Large Language Models for Diverse High-Quality Responses
5 September 2025
Yilei Chen
Souradip Chakraborty
Lorenz Wolf
I. Paschalidis
Aldo Pacchiano
Papers citing "Post-training Large Language Models for Diverse High-Quality Responses"
2 / 2 papers shown
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls
Dylan J. Foster
A. Krishnamurthy
Jordan T. Ash
13 Oct 2025
Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
Haoran He
Yuxiao Ye
Qingpeng Cai
Chen-Hao Hu
Binxing Jiao
Daxin Jiang
Ling Pan
OffRL
LRM
29 Sep 2025