Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2509.25760
Cited By
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
30 September 2025
Zhepei Wei
X. J. Yang
Kai Sun
Jiaqi Wang
Rulin Shao
Sean Chen
Mohammad Kachuee
Teja Gollapudi
Tony Liao
Nicolas Scheffer
Rakesh Wanga
Anuj Kumar
Yu Meng
Wen-tau Yih
Xin Luna Dong
HILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (47 upvotes)
Papers citing
"TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning"
1 / 1 papers shown
Title
Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents
Yiding Wang
Zhepei Wei
Xinyu Zhu
Yu Meng
12
1
0
06 Oct 2025
1