ResearchTrend.AI
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

30 September 2025
Zhepei Wei
X. J. Yang
Kai Sun
Jiaqi Wang
Rulin Shao
Sean Chen
Mohammad Kachuee
Teja Gollapudi
Tony Liao
Nicolas Scheffer
Rakesh Wanga
Anuj Kumar
Yu Meng
Wen-tau Yih
Xin Luna Dong
    HILM, LRM
ArXiv (abs) | PDF | HTML | HuggingFace (47 upvotes)

Papers citing "TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning"

1 / 1 papers shown
Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents
Yiding Wang
Zhepei Wei
Xinyu Zhu
Yu Meng
12
1
0
06 Oct 2025