ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.05294
  4. Cited By
Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations

Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations

7 April 2025
Pedro Ferreira
Wilker Aziz
Ivan Titov
    LRM
ArXivPDFHTML

Papers citing "Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations"

Title
No papers