Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2507.23221
Cited By
A Single Direction of Truth: An Observer Model's Linear Residual Probe Exposes and Steers Contextual Hallucinations
31 July 2025
Charles OÑeill
Slava Chalnev
Chi Chi Zhao
Max Kirkby
Mudith Jayasekara
HILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Single Direction of Truth: An Observer Model's Linear Residual Probe Exposes and Steers Contextual Hallucinations"
4 / 4 papers shown
When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs
Shaowen Wang
Yiqi Dong
Ruinian Chang
Tansheng Zhu
Yuebo Sun
Kaifeng Lyu
Jian Li
HILM
323
0
0
10 Nov 2025
Weak Form Learning for Mean-Field Partial Differential Equations: an Application to Insect Movement
Seth Minor
Bret D. Elderd
Benjamin Van Allen
David M. Bortz
Vanja M. Dukic
120
0
0
09 Oct 2025
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
Sasha Cui
Zhongren Chen
LLMSV
238
1
0
25 Sep 2025
Beyond Transcription: Mechanistic Interpretability in ASR
Neta Glazer
Yael Segal-Feldman
Hilit Segev
Aviv Shamsian
Asaf Buchnick
Gill Hetz
Ethan Fetaya
Joseph Keshet
Aviv Navon
96
0
0
21 Aug 2025
1