Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning
Zhongzhi Chen, Xingwu Sun, Xianfeng Jiao, Fengzong Lian, Zhanhui Kang, Di Wang, Cheng-zhong Xu
AAAI Conference on Artificial Intelligence (AAAI), 2023 · 29 December 2023 · arXiv:2312.17484 · HILM

Papers citing "Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning" (27 papers)

TraceDet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models
Shenxu Chang, Junchi Yu, Weixing Wang, Yongqiang Chen, Jialin Yu, Philip Torr, Jindong Gu
30 Sep 2025 · HILM

Steering When Necessary: Flexible Steering Large Language Models with Backtracking
Jinwei Gan, Zifeng Cheng, Zhiwei Jiang, Cong Wang, Yafeng Yin, Xiang Luo, Yuchen Fu, Qing Gu
25 Aug 2025 · KELM, LLMSV

Expanding before Inferring: Enhancing Factuality in Large Language Models through Premature Layers Interpolation
Dingwei Chen, Ziqiang Liu, Feiteng Fang, Chak Tou Leong, Shiwen Ni, A. Argha, Hamid Alinejad-Rokny, Min Yang, Chengming Li
03 Jun 2025 · KELM, HILM

ExpertSteer: Intervening in LLMs through Expert Knowledge
Weixuan Wang, Minghao Wu, Barry Haddow, Alexandra Birch
18 May 2025 · LLMSV

The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances
Yining Wang, Longji Xu, Xi Li, Mi Zhang, Geng Hong, Min Yang
01 Apr 2025 · AAML, HILM

Personalized Text Generation with Contrastive Activation Steering
Jinghao Zhang, Yi Liu, Wenjie Wang, Sihan Yang, Shu Wu, Liang Wang, Tat-Seng Chua
Annual Meeting of the Association for Computational Linguistics (ACL), 2025 · 07 Mar 2025 · LLMSV

DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models
Y. Guo, Yuchen Yang, Zhe Chen, Pingjie Wang, Yusheng Liao, Yujiao Shi, Yanfeng Wang, Yu Wang
05 Mar 2025 · HILM

SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs
Samir Abdaljalil, Filippo Pallucchini, Andrea Seveso, Hasan Kurban, Fabio Mercorio, Erchin Serpedin
04 Mar 2025 · HILM

Steer LLM Latents for Hallucination Detection
Seongheon Park, Xuefeng Du, Min-Hsuan Yeh, Haobo Wang, Yixuan Li
01 Mar 2025 · LLMSV

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Jinjie Wei, Dongling Xiao, Mingcheng Li, Zhaoyu Chen, Ke Li, Li Zhang
AAAI Conference on Artificial Intelligence (AAAI), 2024 · 28 Jan 2025 · HILM

Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis
Po-Hsuan Huang, Jeng-Lin Li, Chin-Po Chen, Ming-Ching Chang, Wei-Chao Chen
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024 · 04 Dec 2024 · LRM

Distinguishing Ignorance from Error in LLM Hallucinations
Adi Simhi, Jonathan Herzig, Idan Szpektor, Yonatan Belinkov
29 Oct 2024 · HILM

Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors
Weixuan Wang, J. Yang, Wei Peng
International Conference on Learning Representations (ICLR), 2024 · 16 Oct 2024 · LLMSV

NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models
Zheng Yi Ho, Yaning Tan, Sen Zhang, Yibing Zhan, Dacheng Tao
International Conference on Learning Representations (ICLR), 2024 · 11 Oct 2024

Lower Layers Matter: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
Dingwei Chen, Feiteng Fang, Shiwen Ni, Feng Liang, Xiping Hu, A. Argha, Hamid Alinejad-Rokny, Min Yang, Chengming Li
16 Aug 2024 · HILM

Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang, Chenyang Xi, Zifan Zheng, Ding Chen, Qingchen Yu, ..., Rong-Hua Li, Peng Cheng, Zhonghao Wang, Feiyu Xiong, Zhiyu Li
19 Jul 2024 · HILM, LRM

Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass
09 Jul 2024 · HILM

Mitigating Large Language Model Hallucination with Faithful Finetuning
Minda Hu, Bowei He, Yufei Wang, Liangyou Li, Chen Ma, Irwin King
17 Jun 2024 · HILM

PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration
Huiping Zhuang, Jianwei Wang, Zhengdong Lu, Haoran Li, Cen Chen
03 Jun 2024 · RALM, KELM

Spectral Editing of Activations for Large Language Model Alignment
Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo Ponti, Shay B. Cohen
Neural Information Processing Systems (NeurIPS), 2024 · 15 May 2024 · KELM, LLMSV

Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression
Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, Lu Wang
01 May 2024 · HILM, KELM

Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs
Adi Simhi, Jonathan Herzig, Idan Szpektor, Yonatan Belinkov
15 Apr 2024 · HILM

Non-Linear Inference Time Intervention: Improving LLM Truthfulness
Jakub Hoscilowicz, Adam Wiacek, Jan Chojnacki, Adam Cieślak, Leszek Michon, Vitalii Urbanevych, Artur Janicki
27 Mar 2024 · KELM

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Zhiying Zhu, Yiming Yang, Zhiqing Sun
07 Mar 2024 · HILM, VLM

TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space
Shaolei Zhang, Tian Yu, Yang Feng
27 Feb 2024 · HILM, KELM

GRATH: Gradual Self-Truthifying for Large Language Models
Weixin Chen, Basel Alomair, Yue Liu
International Conference on Machine Learning (ICML), 2024 · 22 Jan 2024 · HILM, SyDa

Zero-Resource Hallucination Prevention for Large Language Models
Junyu Luo, Cao Xiao, Fenglong Ma
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 · 06 Sep 2023 · HILM