Exploring Activation Patterns of Parameters in Language Models

28 May 2024

Zhifang Sui

Papers citing "Exploring Activation Patterns of Parameters in Language Models"

2 / 2 papers shown

Title
The Internal State of an LLM Knows When It's Lying A. Azaria Tom Michael Mitchell HILM 216 297 0 26 Apr 2023
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations Atticus Geiger Zhengxuan Wu Christopher Potts Thomas F. Icard Noah D. Goodman CML 73 98 0 05 Mar 2023