Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.17799
Cited By
Exploring Activation Patterns of Parameters in Language Models
28 May 2024
Yudong Wang
Damai Dai
Zhifang Sui
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring Activation Patterns of Parameters in Language Models"
2 / 2 papers shown
Title
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
216
297
0
26 Apr 2023
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
Atticus Geiger
Zhengxuan Wu
Christopher Potts
Thomas F. Icard
Noah D. Goodman
CML
73
98
0
05 Mar 2023
1