Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.01702
Cited By
Fluent dreaming for language models
24 January 2024
T. B. Thompson
Zygimantas Straznickas
Michael Sklar
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fluent dreaming for language models"
3 / 3 papers shown
Title
Patterns and Mechanisms of Contrastive Activation Engineering
Yixiong Hao
Ayush Panda
Stepan Shabalin
Sheikh Abdur Raheem Ali
LLMSV
58
0
0
06 May 2025
Finding Neurons in a Haystack: Case Studies with Sparse Probing
Wes Gurnee
Neel Nanda
Matthew Pauly
Katherine Harvey
Dmitrii Troitskii
Dimitris Bertsimas
MILM
153
186
0
02 May 2023
Toy Models of Superposition
Nelson Elhage
Tristan Hume
Catherine Olsson
Nicholas Schiefer
T. Henighan
...
Sam McCandlish
Jared Kaplan
Dario Amodei
Martin Wattenberg
C. Olah
AAML
MILM
120
314
0
21 Sep 2022
1