Applying sparse autoencoders to unlearn knowledge in language models


25 October 2024
Eoin Farrell
Yeu-Tong Lau
Arthur Conmy
    MU

Papers citing "Applying sparse autoencoders to unlearn knowledge in language models"

2 / 2 papers shown
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
Bartosz Cywiński
Kamil Deja
DiffM
61
6
0
29 Jan 2025
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study
Yang Xu
Y. Wang
Hao Wang
95
1
0
23 Dec 2024