arXiv:2308.03296
Studying Large Language Model Generalization with Influence Functions
7 August 2023
Roger C. Grosse
Juhan Bae
Cem Anil
Nelson Elhage
Alex Tamkin
Amirhossein Tajdini
Benoit Steiner
Dustin Li
Esin Durmus
Ethan Perez
Evan Hubinger
Kamilė Lukošiūtė
Karina Nguyen
Nicholas Joseph
Sam McCandlish
Jared Kaplan
Sam Bowman
TDI
Papers citing
"Studying Large Language Model Generalization with Influence Functions"
17 / 17 papers shown
Origin Tracer: A Method for Detecting LoRA Fine-Tuning Origins in LLMs
Hongyu Liang
Yuting Zheng
Yihan Li
Yiran Zhang
Shiyu Liang
18
0
0
26 May 2025
GraSS: Scalable Influence Function with Sparse Gradient Compression
Pingbang Hu
Joseph Melkonian
Weijing Tang
Han Zhao
Jiaqi W. Ma
TDI
133
0
0
25 May 2025
TRACE for Tracking the Emergence of Semantic Representations in Transformers
Nura Aljaafari
Danilo S. Carvalho
André Freitas
49
0
0
23 May 2025
Diagnosing our datasets: How does my language model learn clinical information?
Furong Jia
David Sontag
Monica Agrawal
LM&MA
98
0
0
21 May 2025
Beyond Public Access in LLM Pre-Training Data
Sruly Rosenblat
Tim O'Reilly
Ilan Strauss
MLAU
108
0
0
24 Apr 2025
High-Dimensional Interlingual Representations of Large Language Models
Bryan Wilie
Samuel Cahyawijaya
Junxian He
Pascale Fung
76
0
0
14 Mar 2025
Dataset Featurization: Uncovering Natural Language Features through Unsupervised Data Reconstruction
Michal Bravansky
Vaclav Kubon
Suhas Hariharan
Robert Kirk
81
1
0
24 Feb 2025
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images
Sheng-Yu Wang
Aaron Hertzmann
Alexei A. Efros
Jun-Yan Zhu
Richard Zhang
TDI
149
2
0
21 Feb 2025
Most Influential Subset Selection: Challenges, Promises, and Beyond
Yuzheng Hu
Pingbang Hu
Han Zhao
Jiaqi W. Ma
TDI
157
4
0
10 Jan 2025
Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks
Jim Zhao
Sidak Pal Singh
Aurelien Lucchi
AI4CE
76
0
0
04 Nov 2024
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models
Jinghan Jia
Jiancheng Liu
Yihua Zhang
Parikshit Ram
Nathalie Baracaldo
Sijia Liu
MU
88
2
0
23 Oct 2024
Influence Functions for Scalable Data Attribution in Diffusion Models
Bruno Mlodozeniec
Runa Eschenhagen
Juhan Bae
Alexander Immer
David Krueger
Richard E. Turner
DiffM
TDI
86
4
0
17 Oct 2024
Data Selection via Optimal Control for Language Models
Yuxian Gu
Li Dong
Hongning Wang
Y. Hao
Qingxiu Dong
Furu Wei
Minlie Huang
AI4CE
82
6
0
09 Oct 2024
How Much Can We Forget about Data Contamination?
Sebastian Bordt
Suraj Srinivas
Valentyn Boreiko
U. V. Luxburg
83
2
0
04 Oct 2024
Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons
Yongqi Leng
Deyi Xiong
56
7
0
09 Jul 2024
Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models
Lynn Chua
Badih Ghazi
Yangsibo Huang
Pritish Kamath
Ravi Kumar
Pasin Manurangsi
Amer Sinha
Chulin Xie
Chiyuan Zhang
83
2
0
23 Jun 2024
ALMANACS: A Simulatability Benchmark for Language Model Explainability
Edmund Mills
Shiye Su
Stuart J. Russell
Scott Emmons
84
7
0
20 Dec 2023