2410.17413
Cited By
Scalable Influence and Fact Tracing for Large Language Model Pretraining
International Conference on Learning Representations (ICLR), 2025
22 October 2024
Tyler A. Chang
Dheeraj Rajagopal
Tolga Bolukbasi
Lucas Dixon
Ian Tenney
TDI
Github (33★)
Papers citing
"Scalable Influence and Fact Tracing for Large Language Model Pretraining"
10 / 10 papers shown
RAG System for Supporting Japanese Litigation Procedures: Faithful Response Generation Complying with Legal Norms
Yuya Ishihara
Atsushi Keyaki
Hiroaki Yamada
Ryutaro Ohara
Mihoko Sumida
AILaw
171
0
0
28 Nov 2025
LLM generation novelty through the lens of semantic similarity
Philipp Davydov
Ameya Prabhu
Matthias Bethge
Elisa Nguyen
Seong Joon Oh
TDI
503
0
1
31 Oct 2025
Exploring Training Data Attribution under Limited Access Constraints
Shiyuan Zhang
Junwei Deng
Juhan Bae
Jiaqi W. Ma
TDI
317
0
0
16 Sep 2025
Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics
Carter Blum
Katja Filippova
Ann Yuan
Asma Ghandeharioun
Julian Zimmert
...
Jessica Hoffmann
Tal Linzen
Martin Wattenberg
Lucas Dixon
Mor Geva
273
2
0
14 Aug 2025
DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models
Cathy Jiao
Yijun Pan
Emily Xiao
Daisy Sheng
Niket Jain
H. C. Zhao
Ishita Dasgupta
Jiaqi W. Ma
Chenyan Xiong
296
1
0
12 Jul 2025
Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
Yukun Huang
Sanxing Chen
Jian Pei
Manzil Zaheer
Bhuwan Dhingra
KELM
RALM
HILM
LRM
458
0
0
21 Jun 2025
Daunce: Data Attribution through Uncertainty Estimation
Xingyuan Pan
Chenlu Ye
Joseph Melkonian
Jiaqi W. Ma
Tong Zhang
TDI
UQCV
219
2
0
29 May 2025
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
Yuzheng Hu
Fan Wu
Haotian Ye
David A. Forsyth
James Y. Zou
Nan Jiang
Jiaqi W. Ma
Han Zhao
OffRL
398
6
0
25 May 2025
Enhancing Training Data Attribution with Representational Optimization
W. Sun
Haokun Liu
Nikhil Kandpal
Colin Raffel
Yiming Yang
TDI
553
3
0
24 May 2025
SAFE: Improving LLM Systems using Sentence-Level In-generation Attribution
João Eduardo Batista
Emil Vatai
Mohamed Wahib
477
0
0
19 May 2025