2311.16298
Influence Scores at Scale for Efficient Language Data Sampling
27 November 2023
Nikhil Anand
Joshua Tan
Maria Minakova
TDI
Papers citing
"Influence Scores at Scale for Efficient Language Data Sampling"
7 / 7 papers shown
Title
TarDiff: Target-Oriented Diffusion Guidance for Synthetic Electronic Health Record Time Series Generation
Bowen Deng
Chang Xu
H. Li
Yuhao Huang
Min Hou
Jiang Bian
MedIm
38
0
0
24 Apr 2025
Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration
Kangxi Wu
Liang Pang
Huawei Shen
Xueqi Cheng
TDI
23
0
0
02 Oct 2024
Sexism Detection on a Data Diet
Rabiraj Bandyopadhyay
Dennis Assenmacher
J. Alonso-Moral
Claudia Wagner
31
0
0
07 Jun 2024
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics
Shoaib Ahmed Siddiqui
Nitarshan Rajkumar
Tegan Maharaj
David M. Krueger
Sara Hooker
30
27
0
20 Sep 2022
Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information
Kawin Ethayarajh
Yejin Choi
Swabha Swayamdipta
159
157
0
16 Oct 2021
Estimating Example Difficulty Using Variance of Gradients
Chirag Agarwal
Daniel D'souza
Sara Hooker
190
105
0
26 Aug 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
226
4,424
0
23 Jan 2020