Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.03735
Cited By
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
30 September 2024
David Grangier
Simin Fan
Skyler Seto
Pierre Ablin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling"
3 / 3 papers shown
Title
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Dan Su
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
28
0
0
17 Apr 2025
Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging
Pierre Ablin
Angelos Katharopoulos
Skyler Seto
David Grangier
MoMe
40
0
0
03 Feb 2025
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
69
0
0
20 Nov 2024
1