2502.16802
Cited By
Unsupervised Topic Models are Data Mixers for Pre-training Language Models
24 February 2025
Jiahui Peng
Xinlin Zhuang
Qiu Jiantao
Ren Ma
Jing Yu
Tianyi Bai
Conghui He
Papers citing "Unsupervised Topic Models are Data Mixers for Pre-training Language Models"
No papers