Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.08545
Cited By
Efficient Continual Pre-training for Building Domain Specific Large Language Models
14 November 2023
Yong Xie
Karan Aggarwal
Aitzaz Ahmad
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Continual Pre-training for Building Domain Specific Large Language Models"
7 / 7 papers shown
Title
ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
Elyas Obbad
Iddah Mlauzi
Brando Miranda
Rylan Schaeffer
Kamal Obbad
Suhana Bedi
Sanmi Koyejo
CVBM
37
0
0
23 Oct 2024
Towards LifeSpan Cognitive Systems
Yu Wang
Chi Han
Tongtong Wu
Xiaoxin He
Wangchunshu Zhou
...
Zexue He
Wei Wang
Gholamreza Haffari
Heng Ji
Julian McAuley
KELM
CLL
69
1
0
20 Sep 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
49
36
0
23 Apr 2024
Towards Continual Knowledge Learning of Language Models
Joel Jang
Seonghyeon Ye
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Stanley Jungkyu Choi
Minjoon Seo
CLL
KELM
216
122
0
07 Oct 2021
EDGAR-CORPUS: Billions of Tokens Make The World Go Round
Lefteris Loukas
Manos Fergadiotis
Ion Androutsopoulos
Prodromos Malakasiotis
AIFin
71
24
0
29 Sep 2021
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
234
447
0
14 Jul 2021
Domain-Adversarial Training of Neural Networks
Yaroslav Ganin
E. Ustinova
Hana Ajakan
Pascal Germain
Hugo Larochelle
François Laviolette
M. Marchand
Victor Lempitsky
GAN
OOD
149
8,353
0
28 May 2015
1