Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.14971
Cited By
Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation
21 June 2024
Shamane Siriwardhana
Mark McQuade
Thomas Gauthier
Lucas Atkins
Fernando Fernandes Neto
Luke Meyers
Anneketh Vij
Tyler Odenthal
Charles Goddard
Mary MacCarthy
Jacob Solawetz
CLL
MoMe
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation"
4 / 4 papers shown
Title
Extracting and Transferring Abilities For Building Multi-lingual Ability-enhanced Large Language Models
Zhipeng Chen
Liang Song
K. Zhou
Wayne Xin Zhao
B. Wang
Weipeng Chen
Ji-Rong Wen
60
0
0
10 Oct 2024
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Zorik Gekhman
G. Yona
Roee Aharoni
Matan Eyal
Amir Feder
Roi Reichart
Jonathan Herzig
48
98
0
09 May 2024
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Charles Goddard
Shamane Siriwardhana
Malikeh Ehghaghi
Luke Meyers
Vladimir Karpukhin
Brian Benedict
Mark McQuade
Jacob Solawetz
MoMe
KELM
72
75
0
20 Mar 2024
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,791
0
17 Sep 2019
1