2109.08354
Task-adaptive Pre-training of Language Models with Word Embedding Regularization
17 September 2021
Kosuke Nishida
Kyosuke Nishida
Sen Yoshida
VLM
Papers citing
"Task-adaptive Pre-training of Language Models with Word Embedding Regularization"
3 / 3 papers shown
Title
G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks
Zhongwei Wan
Yichun Yin
Wei Zhang
Jiaxin Shi
Lifeng Shang
Guangyong Chen
Xin Jiang
Qun Liu
VLM
CLL
21
16
0
07 Dec 2022
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,791
0
17 Sep 2019
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,724
0
26 Sep 2016