2405.07527
Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models
13 May 2024
Yubin Shi, Yixuan Chen, Mingzhi Dong, Xiaochen Yang, Dongsheng Li, Yujiang Wang, Robert P. Dick, Qin Lv, Yingying Zhao, Fan Yang, Tun Lu, Ning Gu, L. Shang
MoMe
Papers citing "Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models"
1 / 1 papers shown
Title
Big Bird: Transformers for Longer Sequences
Manzil Zaheer, Guru Guruganesh, Kumar Avinava Dubey, Joshua Ainslie, Chris Alberti, ..., Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed
VLM
251
2,012
0
28 Jul 2020