2010.11859
Cited By
Not all parameters are born equal: Attention is mostly what you need
22 October 2020
Nikolay Bogoychev
MoE
Papers citing "Not all parameters are born equal: Attention is mostly what you need"
3 / 3 papers shown
Title
A Fast Transformer-based General-Purpose Lossless Compressor
Yushun Mao
Yufei Cui
Tei-Wei Kuo
C. Xue
ViT
AI4CE
18
28
0
30 Mar 2022
Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation
Mozhdeh Gheini
Xiang Ren
Jonathan May
LRM
20
105
0
18 Apr 2021
The Loss Surfaces of Multilayer Networks
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
175
1,184
0
30 Nov 2014