Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.00234
Cited By
Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers
1 January 2021
Machel Reid
Edison Marrese-Taylor
Y. Matsuo
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers"
7 / 7 papers shown
Title
Merging Feed-Forward Sublayers for Compressed Transformers
Neha Verma
Kenton W. Murray
Kevin Duh
AI4CE
50
0
0
10 Jan 2025
QSpec: Speculative Decoding with Complementary Quantization Schemes
Juntao Zhao
Wenhao Lu
Sheng Wang
Lingpeng Kong
Chuan Wu
MQ
62
5
0
15 Oct 2024
DE
3
^3
3
-BERT: Distance-Enhanced Early Exiting for BERT based on Prototypical Networks
Jianing He
Qi Zhang
Weiping Ding
Duoqian Miao
Jun Zhao
Liang Hu
LongBing Cao
34
3
0
03 Feb 2024
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
28
109
0
31 Aug 2022
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation
Tao Ge
Si-Qing Chen
Furu Wei
MoE
22
21
0
16 Feb 2022
ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
J. Tan
Y. Tan
C. Chan
Joon Huang Chuah
VLM
ViT
11
15
0
11 Feb 2022
Text Summarization with Pretrained Encoders
Yang Liu
Mirella Lapata
MILM
254
1,430
0
22 Aug 2019
1