Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.00542
Cited By
An Efficient Transformer Decoder with Compressed Sub-layers
3 January 2021
Yanyang Li
Ye Lin
Tong Xiao
Jingbo Zhu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Efficient Transformer Decoder with Compressed Sub-layers"
12 / 12 papers shown
Title
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Xupeng Miao
Gabriele Oliaro
Zhihao Zhang
Xinhao Cheng
Hongyi Jin
Tianqi Chen
Zhihao Jia
61
76
0
23 Dec 2023
UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science
Yazheng Yang
Yuqi Wang
Guangyi Liu
Ledell Yu Wu
Qi Liu
LMTD
30
15
0
18 Jul 2023
Building Multilingual Machine Translation Systems That Serve Arbitrary X-Y Translations
Akiko Eriguchi
Shufang Xie
Tao Qin
Hany Awadalla
LRM
53
7
0
30 Jun 2022
Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency
Yanyang Li
Fuli Luo
Runxin Xu
Songfang Huang
Fei Huang
Liwei Wang
25
3
0
06 Apr 2022
Towards Continual Knowledge Learning of Language Models
Joel Jang
Seonghyeon Ye
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Stanley Jungkyu Choi
Minjoon Seo
CLL
KELM
222
150
0
07 Oct 2021
The NiuTrans Machine Translation Systems for WMT21
Yuhao Zhang
Tao Zhou
Bin Wei
Runzhe Cao
Yongyu Mu
...
Weiqiao Shan
Yinqiao Li
Bei Li
Tong Xiao
Jingbo Zhu
30
17
0
22 Sep 2021
The NiuTrans System for the WMT21 Efficiency Task
Chenglong Wang
Chi Hu
Yongyu Mu
Zhongxiang Yan
Siming Wu
...
Hang Cao
Bei Li
Ye Lin
Tong Xiao
Jingbo Zhu
14
2
0
16 Sep 2021
RankNAS: Efficient Neural Architecture Search by Pairwise Ranking
Chi Hu
Chenglong Wang
Xiangnan Ma
Xia Meng
Yinqiao Li
Tong Xiao
Jingbo Zhu
Changliang Li
12
11
0
15 Sep 2021
Efficient Inference for Multilingual Neural Machine Translation
Alexandre Berard
Dain Lee
S. Clinchant
K. Jung
Vassilina Nikoulina
31
12
0
14 Sep 2021
Bag of Tricks for Optimizing Transformer Efficiency
Ye Lin
Yanyang Li
Tong Xiao
Jingbo Zhu
21
6
0
09 Sep 2021
Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding
Xin Sun
Tao Ge
Furu Wei
Houfeng Wang
9
61
0
09 Jun 2021
Findings of the Second Workshop on Neural Machine Translation and Generation
Alexandra Birch
A. Finch
Minh-Thang Luong
Graham Neubig
Yusuke Oda
DRL
25
12
0
08 Jun 2018
1