Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
9 October 2023
Sangmin Bae
Jongwoo Ko
Hwanjun Song
SeYoung Yun

Papers citing "Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding"

6 / 56 papers shown
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Keivan Alizadeh-Vahid
Iman Mirzadeh
Dmitry Belenko
Karen Khatamifard
Minsik Cho
C. C. D. Mundo
Mohammad Rastegari
Mehrdad Farajtabar
278
196
0
12 Dec 2023
Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving
Symposium on Operating Systems Principles (SOSP), 2023
Yinwei Dai
Rui Pan
Anand Iyer
Kai Li
Ravi Netravali
179
19
0
08 Dec 2023
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
International Conference on Machine Learning (ICML), 2023
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
493
58
0
08 Dec 2023
SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Difan Jiao
Yilun Liu
Zhenwei Tang
Daniel Matter
Jürgen Pfeffer
Ashton Anderson
174
6
0
27 Nov 2023
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models
Peng Tang
Pengkai Zhu
Tian Li
Srikar Appalaraju
Vijay Mahadevan
R. Manmatha
230
9
0
15 Nov 2023
Contrastive Representation Distillation
International Conference on Learning Representations (ICLR), 2019
Yonglong Tian
Dilip Krishnan
Phillip Isola
1.4K
1,214
0
23 Oct 2019