Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2304.02721
Cited By

To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence
Models for Improved Inference Efficiency

v1v2v3 (latest)

To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency

5 April 2023

Daniel Fernando Campos

Chengxiang Zhai

ArXiv (abs)PDF HTML HuggingFace (3 upvotes)

Papers citing "To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency"

1 / 1 papers shown

Predictive Pipelined Decoding: A Compute-Latency Trade-off for Exact LLM
Decoding

Predictive Pipelined Decoding: A Compute-Latency Trade-off for Exact LLM Decoding

Dimitris Papailiopoulos

250

48

0

12 Jul 2023

Page 1 of 1