ResearchTrend.AI
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
arXiv: 2411.19146

28 November 2024
Akhiad Bercovich
Tomer Ronen
Talor Abramovich
Nir Ailon
Nave Assaf
Mohammad Dabbah
Ido Galil
Amnon Geifman
Yonatan Geifman
I. Golan
Netanel Haber
Ehud Karpas
Roi Koren
Itay Levy
Pavlo Molchanov
Shahar Mor
Zach Moshe
Najeeb Nabwani
Omri Puny
Ran Rubin
Itamar Schen
Ido Shahaf
Oren Tropp
Omer Ullman Argov
Ran Zilberstein
Ran El-Yaniv

Papers citing "Puzzle: Distillation-Based NAS for Inference-Optimized LLMs"

1 / 1 papers shown
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
J. Li
Yixing Xu
Haiduo Huang
Xuanwu Yin
D. Li
Edith C.-H. Ngai
E. Barsoum
13 Mar 2025