Towards Fast Multilingual LLM Inference: Speculative Decoding and
  Specialized Drafters

Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters

Papers citing "Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters"