
ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices
Papers citing "ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices"
0 / 0 papers shown
Title | |||
|---|---|---|---|
No papers found | |||

Title | |||
|---|---|---|---|
No papers found | |||