ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices

ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices

22 October 2025

ArXiv (abs)PDF HTML Github

Papers citing "ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices"

0 / 0 papers shown

Title
No papers found