ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices

ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices

    MQ

Papers citing "ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices"

0 / 0 papers shown
Title

No papers found