Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.00462
Cited By
Designing Efficient LLM Accelerators for Edge Devices
1 August 2024
Jude Haris
Rappy Saha
Wenhao Hu
José Cano
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Designing Efficient LLM Accelerators for Edge Devices"
4 / 4 papers shown
Title
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency
E. J. Husom
Arda Goknil
Merve Astekin
Lwin Khin Shar
Andre Kåsen
S. Sen
Benedikt Andreas Mithassel
Ahmet Soylu
MQ
32
0
0
04 Apr 2025
Exploiting Unstructured Sparsity in Fully Homomorphic Encrypted DNNs
Aidan Ferguson
Perry Gibson
Lara DÁgata
Parker McLeod
Ferhat Yaman
Amitabh Das
Ian Colbert
José Cano
58
0
0
12 Mar 2025
Silent Hazards of Token Reduction in Vision-Language Models: The Hidden Impact on Consistency
Yizheng Sun
Hao Li
Chang Xu
C. Lin
R. Batista-Navarro
Jingyuan Sun
57
0
0
09 Mar 2025
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
57
15
0
06 Oct 2024
1