arXiv:2504.17376
Cited By
On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration
24 April 2025
Maoyang Xiang
Ramesh Fernando
Bo Wang
Papers citing "On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration": No papers