Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.12687
Cited By
Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models
17 December 2024
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Tony Q. S. Quek
Seong-Lyun Kim
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models"
2 / 2 papers shown
Title
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
Zhonghao Lyu
Ming Xiao
Jie Xu
Mikael Skoglund
Marco Di Renzo
28
0
0
14 May 2025
A Novel Hat-Shaped Device-Cloud Collaborative Inference Framework for Large Language Models
Zuan Xie
Yang Xu
Hongli Xu
Yunming Liao
Zhiwei Yao
53
0
0
23 Mar 2025
1