Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.01229
Cited By
LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving
1 September 2025
Huanqi Hu
Bowen Xiao
Shixuan Sun
Jianian Yin
Zhexi Zhang
Xiang Luo
Chengquan Jiang
Weiqi Xu
Xiaoying Jia
Xin Liu
Minyi Guo
MQ
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (748★)
Papers citing
"LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving"
2 / 2 papers shown
Title
CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization
Zijian Zhang
Rong Wang
Shiyang Li
Yuebo Luo
Mingyi Hong
Caiwen Ding
124
0
0
23 Oct 2025
PreScope: Unleashing the Power of Prefetching for Resource-Constrained MoE Inference
Enda Yu
Zhaoning Zhang
Dezun Dong
Yongwei Wu
Xiangke Liao
144
1
0
28 Sep 2025
1