arXiv:2110.15238
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
25 October 2021
Jiarong Xing
Leyuan Wang
Shang Zhang
Jack H Chen
Ang Chen
Yibo Zhu
Papers citing "Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance"
11 / 11 papers shown
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
Yaoyao Ding
Bohan Hou
X. Zhang
Allan Lin
Tianqi Chen
Cody Yu Hao
Yida Wang
Gennady Pekhimenko
41
0
0
17 Apr 2025
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch
Abhishek Ghosh
Ajay Nayak
Ashish Panwar
Arkaprava Basu
GNN
43
0
0
25 Mar 2025
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices
Juntao Zhao
Borui Wan
Yanghua Peng
Haibin Lin
Yibo Zhu
Chuan Wu
13
3
0
02 Jul 2024
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs
Haotian Tang
Shang Yang
Zhijian Liu
Ke Hong
Zhongming Yu
Xiuyu Li
Guohao Dai
Yu Wang
Song Han
55
21
0
25 Oct 2023
Autotuning Apache TVM-based Scientific Applications Using Bayesian Optimization
Xingfu Wu
P. Paramasivam
Valerie Taylor
16
3
0
13 Sep 2023
AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization
Zhiying Xu
H. Peng
Wei Wang
GNN
10
3
0
02 Dec 2022
TLP: A Deep Learning-based Cost Model for Tensor Program Tuning
Yiqiang Zhai
Yu Zhang
Shuo Liu
Xiaomeng Chu
Jie Peng
Jianmin Ji
Yanyong Zhang
17
29
0
07 Nov 2022
ALCOP: Automatic Load-Compute Pipelining in Deep Learning Compiler for AI-GPUs
Guyue Huang
Yang Bai
L. Liu
Yuke Wang
Bei Yu
Yufei Ding
Yuan Xie
44
16
0
29 Oct 2022
ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations
Zhiying Xu
Jiafan Xu
H. Peng
Wei Wang
Xiaoliang Wang
...
Haipeng Dai
Yixu Xu
Hao Cheng
Kun Wang
Guihai Chen
13
0
0
22 Oct 2022
Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs
Yaoyao Ding
Cody Hao Yu
Bojian Zheng
Yizhi Liu
Yida Wang
Gennady Pekhimenko
14
30
0
18 Oct 2022
RepVGG: Making VGG-style ConvNets Great Again
Xiaohan Ding
X. Zhang
Ningning Ma
Jungong Han
Guiguang Ding
Jian-jun Sun
117
1,531
0
11 Jan 2021