Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.11267
Cited By
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
21 April 2023
Yu-Hui Chen
Raman Sarokin
Juhyun Lee
Jiuqiang Tang
Chuo-Ling Chang
Andrei Kulik
Matthias Grundmann
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations"
8 / 8 papers shown
Title
OODTE: A Differential Testing Engine for the ONNX Optimizer
Nikolaos Louloudakis
Ajitha Rajan
22
0
0
03 May 2025
Scaling On-Device GPU Inference for Large Generative Models
Jiuqiang Tang
Raman Sarokin
Ekaterina Ignasheva
Grant Jensen
Lin Chen
Juhyun Lee
Andrei Kulik
Matthias Grundmann
33
0
0
01 May 2025
Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation
J. Park
Maanas Taneja
Qianwen Wang
Dongyeop Kang
VGen
65
0
0
26 Apr 2025
Temporal Feature Matters: A Framework for Diffusion Model Quantization
Yushi Huang
Ruihao Gong
Xianglong Liu
Jing Liu
Yuhang Li
Jiwen Lu
Dacheng Tao
DiffM
MQ
42
0
0
28 Jul 2024
LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models
Dingkun Zhang
Sijia Li
Chen Chen
Qingsong Xie
H. Lu
34
21
0
17 Apr 2024
Squeezing Large-Scale Diffusion Models for Mobile
Jiwoong Choi
Minkyu Kim
Daehyun Ahn
Taesu Kim
Yulhwa Kim
Do-Hyun Jo
H. Jeon
Jae-Joon Kim
Hyungjun Kim
13
9
0
03 Jul 2023
Winograd Convolution for Deep Neural Networks: Efficient Point Selection
Syed Asad Alam
Andrew Anderson
B. Barabasz
David Gregg
46
25
0
25 Jan 2022
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
262
10,183
0
12 Dec 2018
1