Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2412.16915
Cited By
v1
v2 (latest)
FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation
Computer Vision and Pattern Recognition (CVPR), 2024
22 December 2024
Tianyun Zhong
Chao Liang
Jianwen Jiang
Gaojie Lin
Jiaqi Yang
Zhou Zhao
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation"
46 / 46 papers shown
Title
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
Jianwen Jiang
Weihong Zeng
Zerong Zheng
Jiaqi Yang
Chao Liang
Wang Liao
Han Liang
Yuan Zhang
Mingyuan Gao
VGen
65
4
0
26 Aug 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Gaojie Lin
Jianwen Jiang
Jiaqi Yang
Zerong Zheng
Chao Liang
DiffM
VGen
1.1K
71
0
01 Jul 2025
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
International Conference on Learning Representations (ICLR), 2024
Zhengyao Lv
Chenyang Si
Junhao Song
Zhenyu Yang
Ping Luo
Yu Qiao
Kwan-Yee K. Wong
VGen
DiffM
334
43
0
13 Mar 2025
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
International Conference on Learning Representations (ICLR), 2024
Jiahao Cui
Hui Li
Yao Yao
Hao Zhu
Hanlin Shang
Kaihui Cheng
Hang Zhou
Siyu Zhu
Jingdong Wang
DiffM
VGen
254
69
0
10 Oct 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Neural Information Processing Systems (NeurIPS), 2024
Zhenhui Ye
Tianyun Zhong
Yi Ren
Ziyue Jiang
Jiawei Huang
...
Chen Zhang
Zehan Wang
Xize Chen
Xiang Yin
Zhou Zhao
VGen
246
18
0
09 Oct 2024
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Gaojie Lin
Jianwen Jiang
Chao Liang
Tianyun Zhong
Jiaqi Yang
Yanbo Zheng
VGen
DiffM
466
31
0
03 Sep 2024
SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
European Conference on Computer Vision (ECCV), 2024
T. Dao
Thuan Hoang Nguyen
T. Le
D. Vu
Khoi Nguyen
Cuong Pham
Anh Tran
DiffM
230
31
0
26 Aug 2024
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions
Zhiyuan Chen
Jiajiong Cao
Zhiquan Chen
Yuming Li
Chenguang Ma
VGen
218
142
0
11 Jul 2024
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
Jianzhu Guo
Dingyun Zhang
Xiaoqiang Liu
Zhizhou Zhong
Yuan Zhang
Pengfei Wan
Di Zhang
VGen
436
143
0
03 Jul 2024
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Mingwang Xu
Hui Li
Qingkun Su
Hanlin Shang
Liwei Zhang
Ce Liu
Jingdong Wang
Yao Yao
Siyu Zhu
VGen
195
162
0
13 Jun 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Neural Information Processing Systems (NeurIPS), 2024
Yang Sui
Yanyu Li
Vidit Goel
Yerlan Idelbayev
Junli Cao
Ju Hu
Dhritiman Sagar
Bo Yuan
Sergey Tulyakov
Jian Ren
MQ
212
36
0
06 Jun 2024
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
Cong Wang
Kuan Tian
Jun Zhang
Yonghang Guan
Feng Luo
Fei Shen
Zhiwei Jiang
Qing Gu
Xiao Han
Wei Yang
199
75
0
04 Jun 2024
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
Tianchen Zhao
Tongcheng Fang
Haofeng Huang
Enshu Liu
Widyadewi Soedarmadji
...
Shengen Yan
Huazhong Yang
Xuefei Ning
Xuefei Ning
Yu Wang
MQ
VGen
400
60
0
04 Jun 2024
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
Hanshu Yan
Xingchao Liu
Jiachun Pan
Jun Hao Liew
Qiang Liu
Jiashi Feng
466
73
0
13 May 2024
Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping
Jianbin Zheng
Minghui Hu
Zhongyi Fan
Chaoyue Wang
Changxing Ding
Dacheng Tao
Tat-Jen Cham
287
39
0
29 Feb 2024
EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Linrui Tian
Qi Wang
Bang Zhang
Liefeng Bo
DiffM
282
202
0
27 Feb 2024
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Shanchuan Lin
Anran Wang
Xiao Yang
372
193
0
21 Feb 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
International Conference on Learning Representations (ICLR), 2024
Zhenhui Ye
Tianyun Zhong
Yi Ren
Jiaqi Yang
Weichuang Li
...
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
193
75
0
16 Jan 2024
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Haoning Wu
Zicheng Zhang
Weixia Zhang
Chaofeng Chen
Liang Liao
...
Wenxiu Sun
Qiong Yan
Xiongkuo Min
Guangtao Zhai
Weisi Lin
231
329
0
28 Dec 2023
SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation
Computer Vision and Pattern Recognition (CVPR), 2023
Thuan Hoang Nguyen
Anh Tran
DiffM
291
89
0
08 Dec 2023
Adversarial Diffusion Distillation
European Conference on Computer Vision (ECCV), 2023
Axel Sauer
Dominik Lorenz
A. Blattmann
Robin Rombach
819
577
0
28 Nov 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Computer Vision and Pattern Recognition (CVPR), 2023
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffM
VGen
353
610
0
28 Nov 2023
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simian Luo
Yiqin Tan
Longbo Huang
Jian Li
Hang Zhao
DiffM
300
626
0
06 Oct 2023
Temporal Dynamic Quantization for Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Junhyuk So
Jungwon Lee
Daehyun Ahn
Hyungjun Kim
Eunhyeok Park
DiffM
MQ
282
81
0
04 Jun 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Neural Information Processing Systems (NeurIPS), 2023
Yanyu Li
Huan Wang
Qing Jin
Ju Hu
Pavlo Chemerys
Yun Fu
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VLM
241
225
0
01 Jun 2023
PTQD: Accurate Post-Training Quantization for Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Yefei He
Luping Liu
Jing Liu
Weijia Wu
Hong Zhou
Bohan Zhuang
DiffM
MQ
414
156
0
18 May 2023
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Zhenhui Ye
Jinzheng He
Ziyue Jiang
Rongjie Huang
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Xiang Yin
Zejun Ma
Zhou Zhao
CVBM
183
53
0
01 May 2023
Consistency Models
International Conference on Machine Learning (ICML), 2023
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLM
DiffM
368
1,377
0
02 Mar 2023
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis
International Conference on Learning Representations (ICLR), 2023
Zhenhui Ye
Ziyue Jiang
Yi Ren
Jinglin Liu
Jinzheng He
Zhou Zhao
CVBM
183
174
0
31 Jan 2023
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yifeng Ma
Suzhe Wang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Zhidong Deng
Xin Yu
286
115
0
03 Jan 2023
Post-training Quantization on Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Yuzhang Shang
Zhihang Yuan
Bin Xie
Bingzhe Wu
Yan Yan
DiffM
MQ
356
257
0
28 Nov 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
International Journal of Computer Vision (IJCV), 2022
Jiaxiang Tang
Kaisiyuan Wang
Hang Zhou
Xiaokang Chen
Dongliang He
Tianshu Hu
Jingtuo Liu
Gang Zeng
Jingdong Wang
3DH
188
111
0
22 Nov 2022
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Computer Vision and Pattern Recognition (CVPR), 2022
Wenxuan Zhang
Xiaodong Cun
Xuan Wang
Yong Zhang
Xiaodong Shen
Yu-Xiao Guo
Ying Shan
Haiwei Yang
VGen
191
377
0
22 Nov 2022
On Distillation of Guided Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Chenlin Meng
Robin Rombach
Ruiqi Gao
Diederik P. Kingma
Stefano Ermon
Jonathan Ho
Tim Salimans
VLM
DiffM
176
677
0
06 Oct 2022
Rectified Flow: A Marginal Preserving Approach to Optimal Transport
Qiang Liu
OT
351
187
0
29 Sep 2022
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
European Conference on Computer Vision (ECCV), 2022
Haoning Zhu
Wayne Wu
Wentao Zhu
Liming Jiang
Siwei Tang
Li Zhang
Ziwei Liu
Chen Change Loy
323
243
0
25 Jul 2022
Thin-Plate Spline Motion Model for Image Animation
Computer Vision and Pattern Recognition (CVPR), 2022
Jian Zhao
Hui Zhang
191
247
0
27 Mar 2022
Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Fa-Ting Hong
Longhao Zhang
Li Shen
Dan Xu
3DH
CVBM
238
215
0
13 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
1.3K
20,412
0
20 Dec 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
Computer Vision and Pattern Recognition (CVPR), 2021
Hang Zhou
Yasheng Sun
Wayne Wu
Chen Change Loy
Xiaogang Wang
Ziwei Liu
CVBM
287
422
0
22 Apr 2021
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
Computer Vision and Pattern Recognition (CVPR), 2020
Ting-Chun Wang
Arun Mallya
Xuan Li
3DH
448
578
0
30 Nov 2020
Denoising Diffusion Implicit Models
International Conference on Learning Representations (ICLR), 2020
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
1.2K
9,865
0
06 Oct 2020
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
359
985
0
23 Aug 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
4.1K
24,859
0
19 Jun 2020
MakeItTalk: Speaker-Aware Talking-Head Animation
Yang Zhou
Xintong Han
Eli Shechtman
J. Echevarria
E. Kalogerakis
Dingzeyu Li
260
493
0
27 Apr 2020
Decision-Making with Auto-Encoding Variational Bayes
Neural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.4K
19,430
0
17 Feb 2020
1