Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.19005
Cited By
Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
26 December 2024
Yihan Wu
Yichen Lu
Yifan Peng
Xihua Wang
Ruihua Song
Shinji Watanabe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization"
8 / 8 papers shown
Title
Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration
Wanglong Lu
Jikai Wang
Tao Wang
Kaihao Zhang
Xianta Jiang
Hanli Zhao
DiffM
35
1
0
31 Dec 2024
Semantic Image Synthesis via Diffusion Models
Weilun Wang
Weilun Wang
Wen-gang Zhou
Dongdong Chen
Dong Chen
Lu Yuan
Houqiang Li
DiffM
211
175
0
30 Jun 2022
SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
Yichun Shi
Xiao Yang
Yangyue Wan
Xiaohui Shen
GAN
140
83
0
04 Dec 2021
High-Fidelity GAN Inversion for Image Attribute Editing
Tengfei Wang
Yong Zhang
Yanbo Fan
Jue Wang
Qifeng Chen
DiffM
194
243
0
14 Sep 2021
Talk-to-Edit: Fine-Grained Facial Editing via Dialog
Yuming Jiang
Ziqi Huang
Xingang Pan
Chen Change Loy
Ziwei Liu
DiffM
107
125
0
09 Sep 2021
Cross-Domain and Disentangled Face Manipulation with 3D Guidance
Can Wang
Menglei Chai
Mingming He
Dongdong Chen
Jing Liao
CVBM
136
26
0
22 Apr 2021
Designing an Encoder for StyleGAN Image Manipulation
Omer Tov
Yuval Alaluf
Yotam Nitzan
Or Patashnik
Daniel Cohen-Or
200
775
0
04 Feb 2021
Image Inpainting for Irregular Holes Using Partial Convolutions
Guilin Liu
F. Reda
Kevin J. Shih
Ting-Chun Wang
Andrew Tao
Bryan Catanzaro
142
1,912
0
20 Apr 2018
1