2303.14717
Cited By
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Computer Vision and Pattern Recognition (CVPR), 2023
26 March 2023
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
Github (4682★)
Papers citing
"CelebV-Text: A Large-Scale Facial Text-Video Dataset"
50 / 61 papers shown
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
Bo Chen
Tao Liu
Qi Chen
Xie Chen
Zilong Zheng
VGen
128
0
0
27 Nov 2025
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
Shuai Zhang
Bao Tang
Siyuan Yu
Yueting Zhu
Jingfeng Yao
Ya Zou
Shanglin Yuan
Li Yu
Wenyu Liu
Xinggang Wang
DiffM
VGen
253
0
0
26 Nov 2025
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
Chao Wang
Chengan Che
Xinyue Chen
Sophia Tsoka
Luis C. Garcia-Peraza-Herrera
287
0
0
25 Nov 2025
VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models
Zhicheng Zhang
Weicheng Wang
Yongjie Zhu
Wenyu Qin
Pengfei Wan
Di Zhang
Jufeng Yang
159
2
0
04 Nov 2025
What If: Understanding Motion Through Sparse Interactions
S. A. Baumann
Nick Stracke
Timy Phan
Björn Ommer
188
1
0
14 Oct 2025
SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation
Zeyu Ling
Xiaodong Gu
Jiangnan Tang
Changqing Zou
CLIP
196
0
0
11 Oct 2025
Durian: Dual Reference Image-Guided Portrait Animation with Attribute Transfer
Hyunsoo Cha
Byungjun Kim
Hanbyul Joo
178
0
0
04 Sep 2025
Human Motion Video Generation: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Haiwei Xue
Xiangyang Luo
Zhanghao Hu
Shu Zhang
Xunzhi Xiang
...
Fei Ma
Zhiyong Wu
Changpeng Yang
Zonghong Dai
Fei Richard Yu
EGVM
VGen
269
33
0
04 Sep 2025
Towards High-Fidelity, Identity-Preserving Real-Time Makeup Transfer: Decoupling Style Generation
Lydia Kin Ching Chau
Zhi Yu
Ruowei Jiang
DiffM
251
0
0
02 Sep 2025
MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling
Haoyu Wang
Hao Tang
Donglin Di
Zhilu Zhang
W. Zuo
Feng Gao
Siwei Ma
Shiliang Zhang
DiffM
VGen
206
0
0
24 Aug 2025
DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation
He Feng
Yongjia Ma
Donglin Di
Lei Fan
Tonghua Su
Xiangqian Wu
DiffM
VGen
162
1
0
29 Jul 2025
MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
Xinyang Li
Gen Li
Zhihui Lin
Yichen Qian
Gongxin Yao
Weinan Jia
Aowen Wang
Weihua Chen
Fan Wang
DiffM
VGen
314
0
0
04 Jul 2025
Sonic4D: Spatial Audio Generation for Immersive 4D Scene Exploration
Siyi Xie
Hanxin Zhu
Tianyu He
X. Li
Zhibo Chen
VGen
397
4
0
18 Jun 2025
EchoShot: Multi-Shot Portrait Video Generation
Jiahao Wang
Hualian Sheng
Sijia Cai
Weizhan Zhang
Caixia Yan
Yachuang Feng
Bing Deng
Jieping Ye
DiffM
VGen
281
11
0
16 Jun 2025
Exploring Timeline Control for Facial Motion Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Yifeng Ma
Jinwei Qi
Chaonan Ji
Peng Zhang
Bang Zhang
Zhidong Deng
Liefeng Bo
VGen
283
2
0
27 May 2025
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
423
8
0
01 May 2025
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Zichuan Liu
Liming Jiang
Qing Yan
Yumin Jia
Hao Kang
Xin Lu
DiffM
499
2
0
19 Apr 2025
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
ACM Multimedia (ACM MM), 2025
Yang Shi
Jiaheng Liu
Yushuo Guan
Zhikai Wu
Yujiao Shi
...
Bohan Zeng
Wei Zhang
Fuzheng Zhang
Wenjing Yang
Di Zhang
VGen
VLM
469
17
0
14 Apr 2025
FVQ: A Large-Scale Dataset and an LMM-based Method for Face Video Quality Assessment
Sijing Wu
Yunhao Li
Ziwen Xu
Yixuan Gao
Huiyu Duan
Wei Sun
Guoquan Zheng
853
4
0
12 Apr 2025
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
Kim Sung-Bin
Jeongsoo Choi
Puyuan Peng
Joon Son Chung
Tae-Hyun Oh
David Harwath
VGen
337
8
0
03 Apr 2025
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Fa-Ting Hong
Zunnan Xu
Zixiang Zhou
Zhiqiang Zhang
Xiu Li
Qin Lin
Qinglin Lu
D. Xu
DiffM
VGen
608
14
0
03 Apr 2025
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Kun Liu
Qi Liu
Xinchen Liu
Jie Li
Yongdong Zhang
Jiebo Luo
Xiaodong He
Wu Liu
VGen
343
15
0
31 Mar 2025
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
Computer Vision and Pattern Recognition (CVPR), 2025
Yukang Lin
Hokit Fung
Jianjin Xu
Zeping Ren
Adela S.M. Lau
Guosheng Yin
Xiu Li
VGen
349
14
0
25 Mar 2025
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Hao Kang
Xin Lu
421
34
0
20 Mar 2025
Visual Persona: Foundation Model for Full-Body Human Customization
Computer Vision and Pattern Recognition (CVPR), 2025
Jisu Nam
Soowon Son
Zhan Xu
Jing Shi
Difan Liu
Feng Liu
Aashish Misraa
Seungryong Kim
Yang Zhou
DiffM
384
8
0
19 Mar 2025
Personalized Generation In Large Model Era: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenjie Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
639
42
0
04 Mar 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Computer Vision and Pattern Recognition (CVPR), 2025
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Zięba
Stavros Petridis
Maja Pantic
DiffM
VGen
449
10
0
03 Mar 2025
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
Lingzhou Mu
Baiji Liu
Ruonan Zhang
Guiming Mo
Jiawei Jin
Kai Zhang
Haozhi Huang
DiffM
VGen
610
0
0
26 Feb 2025
PERSE: Personalized 3D Generative Avatars from A Single Portrait
Computer Vision and Pattern Recognition (CVPR), 2024
Hyunsoo Cha
Inhee Lee
Hanbyul Joo
3DGS
299
14
0
30 Dec 2024
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Computer Vision and Pattern Recognition (CVPR), 2024
Guocheng Qian
Kuan-Chieh Wang
Or Patashnik
Negin Heravi
Daniil Ostashev
Sergey Tulyakov
Daniel Cohen-Or
Kfir Aberman
483
19
0
12 Dec 2024
HiFiVFS: High Fidelity Video Face Swapping
Xu Chen
Keke He
Junwei Zhu
Yanhao Ge
Wei Li
Chengjie Wang
VGen
DiffM
427
8
0
27 Nov 2024
MotionCharacter: Fine-Grained Motion Controllable Human Video Generation
Haopeng Fang
Di Qiu
Binjie Mao
Pengfei Yan
VGen
DiffM
287
12
0
27 Nov 2024
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Computer Vision and Pattern Recognition (CVPR), 2024
Xiaozhong Ji
Xiaobin Hu
Zhihong Xu
Junwei Zhu
Chuming Lin
...
Donghao Luo
Yi Chen
Qin Lin
Qinglin Lu
Chengjie Wang
VGen
463
59
0
25 Nov 2024
HumanVLM: Foundation for Human-Scene Vision-Language Model
Information Fusion (Inf. Fusion), 2024
Dawei Dai
Xu Long
Li Yutang
Zhang YuanHui
Shuyin Xia
VLM
MLLM
431
15
0
05 Nov 2024
Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions
International Conference on 3D Vision (3DV), 2024
Malte Prinzler
Egor Zakharov
V. Sklyarova
Berna Kabadayi
Justus Thies
DiffM
248
13
0
21 Oct 2024
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
ACM Multimedia (MM), 2024
Sijing Wu
Yunhao Li
Manwen Liao
Huiyu Duan
Ziwei Liu
Guangtao Zhai
3DH
VGen
291
23
0
10 Oct 2024
Face Forgery Detection with Elaborate Backbone
Zonghui Guo
Y. Liu
Jie Zhang
Haiyong Zheng
Shiguang Shan
AAML
CVBM
342
2
0
25 Sep 2024
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation
Donglin Di
Hao Feng
Wenzhang Sun
Yongjia Ma
Hao Li
Wei Chen
Xiaofei Gou
Tonghua Su
Xun Yang
CVBM
478
2
0
23 Sep 2024
InstantDrag: Improving Interactivity in Drag-based Image Editing
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2024
Joonghyuk Shin
Daehyeon Choi
Jaesik Park
DiffM
323
34
0
13 Sep 2024
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
AAAI Conference on Artificial Intelligence (AAAI), 2024
Chaeyeon Chung
Sunghyun Park
J. Kim
Jaegul Choo
DiffM
187
6
0
29 Aug 2024
15M Multimodal Facial Image-Text Dataset
Dawei Dai
Yutang Li
Yingge Liu
Mingming Jia
Zhang YuanHui
Guoyin Wang
VLM
471
20
0
11 Jul 2024
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
Kepan Nan
Rui Xie
Penghao Zhou
Tiehan Fan
Zhenheng Yang
Zhijie Chen
Xiang Li
Jian Yang
Ying Tai
763
234
0
02 Jul 2024
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset
Kim Sung-Bin
Lee Chae-Yeon
Gihun Son
Oh Hyun-Bin
Janghoon Ju
Suekyeong Nam
Tae-Hyun Oh
278
23
0
20 Jun 2024
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
Rotem Shalev-Arkushin
Aharon Azulay
Tavi Halperin
Eitan Richardson
Amit H. Bermano
Ohad Fried
DiffM
372
0
0
20 Jun 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
301
43
0
17 May 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
534
104
0
23 Apr 2024
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
Felix Taubner
Prashant Raina
Mathieu Tuli
Eu Wern Teh
Chul Lee
Jinmiao Huang
3DH
CVBM
216
12
0
15 Apr 2024
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Yu Deng
Duomin Wang
Baoyuan Wang
325
56
0
20 Mar 2024
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models
Neural Information Processing Systems (NeurIPS), 2024
Wenhao Wang
Yi Yang
VGen
DiffM
540
91
0
10 Mar 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Eric Wang
Xin Li
Luisa Verdoliva
Shu Hu
1.1K
101
0
22 Jan 2024