Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14631
Cited By
v1
v2
v3 (latest)
Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary
29 April 2021
Sibo Zhang
Jiahong Yuan
Miao Liao
Liangjun Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary"
20 / 20 papers shown
Title
DATA: Multi-Disentanglement based Contrastive Learning for Open-World Semi-Supervised Deepfake Attribution
Ming-Hui Liu
Xiao-Qian Liu
Xin Luo
Xin-Shun Xu
90
1
0
07 May 2025
FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
Shiyan Liu
Rui Qu
Yan Jin
88
0
0
06 Apr 2025
OmniTalker: One-shot Real-time Text-Driven Talking Audio-Video Generation With Multimodal Style Mimicking
Zhongjian Wang
Peng Zhang
Jinwei Qi
Guangyuan Wang Sheng Xu
Chaonan Ji
Sheng Xu
Bang Zhang
Liefeng Bo
DiffM
VGen
144
0
0
03 Apr 2025
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain
Nan Gao
Yihua Bao
Dongdong Weng
Jiayi Zhao
Jia Li
Yan Zhou
Pengfei Wan
Di Zhang
SLR
125
0
0
26 Mar 2025
3D Engine-ready Photorealistic Avatars via Dynamic Textures
Yifan Wang
Ivan Molodetskikh
Ondrej Texler
Dimitar Dinev
90
0
0
19 Mar 2025
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Florinel-Alin Croitoru
Andrei Iulian Hiji
Vlad Hondru
Nicolae-Cătălin Ristea
Paul Irofti
Marius Popescu
Cristian Rusu
Radu Tudor Ionescu
Fahad Shahbaz Khan
Mubarak Shah
135
5
0
29 Nov 2024
FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset
Donglin Di
Hao Feng
Wenzhang Sun
Yongjia Ma
Hao Li
Wei Chen
Xiaofei Gou
Tonghua Su
Xun Yang
CVBM
140
2
0
23 Sep 2024
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation
Niu Guanchen
3DH
91
0
0
17 Jun 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
90
10
0
16 May 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
104
43
0
08 Mar 2024
Unsupervised Sign Language Translation and Generation
Zhengsheng Guo
Zhiwei He
Wenxiang Jiao
Xing Wang
Rui Wang
Kehai Chen
Zhaopeng Tu
Yong-mei Xu
Min Zhang
131
0
0
12 Feb 2024
Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism
Georgios Milis
P. Filntisis
A. Roussos
Petros Maragos
CVBM
66
3
0
11 Dec 2023
FT2TF: First-Person Statement Text-To-Talking Face Generation
Xingjian Diao
Ming Cheng
Wayner Barrios
SouYoung Jin
105
12
0
09 Dec 2023
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation
Zhichao Wang
M. Dai
Keld Lundgaard
VGen
DiffM
78
2
0
12 Aug 2023
Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
J. Choi
Minsu Kim
Se Jin Park
Y. Ro
CVBM
50
4
0
28 Jun 2023
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces
Ziqiao Peng
Yihao Luo
Yue Shi
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
135
45
0
19 Jun 2023
GesGPT: Speech Gesture Synthesis With Text Parsing from ChatGPT
Nan Gao
Zeyu Zhao
Zhi Zeng
Shuwu Zhang
Dongdong Weng
Yihua Bao
81
8
0
23 Mar 2023
UniFLG: Unified Facial Landmark Generator from Text or Speech
Kentaro Mitsui
Yukiya Hono
Kei Sawada
CVBM
54
7
0
28 Feb 2023
Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
S. Ravichandran
Ondrej Texler
Dimitar Dinev
Hyun Jae Kang
71
4
0
03 Sep 2022
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person
Xinsheng Wang
Qicong Xie
Jihua Zhu
Lei Xie
O. Scharenborg
120
19
0
09 Aug 2021
1