Text-based Editing of Talking-head Video

4 June 2019
Ohad Fried
A. Tewari
Michael Zollhöfer
Adam Finkelstein
Eli Shechtman
Dan B. Goldman
Kyle Genova
Zeyu Jin
Christian Theobalt
Maneesh Agrawala
    VGen

Papers citing "Text-based Editing of Talking-head Video"

50 / 129 papers shown
Title
FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
Shiyan Liu
Rui Qu
Yan Jin
88
0
0
06 Apr 2025
Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation
Baptiste Chopin
Tashvik Dhamija
P. Balaji
Yaohui Wang
A. Dantcheva
DiffM, VGen
113
0
0
24 Feb 2025
Lotus: Creating Short Videos From Long Videos With Abstractive and Extractive Summarization
Aadit Barua
Karim Benharrak
Meng Chen
Mina Huh
Amy Pavel
DiffM, VLM
76
3
0
10 Feb 2025
Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search
Hengzhu Tang
Zefeng Zhang
Zhiping Li
Zhenyu Zhang
Xing Wu
Li Gao
Suqi Cheng
D. Yin
113
1
0
09 Feb 2025
StoryNavi: On-Demand Narrative-Driven Reconstruction of Video Play With Generative AI
Alston Lantian Xu
Tianwei Ma
Tianmeng Liu
Can Liu
Alvaro Cassinelli
VGen
91
0
0
04 Oct 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
108
12
0
14 Sep 2024
Content and Style Aware Audio-Driven Facial Animation
Qingju Liu
Hyeongwoo Kim
Gaurav Bharaj
DiffM
84
1
0
13 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Chaolei Tan
Zihang Lin
Junfu Pu
Zhongang Qi
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
88
0
0
03 Aug 2024
Text-based Talking Video Editing with Cascaded Conditional Diffusion
Bo Han
Heqing Zou
Haoyang Li
Guangcong Wang
Chng Eng Siong
VGen, DiffM
101
2
0
20 Jul 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
99
2
0
15 Jun 2024
A Large-scale Universal Evaluation Benchmark For Face Forgery Detection
Yijun Bei
Hengrui Lou
Jinsong Geng
Erteng Liu
Lechao Cheng
Jie Song
Mingli Song
Zunlei Feng
CVBM
127
0
0
13 Jun 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yi Ma
Hongyu Liu
Haobo Wang
Heng Pan
Yingqing He
...
Ailing Zeng
Chengfei Cai
H. Shum
Wen Liu
Qifeng Chen
130
61
0
04 Jun 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
88
10
0
16 May 2024
Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet
Gazi Hasin Ishrak
Zalish Mahmud
Md. Zami Al Zunaed Farabe
Tahera Khanom Tinni
Tanzim Reza
Mohammad Zavid Parvez
81
3
0
19 Apr 2024
Superior and Pragmatic Talking Face Generation with Teacher-Student Framework
Chao Liang
Jianwen Jiang
Tianyun Zhong
Gaojie Lin
Zhengkun Rong
Jiaqi Yang
Yongming Zhu
101
1
0
26 Mar 2024
ExpressEdit: Video Editing with Natural Language and Sketching
Bekzat Tilekbay
Saelyne Yang
M. Lewkowicz
Alex Suryapranata
Juho Kim
DiffM, VGen
50
10
0
26 Mar 2024
Dyadic Interaction Modeling for Social Behavior Generation
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
95
8
0
14 Mar 2024
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing
Bryan Wang
Yuliang Li
Zhaoyang Lv
Haijun Xia
Yan Xu
Raj Sodhi
92
53
0
15 Feb 2024
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
123
23
0
15 Dec 2023
InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars
Xiaochen Zhao
Jingxiang Sun
Lizhen Wang
Jinli Suo
Yebin Liu
DiffM
97
3
0
03 Dec 2023
PodReels: Human-AI Co-Creation of Video Podcast Teasers
Sitong Wang
Zheng Ning
Anh Truong
Mira Dontcheva
Dingzeyu Li
Lydia B. Chilton
77
21
0
10 Nov 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
37
3
0
28 Sep 2023
Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Linsen Song
Wayne Wu
Chaoyou Fu
Chen Change Loy
Ran He
89
12
0
31 Aug 2023
MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
DiffM
60
10
0
31 Aug 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
J. Barnett
86
32
0
07 Jul 2023
Unsupervised Learning of Style-Aware Facial Animation from Real Acting Performances
Wolfgang Paier
Anna Hilsmann
Peter Eisert
3DH
88
10
0
16 Jun 2023
NPVForensics: Jointing Non-critical Phonemes and Visemes for Deepfake Detection
Yu Chen
Yang Yu
R. Ni
Yao-Min Zhao
Haoliang Li
72
3
0
12 Jun 2023
Efficient Spoken Language Recognition via Multilabel Classification
Oriol Nieto
Zeyu Jin
Franck Dernoncourt
Justin Salamon
56
1
0
02 Jun 2023
ReactFace: Multiple Appropriate Facial Reaction Generation in Dyadic Interactions
Cheng Luo
Siyang Song
Weicheng Xie
Micol Spitale
Linlin Shen
Hatice Gunes
CVBM, DiffM
57
9
0
25 May 2023
MusicFace: Music-driven Expressive Singing Face Synthesis
Peng Liu
W. Deng
Hengda Li
Jintai Wang
Yinglin Zheng
Yiwei Ding
Xiaohu Guo
Ming Zeng
CVBM
73
12
0
24 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
186
170
0
21 Mar 2023
DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video
Zhimeng Zhang
Zhipeng Hu
W. Deng
Changjie Fan
Tangjie Lv
Yu-qiong Ding
3DH, CVBM
110
67
0
07 Mar 2023
AutoMatch: A Large-scale Audio Beat Matching Benchmark for Boosting Deep Learning Assistant Video Editing
Sen Pei
Jingya Yu
Qi Chen
Wozhou He
61
3
0
03 Mar 2023
AVscript: Accessible Video Editing with Audio-Visual Scripts
Mina Huh
Saelyne Yang
Yi-Hao Peng
Xiang 'Anthony' Chen
Young-Ho Kim
Amy Pavel
74
34
0
27 Feb 2023
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
Youxin Pang
Yong Zhang
Weize Quan
Yanbo Fan
Xiaodong Cun
Ying Shan
Dong-ming Yan
VGen
85
37
0
16 Jan 2023
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Jinbo Xing
Menghan Xia
Yuechen Zhang
Xiaodong Cun
Jue Wang
T. Wong
114
149
0
06 Jan 2023
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Yifeng Ma
Suzhen Wang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Zhidong Deng
Xin Yu
125
89
0
03 Jan 2023
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Bo Zhang
Chenyang Qi
Pan Zhang
Bo Zhang
Hsiang-Tao Wu
Dong Chen
Qifeng Chen
Yong Wang
Fang Wen
117
59
0
15 Dec 2022
VideoMap: Supporting Video Editing Exploration, Brainstorming, and Prototyping in the Latent Space
David Chuan-En Lin
Fabian Caba Heilbron
Joon-Young Lee
Oliver Wang
Nikolas Martelaro
VGen
83
4
0
22 Nov 2022
Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars
Jingxiang Sun
Xuanxia Wang
Lizhen Wang
Xiaoyu Li
Yong Zhang
Hongwen Zhang
Yebin Liu
3DH
109
119
0
21 Nov 2022
Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching
Frank Yu
S. Fels
Helge Rhodin
3DH
55
0
0
10 Nov 2022
Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows
Anyi Rao
Xuekun Jiang
Sichen Wang
Yuwei Guo
Zihao Liu
Bo Dai
Long Pang
Xiaoyu Wu
Dahua Lin
Libiao Jin
96
6
0
17 Oct 2022
AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars
Yue Wu
Yu Deng
Jiaolong Yang
Fangyun Wei
Qifeng Chen
Xin Tong
3DH, CVBM
77
51
0
12 Oct 2022
Match Cutting: Finding Cuts with Smooth Visual Transitions
Boris Chen
Amir Ziai
Rebecca Tucker
Yuchen Xie
VGen
100
14
0
11 Oct 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
83
14
0
29 Aug 2022
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Aditya Agarwal
Bipasha Sen
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
110
0
0
21 Aug 2022
Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors
Sindhu B. Hegde
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
CVBM
59
1
0
17 Aug 2022
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
Shuai Shen
Wanhua Li
Zhengbiao Zhu
Yueqi Duan
Jie Zhou
Jiwen Lu
CVBM
87
109
0
24 Jul 2022
Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model
Xiaolin Chen
Xuemeng Song
Liqiang Jing
Shuo Li
Linmei Hu
Liqiang Nie
VLM
79
23
0
16 Jul 2022
NARRATE: A Normal Assisted Free-View Portrait Stylizer
Youjia Wang
Teng Xu
Yiwen Wu
Minzhang Li
Wenzheng Chen
Lan Xu
Jingyi Yu
DiffM, 3DH
71
4
0
03 Jul 2022