ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.02966
  4. Cited By
You said that?

You said that?

8 May 2017
Joon Son Chung
A. Jamaludin
Andrew Zisserman
    CVBM
ArXivPDFHTML

Papers citing "You said that?"

50 / 54 papers shown
Title
EMOdiffhead: Continuously Emotional Control in Talking Head Generation
  via Diffusion
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion
Jian Zhang
Weijian Mai
Zhijun Zhang
VGen
32
0
0
11 Sep 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM
3DH
34
6
0
19 Apr 2024
Audio-Driven 3D Facial Animation from In-the-Wild Videos
Audio-Driven 3D Facial Animation from In-the-Wild Videos
Liying Lu
Tianke Zhang
Yunfei Liu
Xuangeng Chu
Yu Li
VGen
44
3
0
20 Jun 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic
  Supervision
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
46
19
0
30 Mar 2023
Imitator: Personalized Speech-driven 3D Facial Animation
Imitator: Personalized Speech-driven 3D Facial Animation
Balamurugan Thambiraja
I. Habibie
S. Aliakbarian
Darren Cosker
Christian Theobalt
Justus Thies
CVBM
39
49
0
30 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Yating Xu
Conghui Hu
G. Lee
VGen
40
0
0
09 Dec 2022
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video
  Editing In the Wild
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
K. Cheng
Xiaodong Cun
Yong Zhang
Menghan Xia
Fei Yin
Mingrui Zhu
Xuanxia Wang
Jue Wang
Nan Wang
CVBM
25
92
0
27 Nov 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial
  Decomposition
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Jiaxiang Tang
Kaisiyuan Wang
Hang Zhou
Xiaokang Chen
Dongliang He
Tianshu Hu
Jingtuo Liu
Gang Zeng
Jingdong Wang
3DH
34
76
0
22 Nov 2022
RARR: Researching and Revising What Language Models Say, Using Language
  Models
RARR: Researching and Revising What Language Models Say, Using Language Models
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILM
KELM
41
256
0
17 Oct 2022
Compressing Video Calls using Synthetic Talking Heads
Compressing Video Calls using Synthetic Talking Heads
Madhav Agarwal
Anchit Gupta
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
17
10
0
07 Oct 2022
Audio-Visual Face Reenactment
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
24
22
0
06 Oct 2022
Implicit Warping for Animation with Image Sets
Implicit Warping for Animation with Image Sets
Arun Mallya
Ting-Chun Wang
Xuan Li
VGen
116
41
0
04 Oct 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face
  Generation
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
44
13
0
29 Aug 2022
FlexLip: A Controllable Text-to-Lip System
FlexLip: A Controllable Text-to-Lip System
Dan Oneaţă
Beáta Lőrincz
Adriana Stan
H. Cucu
23
3
0
07 Jun 2022
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware
  Motion Model
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
Xinya Ji
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Wayne Wu
Feng Xu
Xun Cao
CVBM
54
157
0
30 May 2022
Emotion-Controllable Generalized Talking Face Generation
Emotion-Controllable Generalized Talking Face Generation
Sanjana Sinha
S. Biswas
Ravindra Yadav
Brojeshwar Bhowmick
CVBM
13
49
0
02 May 2022
Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation
  in the Wild
Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Gang Wang
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
CVBM
19
14
0
08 Mar 2022
Audio-Driven Talking Face Video Generation with Dynamic Convolution
  Kernels
Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels
Zipeng Ye
Mengfei Xia
Ran Yi
Juyong Zhang
Yu-Kun Lai
Xuanteng Huang
Guoxin Zhang
Yong-jin Liu
CVBM
22
39
0
16 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
29
48
0
27 Dec 2021
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Mohan Zhou
Yalong Bai
Wei Zhang
Ting Yao
T. Zhao
Tao Mei
EGVM
22
44
0
27 Dec 2021
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
CVBM
43
195
0
10 Dec 2021
Talking Head Generation with Audio and Speech Related Facial Action
  Units
Talking Head Generation with Audio and Speech Related Facial Action Units
Sen Chen
Zhilei Liu
Jiaxing Liu
Zhengxiang Yan
Longbiao Wang
CVBM
21
14
0
19 Oct 2021
Neural Dubber: Dubbing for Videos According to Scripts
Neural Dubber: Dubbing for Videos According to Scripts
Chenxu Hu
Qiao Tian
Tingle Li
Yuping Wang
Yuxuan Wang
Hang Zhao
DiffM
VGen
36
39
0
15 Oct 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Yuanxun Lu
Jinxiang Chai
Xun Cao
29
82
0
22 Sep 2021
Sparse to Dense Motion Transfer for Face Image Animation
Sparse to Dense Motion Transfer for Face Image Animation
Ruiqi Zhao
Tianyi Wu
Guodong Guo
3DH
CVBM
27
27
0
01 Sep 2021
NWT: Towards natural audio-to-video generation with representation
  learning
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
21
12
0
08 Jun 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from
  Video using Pose and Lighting Normalization
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
27
99
0
08 Jun 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized
  Audio-Visual Representation
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
Hang Zhou
Yasheng Sun
Wayne Wu
Chen Change Loy
Xiaogang Wang
Ziwei Liu
CVBM
28
360
0
22 Apr 2021
Robust One Shot Audio to Video Generation
Robust One Shot Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
H. Mujtaba
VGen
27
13
0
14 Dec 2020
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
Ting-Chun Wang
Arun Mallya
Xuan Li
3DH
37
469
0
30 Nov 2020
Facial Keypoint Sequence Generation from Audio
Facial Keypoint Sequence Generation from Audio
Prateek Manocha
Prithwijit Guha
3DH
VGen
23
0
0
02 Nov 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
21
81
0
11 Aug 2020
Generative Adversarial Networks for Image and Video Synthesis:
  Algorithms and Applications
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
Xuan Li
Xun Huang
Jiahui Yu
Ting-Chun Wang
Arun Mallya
GAN
28
153
0
06 Aug 2020
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep
  Visual Speech Recognition
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition
Yuanhang Zhang
Shuang Yang
Jingyun Xiao
Shiguang Shan
Xilin Chen
10
64
0
06 Mar 2020
Towards Automatic Face-to-Face Translation
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
31
172
0
01 Mar 2020
Audio-driven Talking Face Video Generation with Learning-based
  Personalized Head Pose
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong-jin Liu
CVBM
27
122
0
24 Feb 2020
A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News
  Anchors
A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News Anchors
Ruobing Zheng
Zhou Zhu
Bo Song
Changjiang Ji
3DH
19
2
0
20 Feb 2020
Deep Audio-Visual Learning: A Survey
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
31
156
0
14 Jan 2020
Vision-Infused Deep Audio Inpainting
Vision-Infused Deep Audio Inpainting
Hang Zhou
Ziwei Liu
Lingfeng Guo
Ping Luo
Dahua Lin
27
88
0
24 Oct 2019
Neural Style-Preserving Visual Dubbing
Neural Style-Preserving Visual Dubbing
Hyeongwoo Kim
Mohamed A. Elgharib
Michael Zollhöfer
Hans-Peter Seidel
Thabo Beeler
Christian Richardt
Christian Theobalt
VGen
22
93
0
05 Sep 2019
Multi-task Learning For Detecting and Segmenting Manipulated Facial
  Images and Videos
Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos
H. Nguyen
Fuming Fang
Junichi Yamagishi
Isao Echizen
AAML
CVBM
26
424
0
17 Jun 2019
Realistic Speech-Driven Facial Animation with GANs
Realistic Speech-Driven Facial Animation with GANs
Konstantinos Vougioukas
Stavros Petridis
M. Pantic
39
289
0
14 Jun 2019
Learning Individual Styles of Conversational Gesture
Learning Individual Styles of Conversational Gesture
Shiry Ginosar
Amir Bar
Gefen Kohavi
Caroline Chan
Andrew Owens
Jitendra Malik
SLR
18
326
0
10 Jun 2019
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Ville Vestman
Tomi Kinnunen
Rosa González Hautamäki
Md. Sahidullah
26
37
0
03 Jun 2019
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent
  Generative Adversarial Networks
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks
Xiaoqi Jia
Jianwei Tai
Hang Zhou
Yakai Li
Weijuan Zhang
Haichao Du
Qingjia Huang
GAN
17
6
0
27 May 2019
Capsule-Forensics: Using Capsule Networks to Detect Forged Images and
  Videos
Capsule-Forensics: Using Capsule Networks to Detect Forged Images and Videos
H. Nguyen
Junichi Yamagishi
Isao Echizen
21
575
0
26 Oct 2018
Self-supervised learning of a facial attribute embedding from video
Self-supervised learning of a facial attribute embedding from video
Olivia Wiles
A. Sophia Koepke
Andrew Zisserman
CVBM
SSL
24
132
0
21 Aug 2018
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
224
2,234
0
14 Jun 2018
End-to-End Speech-Driven Facial Animation with Temporal GANs
End-to-End Speech-Driven Facial Animation with Temporal GANs
Konstantinos Vougioukas
Stavros Petridis
M. Pantic
CVBM
35
103
0
23 May 2018
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
Minyoung Huh
Andrew Liu
Andrew Owens
Alexei A. Efros
SSL
28
381
0
10 May 2018
12
Next