ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.02966
  4. Cited By
You said that?

You said that?

8 May 2017
Joon Son Chung
A. Jamaludin
Andrew Zisserman
    CVBM
ArXivPDFHTML

Papers citing "You said that?"

50 / 58 papers shown
Title
EMOdiffhead: Continuously Emotional Control in Talking Head Generation
  via Diffusion
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion
Jian Zhang
Weijian Mai
Zhijun Zhang
VGen
32
0
0
11 Sep 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM
3DH
34
6
0
19 Apr 2024
Audio-Driven 3D Facial Animation from In-the-Wild Videos
Audio-Driven 3D Facial Animation from In-the-Wild Videos
Liying Lu
Tianke Zhang
Yunfei Liu
Xuangeng Chu
Yu Li
VGen
44
3
0
20 Jun 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic
  Supervision
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
49
19
0
30 Mar 2023
Imitator: Personalized Speech-driven 3D Facial Animation
Imitator: Personalized Speech-driven 3D Facial Animation
Balamurugan Thambiraja
I. Habibie
S. Aliakbarian
Darren Cosker
Christian Theobalt
Justus Thies
CVBM
44
49
0
30 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Yating Xu
Conghui Hu
G. Lee
VGen
40
0
0
09 Dec 2022
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video
  Editing In the Wild
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
K. Cheng
Xiaodong Cun
Yong Zhang
Menghan Xia
Fei Yin
Mingrui Zhu
Xuanxia Wang
Jue Wang
Nan Wang
CVBM
25
92
0
27 Nov 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial
  Decomposition
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Jiaxiang Tang
Kaisiyuan Wang
Hang Zhou
Xiaokang Chen
Dongliang He
Tianshu Hu
Jingtuo Liu
Gang Zeng
Jingdong Wang
3DH
34
76
0
22 Nov 2022
RARR: Researching and Revising What Language Models Say, Using Language
  Models
RARR: Researching and Revising What Language Models Say, Using Language Models
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILM
KELM
41
256
0
17 Oct 2022
Compressing Video Calls using Synthetic Talking Heads
Compressing Video Calls using Synthetic Talking Heads
Madhav Agarwal
Anchit Gupta
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
17
10
0
07 Oct 2022
Audio-Visual Face Reenactment
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
24
22
0
06 Oct 2022
Implicit Warping for Animation with Image Sets
Implicit Warping for Animation with Image Sets
Arun Mallya
Ting-Chun Wang
Xuan Li
VGen
116
41
0
04 Oct 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face
  Generation
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
47
13
0
29 Aug 2022
FlexLip: A Controllable Text-to-Lip System
FlexLip: A Controllable Text-to-Lip System
Dan Oneaţă
Beáta Lőrincz
Adriana Stan
H. Cucu
23
3
0
07 Jun 2022
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware
  Motion Model
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
Xinya Ji
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Wayne Wu
Feng Xu
Xun Cao
CVBM
54
157
0
30 May 2022
Emotion-Controllable Generalized Talking Face Generation
Emotion-Controllable Generalized Talking Face Generation
Sanjana Sinha
S. Biswas
Ravindra Yadav
Brojeshwar Bhowmick
CVBM
15
49
0
02 May 2022
Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation
  in the Wild
Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Gang Wang
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
CVBM
19
14
0
08 Mar 2022
Audio-Driven Talking Face Video Generation with Dynamic Convolution
  Kernels
Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels
Zipeng Ye
Mengfei Xia
Ran Yi
Juyong Zhang
Yu-Kun Lai
Xuanteng Huang
Guoxin Zhang
Yong-jin Liu
CVBM
22
39
0
16 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
29
48
0
27 Dec 2021
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Mohan Zhou
Yalong Bai
Wei Zhang
Ting Yao
T. Zhao
Tao Mei
EGVM
24
44
0
27 Dec 2021
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
CVBM
43
195
0
10 Dec 2021
Talking Head Generation with Audio and Speech Related Facial Action
  Units
Talking Head Generation with Audio and Speech Related Facial Action Units
Sen Chen
Zhilei Liu
Jiaxing Liu
Zhengxiang Yan
Longbiao Wang
CVBM
21
14
0
19 Oct 2021
Neural Dubber: Dubbing for Videos According to Scripts
Neural Dubber: Dubbing for Videos According to Scripts
Chenxu Hu
Qiao Tian
Tingle Li
Yuping Wang
Yuxuan Wang
Hang Zhao
DiffM
VGen
36
39
0
15 Oct 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Yuanxun Lu
Jinxiang Chai
Xun Cao
29
82
0
22 Sep 2021
Sparse to Dense Motion Transfer for Face Image Animation
Sparse to Dense Motion Transfer for Face Image Animation
Ruiqi Zhao
Tianyi Wu
Guodong Guo
3DH
CVBM
27
27
0
01 Sep 2021
Multi-modality Deep Restoration of Extremely Compressed Face Videos
Multi-modality Deep Restoration of Extremely Compressed Face Videos
Xi Zhang
Xiaolin Wu
CVBM
19
13
0
05 Jul 2021
NWT: Towards natural audio-to-video generation with representation
  learning
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
23
12
0
08 Jun 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from
  Video using Pose and Lighting Normalization
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
29
99
0
08 Jun 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized
  Audio-Visual Representation
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
Hang Zhou
Yasheng Sun
Wayne Wu
Chen Change Loy
Xiaogang Wang
Ziwei Liu
CVBM
28
360
0
22 Apr 2021
Can audio-visual integration strengthen robustness under multimodal
  attacks?
Can audio-visual integration strengthen robustness under multimodal attacks?
Yapeng Tian
Chenliang Xu
AAML
28
37
0
05 Apr 2021
Robust One Shot Audio to Video Generation
Robust One Shot Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
H. Mujtaba
VGen
27
13
0
14 Dec 2020
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
Ting-Chun Wang
Arun Mallya
Xuan Li
3DH
37
469
0
30 Nov 2020
Facial Keypoint Sequence Generation from Audio
Facial Keypoint Sequence Generation from Audio
Prateek Manocha
Prithwijit Guha
3DH
VGen
23
0
0
02 Nov 2020
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The
  Wild
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
EGVM
20
757
0
23 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
21
81
0
11 Aug 2020
Generative Adversarial Networks for Image and Video Synthesis:
  Algorithms and Applications
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
Xuan Li
Xun Huang
Jiahui Yu
Ting-Chun Wang
Arun Mallya
GAN
28
153
0
06 Aug 2020
Active Speakers in Context
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Guohao Li
19
61
0
20 May 2020
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep
  Visual Speech Recognition
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition
Yuanhang Zhang
Shuang Yang
Jingyun Xiao
Shiguang Shan
Xilin Chen
10
64
0
06 Mar 2020
Towards Automatic Face-to-Face Translation
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
31
172
0
01 Mar 2020
Audio-driven Talking Face Video Generation with Learning-based
  Personalized Head Pose
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong-jin Liu
CVBM
27
122
0
24 Feb 2020
A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News
  Anchors
A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News Anchors
Ruobing Zheng
Zhou Zhu
Bo Song
Changjiang Ji
3DH
19
2
0
20 Feb 2020
Deep Audio-Visual Learning: A Survey
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
31
156
0
14 Jan 2020
Vision-Infused Deep Audio Inpainting
Vision-Infused Deep Audio Inpainting
Hang Zhou
Ziwei Liu
Lingfeng Guo
Ping Luo
Dahua Lin
27
88
0
24 Oct 2019
Neural Style-Preserving Visual Dubbing
Neural Style-Preserving Visual Dubbing
Hyeongwoo Kim
Mohamed A. Elgharib
Michael Zollhöfer
Hans-Peter Seidel
Thabo Beeler
Christian Richardt
Christian Theobalt
VGen
22
93
0
05 Sep 2019
Multi-task Learning For Detecting and Segmenting Manipulated Facial
  Images and Videos
Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos
H. Nguyen
Fuming Fang
Junichi Yamagishi
Isao Echizen
AAML
CVBM
26
424
0
17 Jun 2019
Realistic Speech-Driven Facial Animation with GANs
Realistic Speech-Driven Facial Animation with GANs
Konstantinos Vougioukas
Stavros Petridis
M. Pantic
39
289
0
14 Jun 2019
Learning Individual Styles of Conversational Gesture
Learning Individual Styles of Conversational Gesture
Shiry Ginosar
Amir Bar
Gefen Kohavi
Caroline Chan
Andrew Owens
Jitendra Malik
SLR
18
326
0
10 Jun 2019
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Voice Mimicry Attacks Assisted by Automatic Speaker Verification
Ville Vestman
Tomi Kinnunen
Rosa González Hautamäki
Md. Sahidullah
26
37
0
03 Jun 2019
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent
  Generative Adversarial Networks
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks
Xiaoqi Jia
Jianwei Tai
Hang Zhou
Yakai Li
Weijuan Zhang
Haichao Du
Qingjia Huang
GAN
17
6
0
27 May 2019
Capsule-Forensics: Using Capsule Networks to Detect Forged Images and
  Videos
Capsule-Forensics: Using Capsule Networks to Detect Forged Images and Videos
H. Nguyen
Junichi Yamagishi
Isao Echizen
21
575
0
26 Oct 2018
12
Next