Modality Dropout for Improved Performance-driven Talking Faces

International Conference on Multimodal Interaction (ICMI), 2020
27 May 2020
Ahmed Hussen Abdelaziz
B. Theobald
Paul Dixon
Reinhard Knothe
N. Apostoloff
Sachin Kajarekar

Papers citing "Modality Dropout for Improved Performance-driven Talking Faces"

27 / 27 papers shown
Multimodal Negative Learning
Baoquan Gong
X. Gao
Q. Hu
Qinghua Hu
Bing Cao
152
1
0
23 Oct 2025
Learning Contrastive Multimodal Fusion with Improved Modality Dropout for Disease Detection and Prediction
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Yi Gu
Kuniaki Saito
Jiaxin Ma
196
2
0
22 Sep 2025
FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs
Md Mubtasim Ahasan
Rafat Hasan Khan
Tasnim Mohiuddin
Vasu Sharma
Tariq Iqbal
M. A. Amin
Amin Ahsan Ali
M. Islam
A. K. M. Mahbubur Rahman
334
1
0
14 Sep 2025
AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars
T. Zhang
Jian Zhao
Yuer Li
Zheng Zhu
Ping Hu
Zhaoxin Fan
Wenjun Wu
Xuelong Li
283
0
0
21 May 2025
On-the-fly Modulation for Balanced Multimodal Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
310
37
0
15 Oct 2024
MMP: Towards Robust Multi-Modal Learning with Masked Modality Projection
Niki Nezakati
Md Kaykobad Reza
Mashhour Solh
M. Salman Asif
460
6
0
03 Oct 2024
Dyadic Interaction Modeling for Social Behavior Generation
European Conference on Computer Vision (ECCV), 2024
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
437
28
0
14 Mar 2024
DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Junming Chen
Yunfei Liu
Jianan Wang
Ailing Zeng
Yu Li
Qifeng Chen
VGen
319
69
0
09 Jan 2024
LaughTalk: Expressive 3D Talking Head Generation with Laughter
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Kim Sung-Bin
Lee Hyun
Da Hye Hong
Suekyeong Nam
Janghoon Ju
Tae-Hyun Oh
315
33
0
02 Nov 2023
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
G. Krishna
Sameer Dharur
Oggi Rudovic
Pranay Dighe
Saurabh N. Adya
Ahmed Hussen Abdelaziz
Ahmed H. Tewfik
248
6
0
23 Oct 2023
What Makes for Robust Multi-Modal Models in the Face of Missing Modalities?
Siting Li
Chenzhuang Du
Yue Zhao
Yu Huang
Hang Zhao
252
6
0
10 Oct 2023
Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Md Kaykobad Reza
Ashley Prater-Bennette
M. Salman Asif
353
43
0
06 Oct 2023
Audio-Driven 3D Facial Animation from In-the-Wild Videos
Liying Lu
Tianke Zhang
Yunfei Liu
Xuangeng Chu
Yu Li
VGen
186
8
0
20 Jun 2023
Language-Guided Music Recommendation for Video via Prompt Analogies
Computer Vision and Pattern Recognition (CVPR), 2023
Daniel McKee
Justin Salamon
Josef Sivic
Bryan C. Russell
VGen
314
33
0
15 Jun 2023
AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2023
Aggelina Chatziagapi
Dimitris Samaras
3DH
CVBM
270
5
0
25 Apr 2023
Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Harsh Maheshwari
Yen-Cheng Liu
Z. Kira
209
34
0
21 Apr 2023
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
IEEE International Conference on Computer Vision (ICCV), 2023
Ziqiao Peng
Hao Wu
Zhenbo Song
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
CVBM
498
188
0
20 Mar 2023
Pose-Controllable 3D Facial Animation Synthesis using Hierarchical Audio-Vertex Attention
Yinan Han
Xiaolin K. Wei
Bo Li
Junjie Cao
Yunyu Lai
CVBM
222
3
0
24 Feb 2023
Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yaoming Zhu
Zewei Sun
Shanbo Cheng
Yuyang Huang
Liwei Wu
Mingxuan Wang
318
22
0
20 Dec 2022
Naturalistic Head Motion Generation from Speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Trisha Mittal
Zakaria Aldeneh
Masha Fedzechkina
Anurag Ranjan
B. Theobald
219
1
0
26 Oct 2022
On the role of Lip Articulation in Visual Speech Perception
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zakaria Aldeneh
Masha Fedzechkina
Skyler Seto
Katherine Metcalf
Miguel Sarabia
N. Apostoloff
B. Theobald
258
2
0
18 Mar 2022
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
Computer Vision and Pattern Recognition (CVPR), 2022
Saghir Alfasly
Jian Lu
C. Xu
Yuru Zou
413
30
0
06 Mar 2022
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
International Conference on Learning Representations (ICLR), 2022
Bowen Shi
Wei-Ning Hsu
Kushal Lakhotia
Abdel-rahman Mohamed
SSL
465
447
0
05 Jan 2022
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Yingruo Fan
Mohammad Kachuee
Jun Saito
Wenping Wang
Taku Komura
CVBM
893
287
0
10 Dec 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization
Computer Vision and Pattern Recognition (CVPR), 2021
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
300
116
0
08 Jun 2021
Improved Lite Audio-Visual Speech Enhancement
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Shang-Yi Chuang
Hsin-Min Wang
Yu Tsao
401
44
0
30 Aug 2020
Audiovisual Speech Synthesis using Tacotron2
International Conference on Multimodal Interaction (ICMI), 2020
Ahmed Hussen Abdelaziz
Anushree Prasanna Kumar
Chloe Seivwright
Gabriele Fanelli
Justin Binder
Y. Stylianou
S. Kajarekar
263
18
0
03 Aug 2020