Deep Learning for Visual Speech Analysis: A Survey


22 May 2022
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
    VLM

Papers citing "Deep Learning for Visual Speech Analysis: A Survey"

14 / 14 papers shown
OXSeg: Multidimensional attention UNet-based lip segmentation using semi-supervised lip contours
Hanie Moghaddasi
Christina Chambers
Sarah N. Mattson
Jeffrey R. Wozniak
Claire D. Coles
Raja Mukherjee
Michael Suttie
19
0
0
08 May 2025
CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking
Yiming Li
Kaiying Yan
Shuo Shao
Tongqing Zhai
Shu-Tao Xia
Z. Qin
D. Tao
AAML
89
0
0
02 Mar 2025
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning
William A. Stigall
45
0
0
14 Oct 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM
3DH
29
6
0
19 Apr 2024
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading
Wenhao Zhang
Jun Wang
Yong Luo
Lei Yu
Wei Yu
Zheng He
Jialie Shen
35
0
0
18 Apr 2024
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
Suzhe Wang
Lincheng Li
Yueqing Ding
Xin Yu
CVBM
59
117
0
06 Dec 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
224
0
12 Feb 2021
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
M. Pantic
168
238
0
23 Jan 2020
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
214
2,224
0
14 Jun 2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Z. Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
204
819
0
12 Jun 2018
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
948
20,471
0
17 Apr 2017
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
212
19,387
0
21 Nov 2016
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
783
0
16 Nov 2016
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
201
7,884
0
13 Jun 2015