ResearchTrend.AI
LCANet: End-to-End Lipreading with Cascaded Attention-CTC


13 March 2018
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang

Papers citing "LCANet: End-to-End Lipreading with Cascaded Attention-CTC"

11 / 11 papers shown
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
49
0
0
07 May 2025
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
Zhuojiang Cai
Yuhan Ma
Feng Lu
22
0
0
26 Jan 2024
Show Me Your Face, And I'll Tell You How You Speak
Christen Millerdurai
L. A. Khaliq
Timon Ulrich
CVBM
60
0
0
28 Jun 2022
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading
Leyuan Qu
C. Weber
S. Wermter
28
23
0
09 Dec 2021
"Notic My Speech" -- Blending Speech Patterns With Multimedia
Dhruva Sahrawat
Yaman Kumar Singla
Shashwat Aggarwal
Yifang Yin
R. Shah
Roger Zimmermann
17
3
0
12 Jun 2020
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition
Yuanhang Zhang
Shuang Yang
Jingyun Xiao
Shiguang Shan
Xilin Chen
10
64
0
06 Mar 2020
Audio-Visual Decision Fusion for WFST-based and seq2seq Models
R. Aralikatti
Sharad Roy
Abhinav Thanda
D. Margam
Pujitha Appan Kandala
Tanay Sharma
S. Venkatesan
17
1
0
29 Jan 2020
Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks
Xiang Li
Xiaolin Hu
Jian Yang
19
193
0
23 May 2019
Zero-shot keyword spotting for visual speech recognition in-the-wild
Themos Stafylakis
Georgios Tzimiropoulos
25
38
0
23 Jul 2018
Large-Scale Visual Speech Recognition
Brendan Shillingford
Yannis Assael
Matthew W. Hoffman
T. Paine
Cían Hughes
...
Marie Mulville
Ben Coppin
Ben Laurie
A. Senior
Nando de Freitas
24
152
0
13 Jul 2018
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
784
0
16 Nov 2016