ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.05709
  4. Cited By
Deformation Flow Based Two-Stream Network for Lip Reading
v1v2 (latest)

Deformation Flow Based Two-Stream Network for Lip Reading

IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2020
12 March 2020
Jingyun Xiao
Shuang Yang
Yuanhang Zhang
Shiguang Shan
Xilin Chen
ArXiv (abs)PDFHTML

Papers citing "Deformation Flow Based Two-Stream Network for Lip Reading"

23 / 23 papers shown
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
456
8
0
07 May 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and LanguageAAAI Conference on Artificial Intelligence (AAAI), 2024
Jeong Hun Yeo
Chae Won Kim
Hyunjun Kim
Hyeongseop Rha
Seunghee Han
Wen-Huang Cheng
Y. Ro
540
6
0
03 Jan 2025
RAL:Redundancy-Aware Lipreading Model Based on Differential Learning
  with Symmetric Views
RAL:Redundancy-Aware Lipreading Model Based on Differential Learning with Symmetric Views
Zejun gu
Junxia jiang
331
0
0
09 Sep 2024
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading
Wenhao Zhang
Jun Wang
Yong Luo
Lei Yu
Wei Yu
Zheng He
Jialie Shen
423
7
0
18 Apr 2024
Learning Separable Hidden Unit Contributions for Speaker-Adaptive
  Lip-Reading
Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading
Songtao Luo
Shuang Yang
Shiguang Shan
Xilin Chen
356
3
0
08 Oct 2023
Lip Reading for Low-resource Languages by Learning and Combining General
  Speech Knowledge and Language-specific Knowledge
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific KnowledgeIEEE International Conference on Computer Vision (ICCV), 2023
Minsu Kim
Jeong Hun Yeo
J. Choi
Y. Ro
288
31
0
18 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and
  Generation
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
226
9
0
17 Aug 2023
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by
  Compressing Audio Knowledge of a Pretrained Model
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained ModelIEEE transactions on multimedia (IEEE TMM), 2023
Jeong Hun Yeo
Minsu Kim
J. Choi
Dae Hoe Kim
Y. Ro
275
27
0
15 Aug 2023
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition
Multi-Temporal Lip-Audio Memory for Visual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jeong Hun Yeo
Minsu Kim
Y. Ro
210
18
0
08 May 2023
RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network
RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing NetworkAdvanced Intelligent Systems (Adv. Intell. Syst.), 2023
Sangmin Yoo
Eric Lee
Ziyu Wang
Xinxin Wang
Wei D. Lu
490
12
0
19 Mar 2023
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech
  Recognition
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Minsu Kim
Hyungil Kim
Y. Ro
VLM
309
33
0
16 Feb 2023
Speaker-adaptive Lip Reading with User-dependent Padding
Speaker-adaptive Lip Reading with User-dependent PaddingEuropean Conference on Computer Vision (ECCV), 2022
Minsu Kim
Hyunjun Kim
Y. Ro
171
31
0
09 Aug 2022
Lip-Listening: Mixing Senses to Understand Lips using Cross Modality
  Knowledge Distillation for Word-Based Models
Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models
Hadeel Mabrouk
Omar Abugabal
Nourhan Sakr
Hesham M. Eraqi
VLM
203
2
0
05 Jun 2022
Deep Learning for Visual Speech Analysis: A Survey
Deep Learning for Visual Speech Analysis: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Yike Guo
Xin Xu
M. Pietikäinen
Tianpeng Liu
VLM
394
56
0
22 May 2022
Lip to Speech Synthesis with Visual Context Attentional GAN
Lip to Speech Synthesis with Visual Context Attentional GANNeural Information Processing Systems (NeurIPS), 2022
Minsu Kim
Joanna Hong
Y. Ro
358
70
0
04 Apr 2022
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip
  Reading
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip ReadingAAAI Conference on Artificial Intelligence (AAAI), 2022
Minsu Kim
Jeong Hun Yeo
Yong Man Ro
280
85
0
04 Apr 2022
Multi-modality Associative Bridging through Memory: Speech Sound
  Recollected from Face Video
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face VideoIEEE International Conference on Computer Vision (ICCV), 2021
Minsu Kim
Joanna Hong
Se Jin Park
Yong Man Ro
CVBM
200
48
0
04 Apr 2022
Advances and Challenges in Deep Lip Reading
Advances and Challenges in Deep Lip Reading
Marzieh Oghbaie
Arian Sabaghi
Kooshan Hashemifard
Mohammad Akbari
VLM
193
17
0
15 Oct 2021
LRWR: Large-Scale Benchmark for Lip Reading in Russian language
LRWR: Large-Scale Benchmark for Lip Reading in Russian language
E. Egorov
Vasily Kostyumov
M. Konyk
Sergey Kolesnikov
221
11
0
14 Sep 2021
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip
  Reading
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip ReadingInternational Joint Conference on Computational Intelligence (IJCCI), 2021
Shahd Elashmawy
Marian M. Ramsis
Hesham M. Eraqi
Farah Eldeshnawy
Hadeel Mabrouk
Omar Abugabal
Nourhan Sakr
248
1
0
07 Aug 2021
Contrastive Learning of Global-Local Video Representations
Contrastive Learning of Global-Local Video Representations
Shuang Ma
Zhaoyang Zeng
Daniel J. McDuff
Yale Song
SSL
261
9
0
07 Apr 2021
Learn an Effective Lip Reading Model without Pains
Learn an Effective Lip Reading Model without Pains
Dalu Feng
Shuang Yang
Shiguang Shan
Xilin Chen
298
70
0
15 Nov 2020
Synchronous Bidirectional Learning for Multilingual Lip Reading
Synchronous Bidirectional Learning for Multilingual Lip Reading
Mingshuang Luo
Shuang Yang
Xilin Chen
Zitao Liu
Shiguang Shan
214
19
0
08 May 2020
1
Page 1 of 1