Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.06990
Cited By
LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
16 October 2018
Shuang Yang
Yuanhang Zhang
Dalu Feng
Mingmin Yang
Chenhao Wang
Jingyun Xiao
Keyu Long
Shiguang Shan
Xilin Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild"
27 / 27 papers shown
Title
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
54
0
0
07 May 2025
Development and evaluation of a deep learning algorithm for German word recognition from lip movements
Dinh Nam Pham
Torsten Rahne
55
3
0
22 Apr 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Jeong Hun Yeo
Chae Won Kim
Hyunjun Kim
Hyeongseop Rha
Seunghee Han
Wen-Huang Cheng
Y. Ro
59
3
0
03 Jan 2025
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications
Shreyank N. Gowda
Dheeraj Pandey
Shashank Narayana Gowda
54
3
0
30 Aug 2023
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge
Minsu Kim
Jeong Hun Yeo
J. Choi
Y. Ro
34
16
0
18 Aug 2023
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition
Jeong Hun Yeo
Minsu Kim
Y. Ro
27
11
0
08 May 2023
Word-level Persian Lipreading Dataset
J. Peymanfard
Ali Lashini
Samin Heydarian
Hossein Zeinali
N. Mozayani
33
5
0
08 Apr 2023
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis
Haoxu Wang
Ming Cheng
Qiang Fu
Ming Li
39
8
0
04 Mar 2023
VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Aoting Zhang
He Wang
Pengcheng Guo
Yihui Fu
Linfu Xie
Yingying Gao
Shilei Zhang
Junlan Feng
18
4
0
27 Feb 2023
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
J. Peymanfard
Samin Heydarian
Ali Lashini
Hossein Zeinali
Mohammad Reza Mohammadi
N. Mozayani
32
10
0
21 Jan 2023
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset
J. Park
Jung-Wook Hwang
Kwanghee Choi
Seung-Hyun Lee
Jun-Hwan Ahn
R.-H. Park
Hyung-Min Park
29
3
0
16 Jan 2023
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading
Minsu Kim
Jeong Hun Yeo
Yong Man Ro
13
61
0
04 Apr 2022
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
Minsu Kim
Joanna Hong
Se Jin Park
Yong Man Ro
CVBM
25
40
0
04 Apr 2022
A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning
Gerald Schwiebert
C. Weber
Leyuan Qu
Henrique Siqueira
S. Wermter
32
12
0
27 Feb 2022
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading
Leyuan Qu
C. Weber
S. Wermter
38
23
0
09 Dec 2021
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading
Shahd Elashmawy
Marian M. Ramsis
Hesham M. Eraqi
Farah Eldeshnawy
Hadeel Mabrouk
Omar Abugabal
Nourhan Sakr
35
1
0
07 Aug 2021
Exploring Deep Learning for Joint Audio-Visual Lip Biometrics
Meng Liu
Longbiao Wang
Kong Aik Lee
Hanyi Zhang
Chang Zeng
J. Dang
HAI
30
12
0
17 Apr 2021
Learn an Effective Lip Reading Model without Pains
Dalu Feng
Shuang Yang
Shiguang Shan
Xilin Chen
30
61
0
15 Nov 2020
Lip-reading with Densely Connected Temporal Convolutional Networks
Pingchuan Ma
Yujiang Wang
Jie Shen
Stavros Petridis
M. Pantic
22
56
0
29 Sep 2020
Towards Practical Lipreading with Distilled and Efficient Models
Pingchuan Ma
Brais Martínez
Stavros Petridis
M. Pantic
26
95
0
13 Jul 2020
Discriminative Multi-modality Speech Recognition
Bo Xu
Cheng Lu
Yandong Guo
Jacob Wang
26
98
0
12 May 2020
Mutual Information Maximization for Effective Lip Reading
Xingyuan Zhao
Shuang Yang
Shiguang Shan
Xilin Chen
24
58
0
13 Mar 2020
Deformation Flow Based Two-Stream Network for Lip Reading
Jingyun Xiao
Shuang Yang
Yuanhang Zhang
Shiguang Shan
Xilin Chen
25
64
0
12 Mar 2020
Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Mingshuang Luo
Shuang Yang
Shiguang Shan
Xilin Chen
27
41
0
09 Mar 2020
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition
Yuanhang Zhang
Shuang Yang
Jingyun Xiao
Shiguang Shan
Xilin Chen
18
64
0
06 Mar 2020
A Cascade Sequence-to-Sequence Model for Chinese Mandarin Lip Reading
Ya Zhao
Rui Xu
Xiuming Zhang
27
62
0
14 Aug 2019
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
185
784
0
16 Nov 2016
1