ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.02599
  4. Cited By
VSEGAN: Visual Speech Enhancement Generative Adversarial Network
v1v2 (latest)

VSEGAN: Visual Speech Enhancement Generative Adversarial Network

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
4 February 2021
Xinmeng Xu
Yang Wang
Dongxiang Xu
Yiyuan Peng
Cong Zhang
Jie Jia
Binbin Chen
    GAN
ArXiv (abs)PDFHTML

Papers citing "VSEGAN: Visual Speech Enhancement Generative Adversarial Network"

6 / 6 papers shown
AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement
AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement
M. Sajid
Deepanshu Gupta
Yash Modi
Sanskriti Jain
Harshith Jai Surya Ganji
A. Rahaman
Harshvardhan Choudhary
Nasir Saleem
Amir Hussain
M. Tanveer
151
0
0
06 Oct 2025
Vision-Integrated High-Quality Neural Speech Coding
Vision-Integrated High-Quality Neural Speech Coding
Yao Guo
Yang Ai
Rui Zheng
Hui-Peng Du
Xiao-Hang Jiang
Zhen-Hua Ling
277
0
0
29 May 2025
Incorporating Ultrasound Tongue Images for Audio-Visual Speech
  Enhancement through Knowledge Distillation
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge DistillationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Ruixin Zheng
Yang Ai
Zhenhua Ling
314
17
0
24 May 2023
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural VocodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rodrigo Mira
Buye Xu
Jacob Donley
Anurag Kumar
Stavros Petridis
V. Ithapu
Maja Pantic
251
21
0
20 Nov 2022
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
VoViT: Low Latency Graph-based Audio-Visual Voice Separation TransformerEuropean Conference on Computer Vision (ECCV), 2022
Juan F. Montesinos
V. S. Kadandale
G. Haro
ViT
350
25
0
08 Mar 2022
Multi-layer Feature Fusion Convolution Network for Audio-visual Speech
  Enhancement
Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement
Xinmeng Xu
Jia Hao
261
1
0
15 Jan 2021
1
Page 1 of 1