Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.03998
Cited By
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network
9 March 2020
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving noise robust automatic speech recognition with single-channel time-domain enhancement network"
36 / 36 papers shown
Title
Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition
Korbinian Kuhn
Verena Kersken
Gottfried Zimmermann
40
0
0
19 Mar 2025
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
Nian Shao
Rui Zhou
Pengyu Wang
Xian Li
Ying Fang
Yujie Yang
Xiaofei Li
39
0
0
27 Feb 2025
Measuring the Accuracy of Automatic Speech Recognition Solutions
Korbinian Kuhn
Verena Kersken
Benedikt Reuter
Niklas Egger
Gottfried Zimmermann
27
19
0
29 Aug 2024
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance
M. Milling
Shuo Liu
Andreas Triantafyllopoulos
Ilhan Aslan
Björn W. Schuller
29
2
0
12 Aug 2024
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement
Chenda Li
Samuele Cornell
Shinji Watanabe
Yanmin Qian
DiffM
29
2
0
19 Jun 2024
Stability Evaluation via Distributional Perturbation Analysis
Jose H. Blanchet
Peng Cui
Jiajin Li
Jiashuo Liu
38
0
0
06 May 2024
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Tsubasa Ochiai
Kazuma Iwamoto
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
Shoko Araki
Shigeru Katagiri
29
2
0
23 Apr 2024
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
27
0
0
19 Apr 2024
Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
44
4
0
11 Mar 2024
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
27
6
0
07 Dec 2023
D4AM: A General Denoising Framework for Downstream Acoustic Models
H. Wang
Yu Tsao
Hsin-Min Wang
Chu-Song Chen
13
4
0
28 Nov 2023
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection
Cunhang Fan
Mingming Ding
Jianhua Tao
Ruibo Fu
Jiangyan Yi
Zhengqi Wen
Zhao Lv
37
4
0
13 Oct 2023
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
Philippe Gonzalez
T. S. Alstrøm
Tobias May
17
13
0
12 Sep 2023
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Iván Vallés-Pérez
Grzegorz Beringer
Piotr Bilinski
G. Cook
Roberto Barra-Chicote
16
1
0
23 Jul 2023
Improving the Intent Classification accuracy in Noisy Environment
Mohamed Nabih Ali
A. Brutti
Daniele Falavigna
16
0
0
12 Mar 2023
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
T. Toda
27
4
0
02 Nov 2022
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Catalin Zorila
R. Doddipatla
16
11
0
09 May 2022
On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
22
4
0
03 May 2022
Mask scalar prediction for improving robust automatic speech recognition
A. Narayanan
James Walker
S. Panchapagesan
N. Howard
Yuma Koizumi
11
4
0
26 Apr 2022
Chain-based Discriminative Autoencoders for Speech Recognition
Hung-Shin Lee
Pin-Tuan Huang
Yao-Fei Cheng
Hsin-Min Wang
11
1
0
25 Mar 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
26
35
0
16 Feb 2022
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement
Kuansan Wang
Kai-Chun Liu
Hsin-Min Wang
Yu Tsao
30
1
0
14 Feb 2022
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR
Kazuma Iwamoto
Tsubasa Ochiai
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
S. Araki
S. Katagiri
22
57
0
18 Jan 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
31
90
0
05 Jan 2022
SNRi Target Training for Joint Speech Enhancement and Recognition
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
25
14
0
01 Nov 2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Y. Qian
19
22
0
27 Oct 2021
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Yiming Wang
Jinyu Li
Heming Wang
Yao Qian
Chengyi Wang
Yu Wu
35
47
0
11 Oct 2021
Embedding and Beamforming: All-neural Causal Beamformer for Multichannel Speech Enhancement
Andong Li
Wenzhe Liu
C. Zheng
Xiaodong Li
12
55
0
01 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR
Fu-An Chao
J. Hung
Berlin Chen
8
7
0
26 Aug 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Duo Ma
Nana Hou
Van Tung Pham
Haihua Xu
Chng Eng Siong
25
22
0
22 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
11
9
0
30 Mar 2021
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Shota Orihashi
Ryo Masumura
22
8
0
02 Mar 2021
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Yen-Ju Lu
Chia-Yu Chang
Cheng Yu
Ching-Feng Liu
J. Hung
Shinji Watanabe
Yu Tsao
19
4
0
15 Nov 2020
Dual Application of Speech Enhancement for Automatic Speech Recognition
Ashutosh Pandey
Chunxi Liu
Yun Wang
Yatharth Saraf
33
37
0
07 Nov 2020
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks
Yuichiro Koyama
Tyler Vuong
Stefan Uhlich
Bhiksha Raj
12
41
0
23 May 2020
1