ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.03998
  4. Cited By
Improving noise robust automatic speech recognition with single-channel
  time-domain enhancement network

Improving noise robust automatic speech recognition with single-channel time-domain enhancement network

9 March 2020
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
ArXivPDFHTML

Papers citing "Improving noise robust automatic speech recognition with single-channel time-domain enhancement network"

36 / 36 papers shown
Title
Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition
Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition
Korbinian Kuhn
Verena Kersken
Gottfried Zimmermann
40
0
0
19 Mar 2025
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
Nian Shao
Rui Zhou
Pengyu Wang
Xian Li
Ying Fang
Yujie Yang
Xiaofei Li
39
0
0
27 Feb 2025
Measuring the Accuracy of Automatic Speech Recognition Solutions
Measuring the Accuracy of Automatic Speech Recognition Solutions
Korbinian Kuhn
Verena Kersken
Benedikt Reuter
Niklas Egger
Gottfried Zimmermann
27
19
0
29 Aug 2024
Audio Enhancement for Computer Audition -- An Iterative Training
  Paradigm Using Sample Importance
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance
M. Milling
Shuo Liu
Andreas Triantafyllopoulos
Ilhan Aslan
Björn W. Schuller
29
2
0
12 Aug 2024
Diffusion-based Generative Modeling with Discriminative Guidance for
  Streamable Speech Enhancement
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement
Chenda Li
Samuele Cornell
Shinji Watanabe
Yanmin Qian
DiffM
29
2
0
19 Jun 2024
Stability Evaluation via Distributional Perturbation Analysis
Stability Evaluation via Distributional Perturbation Analysis
Jose H. Blanchet
Peng Cui
Jiajin Li
Jiashuo Liu
41
0
0
06 May 2024
Rethinking Processing Distortions: Disentangling the Impact of Speech
  Enhancement Errors on Speech Recognition Performance
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Tsubasa Ochiai
Kazuma Iwamoto
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
Shoko Araki
Shigeru Katagiri
29
2
0
23 Apr 2024
TRNet: Two-level Refinement Network leveraging Speech Enhancement for
  Noise Robust Speech Emotion Recognition
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
27
0
0
19 Apr 2024
Towards Decoupling Frontend Enhancement and Backend Recognition in
  Monaural Robust ASR
Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
44
4
0
11 Mar 2024
Investigating the Design Space of Diffusion Models for Speech
  Enhancement
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
27
6
0
07 Dec 2023
D4AM: A General Denoising Framework for Downstream Acoustic Models
D4AM: A General Denoising Framework for Downstream Acoustic Models
H. Wang
Yu Tsao
Hsin-Min Wang
Chu-Song Chen
13
4
0
28 Nov 2023
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech
  Detection
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection
Cunhang Fan
Mingming Ding
Jianhua Tao
Ruibo Fu
Jiangyan Yi
Zhengqi Wen
Zhao Lv
37
4
0
13 Oct 2023
Assessing the Generalization Gap of Learning-Based Speech Enhancement
  Systems in Noisy and Reverberant Environments
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
Philippe Gonzalez
T. S. Alstrøm
Tobias May
17
13
0
12 Sep 2023
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic
  Spaces
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Iván Vallés-Pérez
Grzegorz Beringer
Piotr Bilinski
G. Cook
Roberto Barra-Chicote
16
1
0
23 Jul 2023
Improving the Intent Classification accuracy in Noisy Environment
Improving the Intent Classification accuracy in Noisy Environment
Mohamed Nabih Ali
A. Brutti
Daniele Falavigna
18
0
0
12 Mar 2023
Analysis of Noisy-target Training for DNN-based speech enhancement
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
T. Toda
27
4
0
02 Nov 2022
Speaker Reinforcement Using Target Source Extraction for Robust
  Automatic Speech Recognition
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Catalin Zorila
R. Doddipatla
16
11
0
09 May 2022
On monoaural speech enhancement for automatic recognition of real noisy
  speech using mixture invariant training
On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
22
4
0
03 May 2022
Mask scalar prediction for improving robust automatic speech recognition
Mask scalar prediction for improving robust automatic speech recognition
A. Narayanan
James Walker
S. Panchapagesan
N. Howard
Yuma Koizumi
11
4
0
26 Apr 2022
Chain-based Discriminative Autoencoders for Speech Recognition
Chain-based Discriminative Autoencoders for Speech Recognition
Hung-Shin Lee
Pin-Tuan Huang
Yao-Fei Cheng
Hsin-Min Wang
11
1
0
25 Mar 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with
  attention-in-attention transformer for monaural speech enhancement
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
26
35
0
16 Feb 2022
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement
Kuansan Wang
Kai-Chun Liu
Hsin-Min Wang
Yu Tsao
30
1
0
14 Feb 2022
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement
  Errors on ASR
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR
Kazuma Iwamoto
Tsubasa Ochiai
Marc Delcroix
Rintaro Ikeshita
Hiroshi Sato
S. Araki
S. Katagiri
22
57
0
18 Jan 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
31
90
0
05 Jan 2022
SNRi Target Training for Joint Speech Enhancement and Recognition
SNRi Target Training for Joint Speech Enhancement and Recognition
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
25
14
0
01 Nov 2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on
  Real and Simulation Conditions
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Y. Qian
19
22
0
27 Oct 2021
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs
  for Robust Speech Recognition
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Yiming Wang
Jinyu Li
Heming Wang
Yao Qian
Chengyi Wang
Yu Wu
35
47
0
11 Oct 2021
Embedding and Beamforming: All-neural Causal Beamformer for Multichannel
  Speech Enhancement
Embedding and Beamforming: All-neural Causal Beamformer for Multichannel Speech Enhancement
Andong Li
Wenzhe Liu
C. Zheng
Xiaodong Li
12
55
0
01 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection
  Fusion Module for Noise-robust ASR
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR
Fu-An Chao
J. Hung
Berlin Chen
8
7
0
26 Aug 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio
  Communication Speech
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Duo Ma
Nana Hou
Van Tung Pham
Haihua Xu
Chng Eng Siong
25
22
0
22 Jul 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using
  linear complexity self-attention for speech enhancement
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
13
9
0
30 Mar 2021
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Shota Orihashi
Ryo Masumura
22
8
0
02 Mar 2021
Improving Speech Enhancement Performance by Leveraging Contextual Broad
  Phonetic Class Information
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information
Yen-Ju Lu
Chia-Yu Chang
Cheng Yu
Ching-Feng Liu
J. Hung
Shinji Watanabe
Yu Tsao
19
4
0
15 Nov 2020
Dual Application of Speech Enhancement for Automatic Speech Recognition
Dual Application of Speech Enhancement for Automatic Speech Recognition
Ashutosh Pandey
Chunxi Liu
Yun Wang
Yatharth Saraf
33
37
0
07 Nov 2020
Exploring the Best Loss Function for DNN-Based Low-latency Speech
  Enhancement with Temporal Convolutional Networks
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks
Yuichiro Koyama
Tyler Vuong
Stefan Uhlich
Bhiksha Raj
14
41
0
23 May 2020
1