ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.01941
  4. Cited By
Dense CNN with Self-Attention for Time-Domain Speech Enhancement
v1v2 (latest)

Dense CNN with Self-Attention for Time-Domain Speech Enhancement

3 September 2020
Ashutosh Pandey
DeLiang Wang
ArXiv (abs)PDFHTML

Papers citing "Dense CNN with Self-Attention for Time-Domain Speech Enhancement"

34 / 34 papers shown
Title
Modulating State Space Model with SlowFast Framework for Compute-Efficient Ultra Low-Latency Speech Enhancement
Modulating State Space Model with SlowFast Framework for Compute-Efficient Ultra Low-Latency Speech Enhancement
Longbiao Cheng
Ashutosh Pandey
Buye Xu
T. Delbruck
V. Ithapu
Shih-Chii Liu
71
2
0
04 Nov 2024
Multichannel-to-Multichannel Target Sound Extraction Using Direction and
  Timestamp Clues
Multichannel-to-Multichannel Target Sound Extraction Using Direction and Timestamp Clues
Dayun Choi
Jung-Woo Choi
61
0
0
19 Sep 2024
Heterogeneous Space Fusion and Dual-Dimension Attention: A New Paradigm
  for Speech Enhancement
Heterogeneous Space Fusion and Dual-Dimension Attention: A New Paradigm for Speech Enhancement
Tao Zheng
Liejun Wang
Yinfeng Yu
97
1
0
13 Aug 2024
Towards Decoupling Frontend Enhancement and Backend Recognition in
  Monaural Robust ASR
Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
55
4
0
11 Mar 2024
Objective and subjective evaluation of speech enhancement methods in the
  UDASE task of the 7th CHiME challenge
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Simon Leglaive
Matthieu Fraticelli
Hend ElGhazaly
Léonie Borne
Mostafa Sadeghi
Scott Wisdom
Manuel Pariente
J. Hershey
Daniel Pressnitzer
Jon P. Barker
77
11
0
02 Feb 2024
On the Importance of Neural Wiener Filter for Resource Efficient
  Multichannel Speech Enhancement
On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement
Tsun-An Hsieh
Jacob Donley
Daniel D. E. Wong
Buye Xu
Ashutosh Pandey
58
3
0
15 Jan 2024
Decoupled Spatial and Temporal Processing for Resource Efficient
  Multichannel Speech Enhancement
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
90
3
0
15 Jan 2024
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained
  Generative Methods for Speech Enhancement in Adverse Conditions
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions
Heming Wang
Meng Yu
Huatian Zhang
Chunlei Zhang
Zhongweiyang Xu
Muqiao Yang
Yixuan Zhang
Dong Yu
88
3
0
16 Sep 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with
  Convolutional Cross Attention in Multi-talker Conditions
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
Jie Zhang
Qingquan Xu
Qiu-shi Zhu
Zhenhua Ling
70
12
0
17 May 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and
  Complexity via Integrated Full- and Sub-Band Modeling
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
AI4TS
52
12
0
18 Apr 2023
Wav2code: Restore Clean Speech Representations via Codebook Lookup for
  Noise-Robust ASR
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
Yuchen Hu
Cheng Chen
Qiu-shi Zhu
Eng Siong Chng
124
16
0
11 Apr 2023
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech
  Enhancement
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
76
29
0
15 Dec 2022
SkipConvGAN: Monaural Speech Dereverberation using Generative
  Adversarial Networks via Complex Time-Frequency Masking
SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking
Vinay Kothapally
John H. L. Hansen
44
23
0
22 Nov 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech
  Separation
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
104
138
0
22 Nov 2022
Self-Supervised Learning for Speech Enhancement through Synthesis
Self-Supervised Learning for Speech Enhancement through Synthesis
Bryce Irvin
Marko Stamenovic
M. Kegler
Li-Chia Yang
88
21
0
04 Nov 2022
A Training and Inference Strategy Using Noisy and Enhanced Speech as
  Target for Speech Enhancement without Clean Speech
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
61
3
0
27 Oct 2022
Time-Domain Speech Enhancement for Robust Automatic Speech Recognition
Time-Domain Speech Enhancement for Robust Automatic Speech Recognition
Yufeng Yang
Ashutosh Pandey
DeLiang Wang
57
8
0
24 Oct 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural
  Speaker Separation
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
149
108
0
08 Sep 2022
A two-stage full-band speech enhancement model with effective spectral
  compression mapping
A two-stage full-band speech enhancement model with effective spectral compression mapping
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
60
0
0
27 Jun 2022
MANNER: Multi-view Attention Network for Noise Erasure
MANNER: Multi-view Attention Network for Noise Erasure
Hyun Joon Park
Byung Ha Kang
Wooseok Shin
Jin Sob Kim
S. W. Han
92
50
0
04 Mar 2022
RemixIT: Continual self-training of speech enhancement models via
  bootstrapped remixing
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
86
55
0
17 Feb 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with
  attention-in-attention transformer for monaural speech enhancement
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
82
37
0
16 Feb 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent
  Architecture for Acoustic Signal Enhancement
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
50
3
0
24 Jan 2022
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for
  Speech Enhancement Using Sign-Exponent-Only Floating-Points
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Yu-Chen Lin
Cheng Yu
Y. Hsu
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
31
6
0
08 Nov 2021
Multichannel Speech Enhancement without Beamforming
Multichannel Speech Enhancement without Beamforming
Asutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
62
15
0
25 Oct 2021
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
73
5
0
22 Oct 2021
TPARN: Triple-path Attentive Recurrent Network for Time-domain
  Multichannel Speech Enhancement
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
Anurag Kumar
Jacob Donley
P. Calamia
DeLiang Wang
KELM
91
45
0
20 Oct 2021
Continual self-training with bootstrapped remixing for speech
  enhancement
Continual self-training with bootstrapped remixing for speech enhancement
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Anurag Kumar
85
16
0
19 Oct 2021
Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Sefik Emre Eskimez
Takuya Yoshioka
Huaming Wang
Xiaofei Wang
Zhuo Chen
Xuedong Huang
87
62
0
18 Oct 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel
  Speech Enhancement
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement
Andong Li
C. Zheng
Lu Zhang
Xiaodong Li
91
147
0
22 Jun 2021
Self-attending RNN for Speech Enhancement to Improve Cross-corpus
  Generalization
Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization
Ashutosh Pandey
DeLiang Wang
39
45
0
26 May 2021
DBNet: A Dual-branch Network Architecture Processing on Spectrum and
  Waveform for Single-channel Speech Enhancement
DBNet: A Dual-branch Network Architecture Processing on Spectrum and Waveform for Single-channel Speech Enhancement
Kanghao Zhang
Shulin He
Hao Li
Xueliang Zhang
36
13
0
06 May 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal
  Convolutional Networks
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks
Ju Lin
A. Wijngaarden
Kuang-Ching Wang
M. C. Smith
78
51
0
24 Feb 2021
Dual-path Self-Attention RNN for Real-Time Speech Enhancement
Dual-path Self-Attention RNN for Real-Time Speech Enhancement
Ashutosh Pandey
DeLiang Wang
70
24
0
23 Oct 2020
1