ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.09963
  4. Cited By
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement
  in the Time Domain

TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain

18 March 2021
Kai Wang
Bengbeng He
Weiping Zhu
ArXivPDFHTML

Papers citing "TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain"

50 / 52 papers shown
Title
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices
Ci-Hao Wu
Tian-Sheuan Chang
61
1
0
27 Mar 2025
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks
Ekaterina Dmitrieva
Maksim Kaledin
42
0
0
21 Mar 2025
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
H. Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
65
0
0
10 Mar 2025
PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement
PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement
Zizhen Lin
Junyu Wang
Ruili Li
Fei Shen
Xi Xuan
64
0
0
27 Feb 2025
DENOASR: Debiasing ASRs through Selective Denoising
DENOASR: Debiasing ASRs through Selective Denoising
Anand Kumar Rai
S. Jaiswal
Shubham Prakash
Bendi Pragnya Sree
Animesh Mukherjee
34
0
0
22 Oct 2024
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for
  Surface Electromyography Enhancement
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
Kuan-Chen Wang
Kai-Chun Liu
Ping-Cheng Yeh
Sheng-Yu Peng
Yu Tsao
26
1
0
04 Oct 2024
DeFT-Mamba: Universal Multichannel Sound Separation and Polyphonic Audio
  Classification
DeFT-Mamba: Universal Multichannel Sound Separation and Polyphonic Audio Classification
Dongheon Lee
Jung-Woo Choi
Mamba
29
1
0
19 Sep 2024
Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight
  Speech Enhancement
Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement
Zizhen Lin
Yuanle Li
Junyu Wang
Ruili Li
34
0
0
18 Sep 2024
BSS-CFFMA: Cross-Domain Feature Fusion and Multi-Attention Speech
  Enhancement Network based on Self-Supervised Embedding
BSS-CFFMA: Cross-Domain Feature Fusion and Multi-Attention Speech Enhancement Network based on Self-Supervised Embedding
Alimjan Mattursun
Liejun Wang
Yinfeng Yu
30
2
0
13 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
33
0
0
08 Aug 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band
  Features with Dual-branch Conformer
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
24
1
0
09 Jul 2024
Vision Transformer Segmentation for Visual Bird Sound Denoising
Vision Transformer Segmentation for Visual Bird Sound Denoising
Sahil Kumar
Jialu Li
Youshan Zhang
34
1
0
13 Jun 2024
Diffusion Gaussian Mixture Audio Denoise
Diffusion Gaussian Mixture Audio Denoise
Pu Wang
Junhui Li
Jialu Li
Liangdong Guo
Youshan Zhang
DiffM
29
0
0
13 Jun 2024
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion
  Enhanced Taylor Transformer for U-Net-based Speech Enhancement
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
Zizhen Lin
Xiaoting Chen
Junyu Wang
27
2
0
07 Jun 2024
What do neural networks listen to? Exploring the crucial bands in Speech
  Enhancement using Sinc-convolution
What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution
Kuan-Hsun Ho
J. Hung
Berlin Chen
26
1
0
04 Mar 2024
Automatic Speech Recognition using Advanced Deep Learning Approaches: A
  survey
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Hamza Kheddar
Mustapha Hemis
Yassine Himeur
OffRL
38
59
0
02 Mar 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic
  Control Using Multi-Objective Learning
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
17
3
0
11 Dec 2023
Improving Label Assignments Learning by Dynamic Sample Dropout Combined
  with Layer-wise Optimization in Speech Separation
Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech Separation
Chenyu Gao
Yue Gu
I. Marsic
16
0
0
20 Nov 2023
Physics-Informed Data Denoising for Real-Life Sensing Systems
Physics-Informed Data Denoising for Real-Life Sensing Systems
Xiyuan Zhang
Xiaohan Fu
Diyan Teng
Chengyu Dong
Keerthivasan Vijayakumar
...
Junsheng Han
Dezhi Hong
Rashmi Kulkarni
Jingbo Shang
Rajesh K. Gupta
AI4CE
PINN
6
3
0
12 Nov 2023
DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Jialu Li
Junhui Li
Pu Wang
Youshan Zhang
18
4
0
30 Oct 2023
DPATD: Dual-Phase Audio Transformer for Denoising
DPATD: Dual-Phase Audio Transformer for Denoising
Junhui Li
Pu Wang
Jialu Li
Xinzhe Wang
Youshan Zhang
13
4
0
30 Oct 2023
Complex Image Generation SwinTransformer Network for Audio Denoising
Complex Image Generation SwinTransformer Network for Audio Denoising
Youshan Zhang
Jialu Li
22
6
0
24 Oct 2023
Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
Kamil Akesbi
Dorian Desblancs
Benjamin Martin
34
0
0
20 Oct 2023
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network
  Speech Enhancement
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Bengt J. Borgström
M. Brandstein
9
2
0
21 Sep 2023
Explicit Estimation of Magnitude and Phase Spectra in Parallel for
  High-Quality Speech Enhancement
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Ye-Xin Lu
Yang Ai
Zhenhua Ling
25
7
0
17 Aug 2023
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient
  Monaural Speech Enhancement
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement
Xinmeng Xu
Weiping Tu
Yuhong Yang
11
9
0
28 Jul 2023
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive
  Feature Learning in Speech Enhancement
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive Feature Learning in Speech Enhancement
Junyu Wang
21
4
0
09 Jun 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
Frequency bin-wise single channel speech presence probability estimation
  using multiple DNNs
Frequency bin-wise single channel speech presence probability estimation using multiple DNNs
Shuai Tao
Himavanth Reddy
Jesper Rindom Jensen
M. G. Christensen
14
1
0
23 Feb 2023
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using
  Joint Complex Masking and Complex Spectral Mapping for Monaural Speech
  Enhancement
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement
Shengkui Zhao
Bin Ma
27
16
0
23 Feb 2023
Improving Speech Enhancement via Event-based Query
Improving Speech Enhancement via Event-based Query
Yifei Xin
Xiulian Peng
Yan Lu
26
6
0
20 Feb 2023
Audio Denoising for Robust Audio Fingerprinting
Audio Denoising for Robust Audio Fingerprinting
Kamil Akesbi
21
3
0
21 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech
  Enhancement
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
19
25
0
15 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
Analysis of Noisy-target Training for DNN-based speech enhancement
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
T. Toda
27
4
0
02 Nov 2022
TT-Net: Dual-path transformer based sound field translation in the
  spherical harmonic domain
TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain
Yiwen Wang
Zijian Lan
Xihong Wu
T. Qu
13
1
0
30 Oct 2022
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Youshan Zhang
Jialu Li
VLM
17
16
0
18 Oct 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
21
61
0
22 Sep 2022
MVNet: Memory Assistance and Vocal Reinforcement Network for Speech
  Enhancement
MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement
Jianrong Wang
Xiaomin Li
Xuewei Li
Mei Yu
Qiang Fang
Li Liu
31
0
0
15 Sep 2022
Multi-View Attention Transfer for Efficient Speech Enhancement
Multi-View Attention Transfer for Efficient Speech Enhancement
Wooseok Shin
Hyun Joon Park
Jin Sob Kim
B. Lee
S. W. Han
25
8
0
22 Aug 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
10
91
0
28 Mar 2022
MANNER: Multi-view Attention Network for Noise Erasure
MANNER: Multi-view Attention Network for Noise Erasure
Hyun Joon Park
Byung Ha Kang
Wooseok Shin
Jin Sob Kim
S. W. Han
24
48
0
04 Mar 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with
  attention-in-attention transformer for monaural speech enhancement
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
26
35
0
16 Feb 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech
  Enhancement
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
10
25
0
11 Dec 2021
Deep Spoken Keyword Spotting: An Overview
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Z. Tan
John H. L. Hansen
Jesper Jensen
13
100
0
20 Nov 2021
Uformer: A Unet based dilated complex & real dual-path conformer network
  for simultaneous speech enhancement and dereverberation
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
19
48
0
11 Nov 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech
  enhancement
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
27
83
0
13 Oct 2021
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant
  Sound Source Separation
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation
Ali Aroudi
Stefan Uhlich
M. Font
ViT
16
5
0
08 Oct 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using
  linear complexity self-attention for speech enhancement
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion
  Network for Speech Enhancement
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
68
94
0
27 Apr 2021
12
Next