2103.09963
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
18 March 2021
Kai Wang
Bengbeng He
Weiping Zhu
Papers citing
"TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain"
50 / 52 papers shown
Title
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices
Ci-Hao Wu
Tian-Sheuan Chang
61
1
0
27 Mar 2025
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks
Ekaterina Dmitrieva
Maksim Kaledin
42
0
0
21 Mar 2025
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
H. Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
65
0
0
10 Mar 2025
PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement
Zizhen Lin
Junyu Wang
Ruili Li
Fei Shen
Xi Xuan
64
0
0
27 Feb 2025
DENOASR: Debiasing ASRs through Selective Denoising
Anand Kumar Rai
S. Jaiswal
Shubham Prakash
Bendi Pragnya Sree
Animesh Mukherjee
34
0
0
22 Oct 2024
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
Kuan-Chen Wang
Kai-Chun Liu
Ping-Cheng Yeh
Sheng-Yu Peng
Yu Tsao
26
1
0
04 Oct 2024
DeFT-Mamba: Universal Multichannel Sound Separation and Polyphonic Audio Classification
Dongheon Lee
Jung-Woo Choi
Mamba
29
1
0
19 Sep 2024
Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement
Zizhen Lin
Yuanle Li
Junyu Wang
Ruili Li
34
0
0
18 Sep 2024
BSS-CFFMA: Cross-Domain Feature Fusion and Multi-Attention Speech Enhancement Network based on Self-Supervised Embedding
Alimjan Mattursun
Liejun Wang
Yinfeng Yu
30
2
0
13 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
33
0
0
08 Aug 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
24
1
0
09 Jul 2024
Vision Transformer Segmentation for Visual Bird Sound Denoising
Sahil Kumar
Jialu Li
Youshan Zhang
34
1
0
13 Jun 2024
Diffusion Gaussian Mixture Audio Denoise
Pu Wang
Junhui Li
Jialu Li
Liangdong Guo
Youshan Zhang
DiffM
29
0
0
13 Jun 2024
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
Zizhen Lin
Xiaoting Chen
Junyu Wang
27
2
0
07 Jun 2024
What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution
Kuan-Hsun Ho
J. Hung
Berlin Chen
26
1
0
04 Mar 2024
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Hamza Kheddar
Mustapha Hemis
Yassine Himeur
OffRL
38
58
0
02 Mar 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
17
3
0
11 Dec 2023
Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech Separation
Chenyu Gao
Yue Gu
I. Marsic
16
0
0
20 Nov 2023
Physics-Informed Data Denoising for Real-Life Sensing Systems
Xiyuan Zhang
Xiaohan Fu
Diyan Teng
Chengyu Dong
Keerthivasan Vijayakumar
...
Junsheng Han
Dezhi Hong
Rashmi Kulkarni
Jingbo Shang
Rajesh K. Gupta
AI4CE
PINN
6
3
0
12 Nov 2023
DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Jialu Li
Junhui Li
Pu Wang
Youshan Zhang
18
4
0
30 Oct 2023
DPATD: Dual-Phase Audio Transformer for Denoising
Junhui Li
Pu Wang
Jialu Li
Xinzhe Wang
Youshan Zhang
13
4
0
30 Oct 2023
Complex Image Generation SwinTransformer Network for Audio Denoising
Youshan Zhang
Jialu Li
22
6
0
24 Oct 2023
Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
Kamil Akesbi
Dorian Desblancs
Benjamin Martin
34
0
0
20 Oct 2023
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Bengt J. Borgström
M. Brandstein
9
2
0
21 Sep 2023
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Ye-Xin Lu
Yang Ai
Zhenhua Ling
25
7
0
17 Aug 2023
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement
Xinmeng Xu
Weiping Tu
Yuhong Yang
11
9
0
28 Jul 2023
Efficient Encoder-Decoder and Dual-Path Conformer for Comprehensive Feature Learning in Speech Enhancement
Junyu Wang
21
4
0
09 Jun 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
Frequency bin-wise single channel speech presence probability estimation using multiple DNNs
Shuai Tao
Himavanth Reddy
Jesper Rindom Jensen
M. G. Christensen
14
1
0
23 Feb 2023
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement
Shengkui Zhao
Bin Ma
27
16
0
23 Feb 2023
Improving Speech Enhancement via Event-based Query
Yifei Xin
Xiulian Peng
Yan Lu
26
6
0
20 Feb 2023
Audio Denoising for Robust Audio Fingerprinting
Kamil Akesbi
18
3
0
21 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
19
25
0
15 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
Analysis of Noisy-target Training for DNN-based speech enhancement
Takuya Fujimura
T. Toda
27
4
0
02 Nov 2022
TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain
Yiwen Wang
Zijian Lan
Xihong Wu
T. Qu
13
1
0
30 Oct 2022
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Youshan Zhang
Jialu Li
VLM
17
16
0
18 Oct 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
21
61
0
22 Sep 2022
MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement
Jianrong Wang
Xiaomin Li
Xuewei Li
Mei Yu
Qiang Fang
Li Liu
31
0
0
15 Sep 2022
Multi-View Attention Transfer for Efficient Speech Enhancement
Wooseok Shin
Hyun Joon Park
Jin Sob Kim
B. Lee
S. W. Han
25
8
0
22 Aug 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
8
91
0
28 Mar 2022
MANNER: Multi-view Attention Network for Noise Erasure
Hyun Joon Park
Byung Ha Kang
Wooseok Shin
Jin Sob Kim
S. W. Han
22
48
0
04 Mar 2022
DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Guochen Yu
Andong Li
Hui Wang
Yutian Wang
Yuxuan Ke
C. Zheng
26
35
0
16 Feb 2022
U-shaped Transformer with Frequency-Band Aware Attention for Speech Enhancement
Yi Li
Yang Sun
S. M. Naqvi
8
25
0
11 Dec 2021
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Z. Tan
John H. L. Hansen
Jesper Jensen
13
100
0
20 Nov 2021
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
19
48
0
11 Nov 2021
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Guochen Yu
Andong Li
C. Zheng
Yinuo Guo
Yutian Wang
Hui Wang
27
83
0
13 Oct 2021
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation
Ali Aroudi
Stefan Uhlich
M. Font
ViT
14
5
0
08 Oct 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
19
41
0
30 Jun 2021
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
68
94
0
27 Apr 2021