
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021

30 June 2021

Papers citing "DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement"

27 / 27 papers shown

Improving DF-Conformer Using Hydra For High-Fidelity Generative Speech Enhancement on Discrete Codec Token

Shogo Seki

Shaoxiang Dang

Li Li

135

04 Nov 2025

Universal Discrete-Domain Speech Enhancement

181

11 Oct 2025

Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration

1.1K

07 May 2025

Linguistic Knowledge Transfer Learning for Speech Enhancement

389

10 Mar 2025

FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks

Interspeech (Interspeech), 2024

Min Ma

Yuma Koizumi

242

12 Aug 2024

Sampling-Frequency-Independent Universal Sound Separation

Tomohiko Nakamura

Kohei Yatabe

202

22 Sep 2023

HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

285

15 Sep 2023

Exploiting Time-Frequency Conformers for Music Audio Enhancement

ACM Multimedia (ACM MM), 2023

295

24 Aug 2023

Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

European Signal Processing Conference (EUSIPCO), 2023

Hiroshi Saruwatari

178

19 Jun 2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus

Interspeech (Interspeech), 2023

273

159

30 May 2023

Anomalous Sound Detection Based on Sound Separation

Interspeech (Interspeech), 2023

Kanta Shimonishi

Kota Dohi

Yohei Kawaguchi

214

25 May 2023

AudioSlots: A slot-centric generative model for audio separation

304

09 May 2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023

318

03 Mar 2023

DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement

IEEE Signal Processing Letters (SPL), 2022

Dongheon Lee

Jung-Woo Choi

399

15 Dec 2022

Analysis of Noisy-target Training for DNN-based speech enhancement

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Takuya Fujimura

Tomoki Toda

264

02 Nov 2022

Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation

Spoken Language Technology Workshop (SLT), 2022

Martin Strauss

Matteo Torcoli

B. Edler

225

21 Oct 2022

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Sherif Abdulatif

Ru Cao

Bin Yang

439

126

22 Sep 2022

Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation

Interspeech (Interspeech), 2022

209

28 Jun 2022

Insights Into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Kristina Tesch

Timo Gerkmann

421

27 Jun 2022

On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement

Interspeech (Interspeech), 2022

Kristina Tesch

Nils-Hendrik Mohrmann

Timo Gerkmann

220

22 Jun 2022

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

Interspeech (Interspeech), 2022

220

06 May 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping

Interspeech (Interspeech), 2022

380

31 Mar 2022

MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification

Interspeech (Interspeech), 2022

Haibin Wu

Zhiyong Wu

298

171

29 Mar 2022

Exploring Self-Attention Mechanisms for Speech Separation

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Mirco Ravanelli

300

06 Feb 2022

BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement

Sunwoo Kim

Minje Kim

690

17 Nov 2021

MT3: Multi-Task Multitrack Music Transcription

International Conference on Learning Representations (ICLR), 2021

684

127

04 Nov 2021

SNRi Target Training for Joint Speech Enhancement and Recognition

Interspeech (Interspeech), 2021

302

01 Nov 2021