ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.15813
  4. Cited By
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using
  linear complexity self-attention for speech enhancement
v1v2 (latest)

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
30 June 2021
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
ArXiv (abs)PDFHTML

Papers citing "DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement"

27 / 27 papers shown
Improving DF-Conformer Using Hydra For High-Fidelity Generative Speech Enhancement on Discrete Codec Token
Improving DF-Conformer Using Hydra For High-Fidelity Generative Speech Enhancement on Discrete Codec Token
Shogo Seki
Shaoxiang Dang
Li Li
135
0
0
04 Nov 2025
Universal Discrete-Domain Speech Enhancement
Universal Discrete-Domain Speech Enhancement
Fei Liu
Yang Ai
Ye-Xin Lu
Rui Zheng
Hui-Peng Du
Zhen-Hua Ling
181
2
0
11 Oct 2025
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
Shigeki Karita
Yuma Koizumi
Heiga Zen
Haruko Ishikawa
Robin Scheibler
M. Bacchiani
VLM
1.1K
5
0
07 May 2025
Linguistic Knowledge Transfer Learning for Speech Enhancement
Linguistic Knowledge Transfer Learning for Speech Enhancement
Kuo-Hsuan Hung
Xugang Lu
Szu-Wei Fu
Huan-Hsin Tseng
Hsin-Yi Lin
Chii-Wann Lin
Yu Tsao
VLM
389
2
0
10 Mar 2025
FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks
FLEURS-R: A Restored Multilingual Speech Corpus for Generation TasksInterspeech (Interspeech), 2024
Min Ma
Yuma Koizumi
Shigeki Karita
Heiga Zen
Jason Riesa
Haruko Ishikawa
M. Bacchiani
VLM
242
17
0
12 Aug 2024
Sampling-Frequency-Independent Universal Sound Separation
Sampling-Frequency-Independent Universal Sound Separation
Tomohiko Nakamura
Kohei Yatabe
202
0
0
22 Sep 2023
HM-Conformer: A Conformer-based audio deepfake detection system with
  hierarchical pooling and multi-level classification token aggregation methods
HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methodsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Hyun-Seo Shin
Ju-Sung Heo
Ju-ho Kim
Chanmann Lim
Wonbin Kim
Ha-Jin Yu
285
20
0
15 Sep 2023
Exploiting Time-Frequency Conformers for Music Audio Enhancement
Exploiting Time-Frequency Conformers for Music Audio EnhancementACM Multimedia (ACM MM), 2023
Yunkee Chae
Junghyun Koo
Sungho Lee
Kyogu Lee
295
8
0
24 Aug 2023
Algorithms of Sampling-Frequency-Independent Layers for Non-integer
  Strides
Algorithms of Sampling-Frequency-Independent Layers for Non-integer StridesEuropean Signal Processing Conference (EUSIPCO), 2023
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
178
5
0
19 Jun 2023
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech CorpusInterspeech (Interspeech), 2023
Yuma Koizumi
Heiga Zen
Shigeki Karita
Yifan Ding
Kohei Yatabe
Nobuyuki Morioka
M. Bacchiani
Yu Zhang
Wei Han
Ankur Bapna
273
159
0
30 May 2023
Anomalous Sound Detection Based on Sound Separation
Anomalous Sound Detection Based on Sound SeparationInterspeech (Interspeech), 2023
Kanta Shimonishi
Kota Dohi
Yohei Kawaguchi
214
7
0
25 May 2023
AudioSlots: A slot-centric generative model for audio separation
AudioSlots: A slot-centric generative model for audio separation
P. Reddy
Scott Wisdom
Klaus Greff
J. Hershey
Thomas Kipf
OCLVLM
304
6
0
09 May 2023
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised
  Speech and Text Representations
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text RepresentationsIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
Yuma Koizumi
Heiga Zen
Shigeki Karita
Yifan Ding
Kohei Yatabe
Nobuyuki Morioka
Yu Zhang
Wei Han
Ankur Bapna
M. Bacchiani
318
48
0
03 Mar 2023
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech
  Enhancement
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech EnhancementIEEE Signal Processing Letters (SPL), 2022
Dongheon Lee
Jung-Woo Choi
399
43
0
15 Dec 2022
Analysis of Noisy-target Training for DNN-based speech enhancement
Analysis of Noisy-target Training for DNN-based speech enhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Takuya Fujimura
Tomoki Toda
264
10
0
02 Nov 2022
Improved Normalizing Flow-Based Speech Enhancement using an All-pole
  Gammatone Filterbank for Conditional Input Representation
Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input RepresentationSpoken Language Technology Workshop (SLT), 2022
Martin Strauss
Matteo Torcoli
B. Edler
225
7
0
21 Oct 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
CMGAN: Conformer-Based Metric-GAN for Monaural Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Sherif Abdulatif
Ru Cao
Bin Yang
439
126
0
22 Sep 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech
  Separation
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech SeparationInterspeech (Interspeech), 2022
Jian Luo
Jianzong Wang
Ning Cheng
Edward Xiao
Xulong Zhang
Jing Xiao
ViT
209
13
0
28 Jun 2022
Insights Into Deep Non-linear Filters for Improved Multi-channel Speech
  Enhancement
Insights Into Deep Non-linear Filters for Improved Multi-channel Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Kristina Tesch
Timo Gerkmann
421
89
0
27 Jun 2022
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based
  Non-linear Multi-channel Speech Enhancement
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech EnhancementInterspeech (Interspeech), 2022
Kristina Tesch
Nils-Hendrik Mohrmann
Timo Gerkmann
220
10
0
22 Jun 2022
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller
  Optimized for ASR Accuracy
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR AccuracyInterspeech (Interspeech), 2022
S. Panchapagesan
A. Narayanan
T. Shabestary
Shuai Shao
N. Howard
Alex Park
James Walker
A. Gruenstein
220
9
0
06 May 2022
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with
  Adaptive Noise Spectral Shaping
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral ShapingInterspeech (Interspeech), 2022
Yuma Koizumi
Heiga Zen
Kohei Yatabe
Nanxin Chen
M. Bacchiani
DiffM
380
54
0
31 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic
  Speaker Verification
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker VerificationInterspeech (Interspeech), 2022
Yang Zhang
Zhiqiang Lv
Haibin Wu
Shanshan Zhang
Pengfei Hu
Zhiyong Wu
Hung-yi Lee
Helen Meng
ViT
298
171
0
29 Mar 2022
Exploring Self-Attention Mechanisms for Speech Separation
Exploring Self-Attention Mechanisms for Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
300
41
0
06 Feb 2022
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable
  and Efficient Speech Enhancement
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Sunwoo Kim
Minje Kim
690
8
0
17 Nov 2021
MT3: Multi-Task Multitrack Music Transcription
MT3: Multi-Task Multitrack Music TranscriptionInternational Conference on Learning Representations (ICLR), 2021
Josh Gardner
Ian Simon
Ethan Manilow
Curtis Hawthorne
Jesse Engel
684
127
0
04 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition
SNRi Target Training for Joint Speech Enhancement and RecognitionInterspeech (Interspeech), 2021
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
302
18
0
01 Nov 2021
1
Page 1 of 1