ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10400
  4. Cited By
Filterbank design for end-to-end speech separation
v1v2 (latest)

Filterbank design for end-to-end speech separation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
23 October 2019
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
ArXiv (abs)PDFHTML

Papers citing "Filterbank design for end-to-end speech separation"

32 / 32 papers shown
Title
Advances in Speech Separation: Techniques, Challenges, and Future Trends
Advances in Speech Separation: Techniques, Challenges, and Future Trends
Kai Li
Guo Chen
Wendi Sang
Yi Luo
Zhuo Chen
...
Shulin He
Zhong-Qiu Wang
Andong Li
Z. Wu
Xiaolin Hu
AI4TS
88
4
0
14 Aug 2025
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling
Jakob Poncelet
Hugo Van hamme
342
2
0
05 Feb 2025
MR-RawNet: Speaker verification system with multiple temporal
  resolutions for variable duration utterances using raw waveforms
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms
Seung-bin Kim
Chan-yeong Lim
Jungwoo Heo
Ju-ho Kim
Hyun-Seo Shin
Kyo-Won Koo
Ha-Jin Yu
237
3
0
11 Jun 2024
To what extent can ASV systems naturally defend against spoofing
  attacks?
To what extent can ASV systems naturally defend against spoofing attacks?Interspeech (Interspeech), 2024
Jee-weon Jung
Xin Eric Wang
Nicholas W. D. Evans
Shinji Watanabe
Hye-jin Shim
Hemlata Tak
Sidhhant Arora
Junichi Yamagishi
Joon Son Chung
AAML
179
10
0
08 Jun 2024
Real-time Low-latency Music Source Separation using Hybrid
  Spectrogram-TasNet
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet
Satvik Venkatesh
Arthur Benilov
Philip Coleman
Frederic Roskam
265
11
0
27 Feb 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and
  Overlapped Speech Detection
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
140
6
0
13 Feb 2024
A Convolutional Network Adaptation for Cortical Classification During
  Mobile Brain Imaging
A Convolutional Network Adaptation for Cortical Classification During Mobile Brain Imaging
B. Cichy
J. Lukos
Mohammad Alam
J. C. Bradford
Nicholas Wymbs
101
0
0
11 Oct 2023
Spectrogram Inversion for Audio Source Separation via Consistency,
  Mixing, and Magnitude Constraints
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude ConstraintsEuropean Signal Processing Conference (EUSIPCO), 2023
P. Magron
Maria Sandsten
114
0
0
03 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation
  using Gated Single-Head Transformer with Convolution-Augmented Joint
  Self-Attentions
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-AttentionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shengkui Zhao
Bin Ma
197
69
0
23 Feb 2023
Efficient Transformer-based Speech Enhancement Using Long Frames and
  STFT Magnitudes
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT MagnitudesInterspeech (Interspeech), 2022
Danilo de Oliveira
Tal Peer
Timo Gerkmann
138
24
0
23 Jun 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Phase-Aware Deep Speech Enhancement: It's All About The Frame LengthJASA Express Letters (JE), 2022
Tal Peer
Timo Gerkmann
162
24
0
30 Mar 2022
Pushing the limits of raw waveform speaker recognition
Pushing the limits of raw waveform speaker recognitionInterspeech (Interspeech), 2022
Jee-weon Jung
You Jin Kim
Hee-Soo Heo
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
187
115
0
16 Mar 2022
Learning Filterbanks for End-to-End Acoustic Beamforming
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
183
8
0
08 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition
SNRi Target Training for Joint Speech Enhancement and RecognitionInterspeech (Interspeech), 2021
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
210
16
0
01 Nov 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in
  High-order Latent Domain
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent DomainIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
162
13
0
10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under
  mismatched conditions
A study of the robustness of raw waveform based speaker embeddings under mismatched conditionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ge Zhu
Frank Cwitkowitz
Z. Duan
202
3
0
08 Oct 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep
  Speaker Verification
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker VerificationAutomatic Speech Recognition & Understanding (ASRU), 2021
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
114
7
0
24 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
262
2
0
23 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using
  linear complexity self-attention for speech enhancement
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancementIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
219
48
0
30 Jun 2021
A Modulation Front-End for Music Audio Tagging
A Modulation Front-End for Music Audio TaggingIEEE International Joint Conference on Neural Network (IJCNN), 2021
Cyrus Vahidi
C. Saitis
Gyorgy Fazekas
136
3
0
25 May 2021
Learnable MFCCs for Speaker Verification
Learnable MFCCs for Speaker VerificationInternational Symposium on Circuits and Systems (ISCAS), 2021
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
112
18
0
20 Feb 2021
LEAF: A Learnable Frontend for Audio Classification
LEAF: A Learnable Frontend for Audio ClassificationInternational Conference on Learning Representations (ICLR), 2021
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLMAAML
203
163
0
21 Jan 2021
A comparison of handcrafted, parameterized, and learnable features for
  speech separation
A comparison of handcrafted, parameterized, and learnable features for speech separationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
153
5
0
29 Nov 2020
Attention-based scaling adaptation for target speech extraction
Attention-based scaling adaptation for target speech extractionAutomatic Speech Recognition & Understanding (ASRU), 2020
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
166
10
0
19 Oct 2020
Vector-Quantized Timbre Representation
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
133
12
0
13 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
213
27
0
23 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for
  researchers
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
194
167
0
08 May 2020
Unsupervised Interpretable Representation Learning for Singing Voice
  Separation
Unsupervised Interpretable Representation Learning for Singing Voice SeparationEuropean Signal Processing Conference (EUSIPCO), 2020
S. I. Mimilakis
Konstantinos Drossos
G. Schuller
207
8
0
03 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Voice Separation with an Unknown Number of Multiple SpeakersInternational Conference on Machine Learning (ICML), 2020
Eliya Nachmani
Yossi Adi
Lior Wolf
318
182
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker ClusteringIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Neil Zeghidour
David Grangier
VLM
264
282
0
20 Feb 2020
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNetIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
David Ditter
Timo Gerkmann
198
62
0
25 Oct 2019
Deep Ad-hoc Beamforming
Deep Ad-hoc Beamforming
Xiao-Lei Zhang
480
25
0
03 Nov 2018
1