Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1910.10400
Cited By
v1
v2 (latest)
Filterbank design for end-to-end speech separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
23 October 2019
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Filterbank design for end-to-end speech separation"
32 / 32 papers shown
Title
Advances in Speech Separation: Techniques, Challenges, and Future Trends
Kai Li
Guo Chen
Wendi Sang
Yi Luo
Zhuo Chen
...
Shulin He
Zhong-Qiu Wang
Andong Li
Z. Wu
Xiaolin Hu
AI4TS
88
4
0
14 Aug 2025
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling
Jakob Poncelet
Hugo Van hamme
342
2
0
05 Feb 2025
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms
Seung-bin Kim
Chan-yeong Lim
Jungwoo Heo
Ju-ho Kim
Hyun-Seo Shin
Kyo-Won Koo
Ha-Jin Yu
237
3
0
11 Jun 2024
To what extent can ASV systems naturally defend against spoofing attacks?
Interspeech (Interspeech), 2024
Jee-weon Jung
Xin Eric Wang
Nicholas W. D. Evans
Shinji Watanabe
Hye-jin Shim
Hemlata Tak
Sidhhant Arora
Junichi Yamagishi
Joon Son Chung
AAML
179
10
0
08 Jun 2024
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet
Satvik Venkatesh
Arthur Benilov
Philip Coleman
Frederic Roskam
265
11
0
27 Feb 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
140
6
0
13 Feb 2024
A Convolutional Network Adaptation for Cortical Classification During Mobile Brain Imaging
B. Cichy
J. Lukos
Mohammad Alam
J. C. Bradford
Nicholas Wymbs
101
0
0
11 Oct 2023
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints
European Signal Processing Conference (EUSIPCO), 2023
P. Magron
Maria Sandsten
114
0
0
03 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shengkui Zhao
Bin Ma
197
69
0
23 Feb 2023
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Interspeech (Interspeech), 2022
Danilo de Oliveira
Tal Peer
Timo Gerkmann
138
24
0
23 Jun 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
JASA Express Letters (JE), 2022
Tal Peer
Timo Gerkmann
162
24
0
30 Mar 2022
Pushing the limits of raw waveform speaker recognition
Interspeech (Interspeech), 2022
Jee-weon Jung
You Jin Kim
Hee-Soo Heo
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
187
115
0
16 Mar 2022
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
183
8
0
08 Nov 2021
SNRi Target Training for Joint Speech Enhancement and Recognition
Interspeech (Interspeech), 2021
Yuma Koizumi
Shigeki Karita
A. Narayanan
S. Panchapagesan
M. Bacchiani
210
16
0
01 Nov 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
162
13
0
10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ge Zhu
Frank Cwitkowitz
Z. Duan
202
3
0
08 Oct 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification
Automatic Speech Recognition & Understanding (ASRU), 2021
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
114
7
0
24 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
262
2
0
23 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Yuma Koizumi
Shigeki Karita
Scott Wisdom
Hakan Erdogan
J. Hershey
Llion Jones
M. Bacchiani
219
48
0
30 Jun 2021
A Modulation Front-End for Music Audio Tagging
IEEE International Joint Conference on Neural Network (IJCNN), 2021
Cyrus Vahidi
C. Saitis
Gyorgy Fazekas
136
3
0
25 May 2021
Learnable MFCCs for Speaker Verification
International Symposium on Circuits and Systems (ISCAS), 2021
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
112
18
0
20 Feb 2021
LEAF: A Learnable Frontend for Audio Classification
International Conference on Learning Representations (ICLR), 2021
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
203
163
0
21 Jan 2021
A comparison of handcrafted, parameterized, and learnable features for speech separation
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
153
5
0
29 Nov 2020
Attention-based scaling adaptation for target speech extraction
Automatic Speech Recognition & Understanding (ASRU), 2020
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
166
10
0
19 Oct 2020
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
133
12
0
13 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
213
27
0
23 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
194
167
0
08 May 2020
Unsupervised Interpretable Representation Learning for Singing Voice Separation
European Signal Processing Conference (EUSIPCO), 2020
S. I. Mimilakis
Konstantinos Drossos
G. Schuller
207
8
0
03 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
International Conference on Machine Learning (ICML), 2020
Eliya Nachmani
Yossi Adi
Lior Wolf
318
182
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Neil Zeghidour
David Grangier
VLM
264
282
0
20 Feb 2020
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
David Ditter
Timo Gerkmann
198
62
0
25 Oct 2019
Deep Ad-hoc Beamforming
Xiao-Lei Zhang
480
25
0
03 Nov 2018
1