v1v2 (latest)

Filterbank design for end-to-end speech separation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

23 October 2019

Antoine Deleforge

Papers citing "Filterbank design for end-to-end speech separation"

32 / 32 papers shown

Title
Advances in Speech Separation: Techniques, Challenges, and Future Trends Kai Li Guo Chen Wendi Sang Yi Luo Zhuo Chen ... Shulin He Zhong-Qiu Wang Andong Li Z. Wu Xiaolin Hu AI4TS 88 4 0 14 Aug 2025
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling Jakob Poncelet Hugo Van hamme 342 2 0 05 Feb 2025
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms Seung-bin Kim Chan-yeong Lim Jungwoo Heo Ju-ho Kim Hyun-Seo Shin Kyo-Won Koo Ha-Jin Yu 237 3 0 11 Jun 2024
To what extent can ASV systems naturally defend against spoofing attacks?Interspeech (Interspeech), 2024 Jee-weon Jung Xin Eric Wang Nicholas W. D. Evans Shinji Watanabe Hye-jin Shim Hemlata Tak Sidhhant Arora Junichi Yamagishi Joon Son Chung AAML 179 10 0 08 Jun 2024
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet Satvik Venkatesh Arthur Benilov Philip Coleman Frederic Roskam 265 11 0 27 Feb 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection Théo Mariotte Anthony Larcher Silvio Montrésor Jean-Hugh Thomas 140 6 0 13 Feb 2024
A Convolutional Network Adaptation for Cortical Classification During Mobile Brain Imaging B. Cichy J. Lukos Mohammad Alam J. C. Bradford Nicholas Wymbs 101 0 0 11 Oct 2023
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude ConstraintsEuropean Signal Processing Conference (EUSIPCO), 2023 P. Magron Maria Sandsten 114 0 0 03 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-AttentionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Shengkui Zhao Bin Ma 197 69 0 23 Feb 2023
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT MagnitudesInterspeech (Interspeech), 2022 Danilo de Oliveira Tal Peer Timo Gerkmann 138 24 0 23 Jun 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame LengthJASA Express Letters (JE), 2022 Tal Peer Timo Gerkmann 162 24 0 30 Mar 2022
Pushing the limits of raw waveform speaker recognitionInterspeech (Interspeech), 2022 Jee-weon Jung You Jin Kim Hee-Soo Heo Bong-Jin Lee Youngki Kwon Joon Son Chung 187 115 0 16 Mar 2022
Learning Filterbanks for End-to-End Acoustic Beamforming Samuele Cornell Manuel Pariente François Grondin S. Squartini 183 8 0 08 Nov 2021
SNRi Target Training for Joint Speech Enhancement and RecognitionInterspeech (Interspeech), 2021 Yuma Koizumi Shigeki Karita A. Narayanan S. Panchapagesan M. Bacchiani 210 16 0 01 Nov 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent DomainIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021 Zengwei Yao Wenjie Pei Fanglin Chen Guangming Lu David C. Zhang 162 13 0 10 Oct 2021
A study of the robustness of raw waveform based speaker embeddings under mismatched conditionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Ge Zhu Frank Cwitkowitz Z. Duan 202 3 0 08 Oct 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker VerificationAutomatic Speech Recognition & Understanding (ASRU), 2021 Xuechen Liu Md. Sahidullah Tomi Kinnunen 114 7 0 24 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription Frank Cwitkowitz M. Heydari Z. Duan 262 2 0 23 Aug 2021
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancementIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021 Yuma Koizumi Shigeki Karita Scott Wisdom Hakan Erdogan J. Hershey Llion Jones M. Bacchiani 219 48 0 30 Jun 2021
A Modulation Front-End for Music Audio TaggingIEEE International Joint Conference on Neural Network (IJCNN), 2021 Cyrus Vahidi C. Saitis Gyorgy Fazekas 136 3 0 25 May 2021
Learnable MFCCs for Speaker VerificationInternational Symposium on Circuits and Systems (ISCAS), 2021 Xuechen Liu Md. Sahidullah Tomi Kinnunen 112 18 0 20 Feb 2021
LEAF: A Learnable Frontend for Audio ClassificationInternational Conference on Learning Representations (ICLR), 2021 Neil Zeghidour O. Teboul Félix de Chaumont Quitry Marco Tagliasacchi VLM AAML 203 163 0 21 Jan 2021
A comparison of handcrafted, parameterized, and learnable features for speech separationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020 Wenbo Zhu Mou Wang Xiao-Lei Zhang S. Rahardja 153 5 0 29 Nov 2020
Attention-based scaling adaptation for target speech extractionAutomatic Speech Recognition & Understanding (ASRU), 2020 Jiangyu Han Wei Rao Yanhua Long Jiaen Liang 166 10 0 19 Oct 2020
Vector-Quantized Timbre Representation Adrien Bitton P. Esling Tatsuya Harada 133 12 0 13 Jul 2020
Unsupervised Sound Separation Using Mixture Invariant Training Scott Wisdom Efthymios Tzinis Hakan Erdogan Ron J. Weiss K. Wilson J. Hershey 213 27 0 23 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 194 167 0 08 May 2020
Unsupervised Interpretable Representation Learning for Singing Voice SeparationEuropean Signal Processing Conference (EUSIPCO), 2020 S. I. Mimilakis Konstantinos Drossos G. Schuller 207 8 0 03 Mar 2020
Voice Separation with an Unknown Number of Multiple SpeakersInternational Conference on Machine Learning (ICML), 2020 Eliya Nachmani Yossi Adi Lior Wolf 318 182 0 29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker ClusteringIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020 Neil Zeghidour David Grangier VLM 264 282 0 20 Feb 2020
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNetIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 David Ditter Timo Gerkmann 198 62 0 25 Oct 2019
Deep Ad-hoc Beamforming Xiao-Lei Zhang 480 25 0 03 Nov 2018