SDR - half-baked or well done?

6 November 2018

Papers citing "SDR - half-baked or well done?"

50 / 614 papers shown

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement Yuma Koizumi Shigeki Karita Scott Wisdom Hakan Erdogan J. Hershey Llion Jones M. Bacchiani 29 41 0 30 Jun 2021
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders Xiaoyu Bie Simon Leglaive Xavier Alameda-Pineda Laurent Girin DiffM 38 54 0 23 Jun 2021
Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement Andong Li C. Zheng Lu Zhang Xiaodong Li 19 142 0 22 Jun 2021
Improving On-Screen Sound Separation for Open-Domain Videos with Audio-Visual Self-Attention Efthymios Tzinis Scott Wisdom Tal Remez J. Hershey VLM 32 8 0 17 Jun 2021
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation Martin Strauss Jouni Paulus Matteo Torcoli B. Edler 31 8 0 16 Jun 2021
Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation Jisi Zhang Catalin Zorila R. Doddipatla Jon Barker 23 22 0 15 Jun 2021
Few-shot learning of new sound classes for target sound extraction Marc Delcroix Jorge Bennasar Vázquez Tsubasa Ochiai K. Kinoshita S. Araki VLM 29 11 0 14 Jun 2021
Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex Keitaro Tanaka Ryosuke Sawata Shusuke Takahashi 27 0 0 04 Jun 2021
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation Scott Wisdom A. Jansen Ron J. Weiss Hakan Erdogan J. Hershey 43 26 0 01 Jun 2021
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement Guillaume Carbajal Julius Richter Timo Gerkmann DRL 20 15 0 19 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics V. Jayaram John Thickstun DiffM 28 23 0 17 May 2021
Move2Hear: Active Audio-Visual Source Separation Sagnik Majumder Ziad Al-Halah Kristen Grauman 21 44 0 15 May 2021
Separate but Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data Efthymios Tzinis Jonah Casebeer Zhepei Wang Paris Smaragdis FedML 32 19 0 11 May 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation Sunwoo Kim Minje Kim 36 19 0 08 May 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection Aswin Sivaraman Minje Kim 21 9 0 08 May 2021
Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes Aurora Cramer M. Cartwright Fatemeh Pishdadian J. P. Bello 23 2 0 06 May 2021
AvaTr: One-Shot Speaker Extraction with Transformers S. Hu Md Rifat Arefin V. Nguyen Alish Dipani Xaq Pitkow A. Tolias 38 4 0 03 May 2021
Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain Rongzhi Gu Shi-Xiong Zhang Yuexian Zou Dong Yu 41 34 0 26 Apr 2021
Improving Neural Silent Speech Interface Models by Adversarial Training Amin Honarmandi Shandiz L. Tóth G. Gosztolya Alexandra Markó Tamás Gábor Csapó AAML GAN 24 7 0 23 Apr 2021
Reconstructing Speech from Real-Time Articulatory MRI Using Neural Vocoders Yicong Yu Amin Honarmandi Shandiz L. Tóth 22 18 0 23 Apr 2021
Nonlinear Spatial Filtering in Multichannel Speech Enhancement Kristina Tesch Timo Gerkmann 19 19 0 22 Apr 2021
Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation Yueyue Na Ziteng Wang Zhang Liu Biao Tian Q. Fu 30 3 0 09 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification Aswin Sivaraman Sunwoo Kim Minje Kim 19 23 0 05 Apr 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment Meng Yu Chunlei Zhang Yong-mei Xu Shi-Xiong Zhang Dong Yu 12 30 0 02 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech Chenglin Xu Wei Rao Jibin Wu Haizhou Li 34 32 0 30 Mar 2021
Time-domain Speech Enhancement with Generative Adversarial Learning Feiyang Xiao Jian Guan Qiuqiang Kong Wenwu Wang GAN 21 9 0 30 Mar 2021
On TasNet for Low-Latency Single-Speaker Speech Enhancement Morten Kolbæk Zheng-Hua Tan S. H. Jensen Jesper Jensen 25 2 0 27 Mar 2021
Blind Speech Separation and Dereverberation using Neural Beamforming Lukas Pfeifenberger Franz Pernkopf 26 5 0 24 Mar 2021
Compute and memory efficient universal sound source separation Efthymios Tzinis Zhepei Wang Xilin Jiang Paris Smaragdis 26 40 0 03 Mar 2021
Reverb Conversion of Mixed Vocal Tracks Using an End-to-end Convolutional Deep Neural Network Junghyun Koo Seungryeol Paik Kyogu Lee 24 13 0 03 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect Jun Wang Max W. Y. Lam Dan Su Dong Yu 22 6 0 02 Mar 2021
Contrastive Separative Coding for Self-supervised Representation Learning Jun Wang Max W. Y. Lam Dan Su Dong Yu SSL 24 3 0 01 Mar 2021
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks Ju Lin A. Wijngaarden Kuang-Ching Wang M. C. Smith 27 50 0 24 Feb 2021
Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder Hu Fang Guillaume Carbajal S. Wermter Timo Gerkmann 39 59 0 17 Feb 2021
Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier Guillaume Carbajal Julius Richter Timo Gerkmann DRL SSL 18 16 0 12 Feb 2021
Joint Dereverberation and Separation with Iterative Source Steering Taishi Nakashima Robin Scheibler M. Togami Nobutaka Ono 6 12 0 12 Feb 2021
Multichannel-based learning for audio object extraction Daniel Arteaga Jordi Pons DiffM 13 3 0 11 Feb 2021
Real-time Monaural Speech Enhancement With Short-time Discrete Cosine Transform Qinglong Li Fei Gao Haixing Guan Kaichi Ma 33 24 0 09 Feb 2021
ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with a Two-Stage Deep Network Andong Li Wenzhe Liu Xiaoxue Luo C. Zheng Xiaodong Li 26 57 0 08 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism Jisi Zhang Catalin Zorila R. Doddipatla Jon Barker 22 13 0 07 Feb 2021
Real-time Denoising and Dereverberation with Tiny Recurrent U-Net Hyeong-Seok Choi Sungjin Park Jie Hwan Lee Hoon Heo Dongsuk Jeon Kyogu Lee 36 57 0 05 Feb 2021
Music source separation conditioned on 3D point clouds Francesc Lluís V. Chatziioannou A. Hofmann 3DPC 24 5 0 03 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework Yucheng Zhao Dacheng Yin Chong Luo Zhiyuan Zhao Chuanxin Tang Wenjun Zeng Zhengjun Zha SSL 11 6 0 03 Feb 2021
Towards efficient models for real-time deep noise suppression Sebastian Braun H. Gamper Chandan K. A. Reddy I. Tashev 21 104 0 22 Jan 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks Max W. Y. Lam Jun Wang Dan Su Dong Yu 43 29 0 13 Jan 2021
Neural Network-based Virtual Microphone Estimator Tsubasa Ochiai Marc Delcroix Tomohiro Nakatani Rintaro Ikeshita K. Kinoshita S. Araki 22 10 0 12 Jan 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation Z. Zhang Yong-mei Xu Meng Yu Shi-Xiong Zhang Lianwu Chen Donald Williamson Dong Yu 22 28 0 24 Dec 2020
A Synergistic Kalman- and Deep Postfiltering Approach to Acoustic Echo Cancellation Thomas Haubner Mhd Modar Halimeh Andreas Brendel Walter Kellermann 19 15 0 16 Dec 2020
Group Communication with Context Codec for Lightweight Source Separation Yi Luo Cong Han N. Mesgarani 26 20 0 14 Dec 2020
Towards speech enhancement using a variational U-Net architecture E. J. Nustede Jörn Anemüller 17 1 0 07 Dec 2020