SDR - half-baked or well done?

6 November 2018

Papers citing "SDR - half-baked or well done?"

50 / 614 papers shown

Title
On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems Kai Li Yi Luo 29 13 0 15 Jun 2022
Sampling Frequency Independent Dialogue Separation Jouni Paulus Matteo Torcoli 22 12 0 05 Jun 2022
SepIt: Approaching a Single Channel Speech Separation Bound Shahar Lutati Eliya Nachmani Lior Wolf VLM 46 27 0 24 May 2022
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement Meng Yu Yong-mei Xu Chunlei Zhang Shizhong Zhang Dong Yu 28 11 0 20 May 2022
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation William Ravenscroft Stefan Goetze Thomas Hain 29 6 0 17 May 2022
A deep representation learning speech enhancement method using $β$ -VAE Yang Xiang Jesper Lisby Højvang M. Rasmussen M. G. Christensen DRL 27 2 0 11 May 2022
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality Xu Tan Jiawei Chen Haohe Liu Jian Cong Chen Zhang ... Lei He Frank Soong Tao Qin Sheng Zhao Tie-Yan Liu 46 213 0 09 May 2022
Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter Y.-R. Tsai Yicheng Hsu M. Bai 21 1 0 07 May 2022
Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking Tsubasa Ochiai Marc Delcroix Tomohiro Nakatani S. Araki 14 20 0 07 May 2022
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy S. Panchapagesan A. Narayanan T. Shabestary Shuai Shao N. Howard Alex Park James Walker A. Gruenstein 29 3 0 06 May 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement Andong Li Shan You Guochen Yu C. Zheng Xiaodong Li 38 26 0 30 Apr 2022
Meta-AF: Meta-Learning for Adaptive Filters Jonah Casebeer Nicholas J. Bryan Paris Smaragdis 175 28 0 25 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation Jiangyu Han Yanhua Long 28 6 0 23 Apr 2022
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency Zhong-Qiu Wang Gordon Wichern Shinji Watanabe Jonathan Le Roux 25 36 0 21 Apr 2022
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction Zifeng Zhao Rongzhi Gu Dongchao Yang Jinchuan Tian Yuexian Zou 33 2 0 15 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System M. Z. Ozturk Chenshu Wu Beibei Wang Min Wu K. Liu 27 20 0 14 Apr 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration Haohe Liu Xubo Liu Qiuqiang Kong Qiao Tian Yan Zhao DeLiang Wang Chuanzeng Huang Yuxuan Wang 21 51 0 12 Apr 2022
Text-Driven Separation of Arbitrary Sounds Kevin Kilgour Beat Gfeller Qingqing Huang A. Jansen Scott Wisdom Marco Tagliasacchi 30 30 0 12 Apr 2022
Listen only to me! How well can target speech extraction handle false alarms? Marc Delcroix K. Kinoshita Tsubasa Ochiai Kateřina Žmolíková Hiroshi Sato Tomohiro Nakatani 34 15 0 11 Apr 2022
Multichannel Speech Separation with Narrow-band Conformer Changsheng Quan Xiaofei Li 31 12 0 09 Apr 2022
Heterogeneous Target Speech Separation Hyunjae Cho Wonbin Jung Junhyeok Lee Paris Smaragdis Sanghyun Woo 51 26 0 07 Apr 2022
FFC-SE: Fast Fourier Convolution for Speech Enhancement Ivan Shchekotov Pavel Andreev Oleg Ivanov Aibek Alanov Dmitry Vetrov 40 23 0 06 Apr 2022
Expression-preserving face frontalization improves visually assisted speech processing Zhiqi Kang M. Sadeghi Radu Horaud Xavier Alameda-Pineda CVBM 28 8 0 06 Apr 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing Zhenyu Tang R. Aralikatti Anton Ratnarajah Tianyi Zhou 40 31 0 04 Apr 2022
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches Zifeng Zhao Dongchao Yang Rongzhi Gu Haoran Zhang Yuexian Zou 30 16 0 04 Apr 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers Soumi Maiti Yushi Ueda Shinji Watanabe Chunlei Zhang Meng Yu Shi-Xiong Zhang Yong-mei Xu 42 33 0 31 Mar 2022
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain Simon Welker Julius Richter Timo Gerkmann DiffM 33 110 0 31 Mar 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Zexu Pan Meng Ge Haizhou Li 31 17 0 31 Mar 2022
Speaker Extraction with Co-Speech Gestures Cue Zexu Pan Xinyuan Qian Haizhou Li SLR 31 27 0 31 Mar 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length Tal Peer Timo Gerkmann 22 21 0 30 Mar 2022
Separate What You Describe: Language-Queried Audio Source Separation Xubo Liu Haohe Liu Qiuqiang Kong Xinhao Mei Jinzheng Zhao Qiushi Huang Mark D. Plumbley Wenwu Wang 42 58 0 28 Mar 2022
Improving Source Separation by Explicitly Modeling Dependencies Between Sources Ethan Manilow Curtis Hawthorne Cheng-Zhi Anna Huang Bryan Pardo Jesse Engel BDL 28 7 0 28 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation Xue Yang C. Bao 27 3 0 25 Mar 2022
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement Pavel Andreev Aibek Alanov Oleg Ivanov Dmitry Vetrov 38 38 0 24 Mar 2022
Upmixing via style transfer: a variational autoencoder for disentangling spatial images and musical content Haici Yang Sanna Wager S. Russell Mike Luo Minje Kim Wontak Kim 8 2 0 22 Mar 2022
RoSS: Utilizing Robotic Rotation for Audio Source Separation Hyungjoo Seo Sahil Bhandary Karnoor Romit Roy Choudhury 28 0 0 18 Mar 2022
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory Andong Li Guochen Yu C. Zheng Xiaodong Li 17 10 0 14 Mar 2022
MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient Andong Li C. Zheng Ziyang Zhang Xiaodong Li 29 3 0 14 Mar 2022
Single microphone speaker extraction using unified time-frequency Siamese-Unet Aviad Eisenberg Sharon Gannot Shlomo E. Chazan 32 3 0 06 Mar 2022
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement Hu Fang Tal Peer S. Wermter Timo Gerkmann 31 6 0 04 Mar 2022
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge Yen-Ju Lu Samuele Cornell Xuankai Chang Wangyou Zhang Chenda Li Zhaoheng Ni Zhong-Qiu Wang Shinji Watanabe 19 28 0 24 Feb 2022
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing Efthymios Tzinis Yossi Adi V. Ithapu Buye Xu Paris Smaragdis Anurag Kumar CLL 30 54 0 17 Feb 2022
On loss functions and evaluation metrics for music source separation Enric Gusó Jordi Pons Santiago Pascual Joan Serrà 22 19 0 16 Feb 2022
A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning Tassadaq Hussain Muhammad Diyan M. Gogate K. Dashtipour Ahsan Adeel Yu Tsao Amir Hussain AuLLM 21 3 0 11 Feb 2022
A Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning for Hearing-Assistive Technologies Tassadaq Hussain Muhammad Diyan M. Gogate K. Dashtipour Ahsan Adeel Yu Tsao Amir Hussain AuLLM 22 2 0 08 Feb 2022
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training Ertuğ Karamatlı S. Kırbız SSL 36 9 0 08 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation Cem Subakan Mirco Ravanelli Samuele Cornell François Grondin Mirko Bronzi 40 23 0 06 Feb 2022
Distortion Audio Effects: Learning How to Recover the Clean Signal Johannes Imort Giorgio Fabbro Marco A. Martínez-Ramírez Stefan Uhlich Yuichiro Koyama Yuki Mitsufuji 11 10 0 03 Feb 2022
Active Audio-Visual Separation of Dynamic Sound Sources Sagnik Majumder Kristen Grauman 29 21 0 02 Feb 2022
New Insights on Target Speaker Extraction Mohamed Elminshawi Wolfgang Mack Srikanth Raj Chetupalli Soumitro Chakrabarty Emanuel Habets 19 18 0 01 Feb 2022