SDR - half-baked or well done?


6 November 2018
Jonathan Le Roux
Scott Wisdom
Hakan Erdogan
J. Hershey

Papers citing "SDR - half-baked or well done?"

50 / 611 papers shown
Fast Random Approximation of Multi-channel Room Impulse Response
Yi Luo
Rongzhi Gu
20
4
0
17 Apr 2023
On Data Sampling Strategies for Training Neural Network Speech Separation Models
William Ravenscroft
Stefan Goetze
Thomas Hain
VLM
24
6
0
14 Apr 2023
Partially Adaptive Multichannel Joint Reduction of Ego-noise and Environmental Noise
Hu Fang
Niklas Wittmer
Johannes Twiefel
S. Wermter
Timo Gerkmann
33
3
0
27 Mar 2023
Better Together: Dialogue Separation and Voice Activity Detection for Audio Personalization in TV
Matteo Torcoli
Emanuel Habets
30
3
0
23 Mar 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
Alessio Brutti
S. Squartini
23
4
0
21 Mar 2023
Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture
Julien Hauret
Thomas Joubaud
V. Zimpfer
Éric Bavu
21
6
0
17 Mar 2023
The Intel Neuromorphic DNS Challenge
Jonathan Timcheck
S. Shrestha
D. B. Rubin
A. Kupryjanow
Garrick Orchard
Lukasz Pindor
Timothy M. Shea
Mike Davies
39
27
0
16 Mar 2023
Beamformer-Guided Target Speaker Extraction
Mohamed Elminshawi
Srikanth Raj Chetupalli
Emanuel Habets
35
7
0
15 Mar 2023
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments
Julian Neri
Sebastian Braun
19
1
0
14 Mar 2023
Multi-Microphone Speaker Separation by Spatial Regions
Julian Wechsler
Srikanth Raj Chetupalli
Wolfgang Mack
Emanuel Habets
34
10
0
13 Mar 2023
Distribution Preserving Source Separation With Time Frequency Predictive Models
Pedro J. Villasana T
J. Klejsa
Lars Villemoes
P. Hedelin
27
2
0
10 Mar 2023
X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion
Kai Liu
Z.C. Du
Xucheng Wan
Huan Zhou
52
21
0
09 Mar 2023
Speech Modeling with a Hierarchical Transformer Dynamical VAE
Xiaoyu Lin
Xiaoyu Bie
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
BDL
50
2
0
07 Mar 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
31
5
0
07 Mar 2023
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments
Zhaoxi Mu
Xinyu Yang
Xiangyuan Yang
Wenjing Zhu
18
5
0
07 Mar 2023
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Jean-Marie Lemercier
Julian Tobergte
Timo Gerkmann
24
2
0
01 Mar 2023
Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement
Bunlong Lay
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
16
24
0
28 Feb 2023
3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Rongzhi Gu
Shi-Xiong Zhang
Dong Yu
14
2
0
27 Feb 2023
DFSNet: A Steerable Neural Beamformer Invariant to Microphone Array Configuration for Real-Time, Low-Latency Speech Enhancement
A. Kovalyov
Kashyap Patel
Issa Panahi
31
3
0
26 Feb 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
Shengkui Zhao
Bin Ma
38
53
0
23 Feb 2023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation
Yuchen Hu
Chen Chen
Heqing Zou
Xionghu Zhong
Chng Eng Siong
50
16
0
22 Feb 2023
Deep AHS: A Deep Learning Approach to Acoustic Howling Suppression
Huatian Zhang
Meng Yu
Dong Yu
34
8
0
18 Feb 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
30
38
0
04 Feb 2023
Relating EEG to continuous speech using deep neural networks: a review
Corentin Puffay
Bernd Accou
Lies Bollens
Mohammad Jalilpour-Monesi
Jonas Vanthornhout
Hugo Van hamme
T. Francart
19
41
0
03 Feb 2023
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Naoki Murata
Koichi Saito
Chieh-Hsin Lai
Yuhta Takida
Toshimitsu Uesaka
Yuki Mitsufuji
Stefano Ermon
DiffM
56
49
0
30 Jan 2023
NeuralKalman: A Learnable Kalman Filter for Acoustic Echo Cancellation
Yixuan Zhang
Meng Yu
Huatian Zhang
Dong Yu
DeLiang Wang
41
7
0
29 Jan 2023
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati
Eliya Nachmani
Lior Wolf
DiffM
38
14
0
25 Jan 2023
On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems
Philippe Gonzalez
T. S. Alstrøm
Tobias May
24
9
0
25 Jan 2023
Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
George Close
William Ravenscroft
Thomas Hain
Stefan Goetze
SSL
38
12
0
11 Jan 2023
Rethinking complex-valued deep neural networks for monaural speech enhancement
Haibin Wu
Ke Tan
Buye Xu
Anurag Kumar
Daniel D. E. Wong
29
6
0
11 Jan 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
155
81
0
22 Dec 2022
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
Kai Li
Fenghua Xie
Hang Chen
K. Yuan
Xiaolin Hu
34
14
0
21 Dec 2022
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Rongzhi Gu
Shi-Xiong Zhang
Yuexian Zou
Dong Yu
AI4TS
30
24
0
16 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Dongheon Lee
Jung-Woo Choi
32
25
0
15 Dec 2022
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Zhong-Qiu Wang
Jonathan Le Roux
27
10
0
14 Dec 2022
Hyperbolic Audio Source Separation
Darius Petermann
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
27
10
0
09 Dec 2022
NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer
Changsheng Quan
Xiaofei Li
35
2
0
05 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
37
21
0
01 Dec 2022
A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem
Andong Li
Guochen Yu
C. Zheng
Wenzhe Liu
Xiaodong Li
48
10
0
30 Nov 2022
JaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Tomohiko Nakamura
Shinnosuke Takamichi
Naoko Tanji
Satoru Fukayama
Hiroshi Saruwatari
23
4
0
29 Nov 2022
Neural Vocoder Feature Estimation for Dry Singing Voice Separation
Jae-Yeol Im
Soonbeom Choi
Sangeon Yong
Juhan Nam
32
1
0
29 Nov 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
48
121
0
22 Nov 2022
TaylorBeamixer: Learning Taylor-Inspired All-Neural Multi-Channel Speech Enhancement from Beam-Space Dictionary Perspective
Andong Li
Guochen Yu
Wenzhe Liu
Xiaodong Li
C. Zheng
32
2
0
22 Nov 2022
Latent Iterative Refinement for Modular Source Separation
Dimitrios Bralios
Efthymios Tzinis
Gordon Wichern
Paris Smaragdis
Jonathan Le Roux
BDL
33
5
0
22 Nov 2022
Self-Remixing: Unsupervised Speech Separation via Separation and Remixing
Kohei Saijo
Tetsuji Ogawa
SSL
22
11
0
18 Nov 2022
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
25
5
0
16 Nov 2022
Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Yicheng Hsu
Yonghan Lee
M. Bai
32
2
0
16 Nov 2022
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Kuan-Lin Chen
Daniel D. E. Wong
Ke Tan
Buye Xu
Anurag Kumar
V. Ithapu
35
1
0
16 Nov 2022
Reverberation as Supervision for Speech Separation
R. Aralikatti
Christoph Boeddeker
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
24
7
0
15 Nov 2022
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
Chang-Bin Jeon
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
Kyogu Lee
20
5
0
14 Nov 2022