SDR - half-baked or well done?


6 November 2018
Jonathan Le Roux
Scott Wisdom
Hakan Erdogan
J. Hershey

Papers citing "SDR - half-baked or well done?"

50 / 614 papers shown
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Kuan-Lin Chen
Daniel D. E. Wong
Ke Tan
Buye Xu
Anurag Kumar
V. Ithapu
35
1
0
16 Nov 2022
Reverberation as Supervision for Speech Separation
R. Aralikatti
Christoph Boeddeker
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
26
7
0
15 Nov 2022
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
Chang-Bin Jeon
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
Kyogu Lee
22
5
0
14 Nov 2022
Optimal Condition Training for Target Source Separation
Efthymios Tzinis
Gordon Wichern
Paris Smaragdis
Jonathan Le Roux
42
5
0
11 Nov 2022
Egocentric Audio-Visual Noise Suppression
Roshan S. Sharma
Weipeng He
Ju Lin
Egor Lakomkin
Yang Liu
Kaustubh Kalgaonkar
EgoV
24
1
0
07 Nov 2022
Preserving background sound in noise-robust voice conversion via multi-task learning
Jixun Yao
Yi Lei
Qing Wang
Pengcheng Guo
Ziqian Ning
Linfu Xie
Hai Li
Junhui Liu
Danming Xie
44
10
0
06 Nov 2022
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
33
35
0
04 Nov 2022
Real-Time Target Sound Extraction
Bandhav Veluri
Justin Chan
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
44
30
0
04 Nov 2022
Iterative autoregression: a novel trick to improve your low-latency speech enhancement model
Pavel Andreev
Nicholas Babaev
Azat Saginbaev
Ivan Shchekotov
Aibek Alanov
29
4
0
03 Nov 2022
Fast and efficient speech enhancement with variational autoencoders
M. Sadeghi
Romain Serizel
DRL
BDL
11
2
0
02 Nov 2022
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
33
1
0
02 Nov 2022
Audio-visual speech enhancement with a deep Kalman filter generative model
A. Golmakani
M. Sadeghi
Romain Serizel
DiffM
11
6
0
02 Nov 2022
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings
Mohan Shi
Jie Zhang
Zhihao Du
Fan Yu
Qian Chen
Shiliang Zhang
Lirong Dai
51
4
0
01 Nov 2022
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting
Zexu Pan
Wupeng Wang
Marvin Borsdorf
Haizhou Li
14
10
0
31 Oct 2022
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
31
41
0
31 Oct 2022
Magnitude or Phase? A Two Stage Algorithm for Dereverberation
Ayal Schwartz
Sharon Gannot
Shlomo E. Chazan
11
0
0
31 Oct 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
Kashyap Patel
A. Kovalyov
Issa Panahi
24
6
0
28 Oct 2022
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata
Naoki Murata
Yuhta Takida
Toshimitsu Uesaka
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
DiffM
36
15
0
27 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
William Ravenscroft
Stefan Goetze
Thomas Hain
40
11
0
27 Oct 2022
Audio Signal Enhancement with Learning from Positive and Unlabelled Data
N. Ito
Masashi Sugiyama
21
7
0
27 Oct 2022
Parallel Gated Neural Network With Attention Mechanism For Speech Enhancement
Jia Cui
S. Bleeck
21
0
0
26 Oct 2022
EBEN: Extreme bandwidth extension network applied to speech signals captured with noise-resilient body-conduction microphones
J. Hauret
Thomas Joubaud
V. Zimpfer
Éric Bavu
9
9
0
25 Oct 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation
Xiaoyu Liu
Xu Li
Joan Serrà
44
9
0
23 Oct 2022
How to Leverage DNN-based speech enhancement for multi-channel speaker verification?
Sandipana Dowerah
Romain Serizel
D. Jouvet
Mohammad MohammadAmini
D. Matrouf
45
0
0
17 Oct 2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Junjie Li
Meng Ge
Zexu Pan
Longbiao Wang
J. Dang
18
10
0
09 Oct 2022
The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling
Yusong Wu
Josh Gardner
Ethan Manilow
Ian Simon
Curtis Hawthorne
Jesse Engel
40
10
0
28 Sep 2022
Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies
Junkai Wu
Jonah Casebeer
Nicholas J. Bryan
Paris Smaragdis
30
5
0
20 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
74
99
0
08 Sep 2022
Music Separation Enhancement with Generative Modeling
N. Schaffer
Boaz Cogan
Ethan Manilow
Max Morrison
Prem Seetharaman
Bryan Pardo
34
9
0
26 Aug 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
24
185
0
11 Aug 2022
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning
Haohe Liu
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Wenwu Wang
Mark D. Plumbley
28
9
0
21 Jul 2022
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation
Efthymios Tzinis
Scott Wisdom
Tal Remez
J. Hershey
41
30
0
20 Jul 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Yen-Ju Lu
Xuankai Chang
Chenda Li
Wangyou Zhang
Samuele Cornell
...
Robin Scheibler
Zhong-Qiu Wang
Yu Tsao
Y. Qian
Shinji Watanabe
VLM
26
28
0
19 Jul 2022
PodcastMix: A dataset for separating music and speech in podcasts
Nico M. Schmidt
Jordi Pons
M. Miron
25
2
0
15 Jul 2022
Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
Kouhei Sekiguchi
Aditya Arie Nugraha
Yicheng Du
Yoshiaki Bando
Mathieu Fontaine
Kazuyoshi Yoshii
18
8
0
15 Jul 2022
Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction
Zhongweiyang Xu
Xulin Fan
M. Hasegawa-Johnson
24
2
0
09 Jul 2022
Learning to Separate Voices by Spatial Regions
Alan Xu
Romit Roy Choudhury
39
10
0
09 Jul 2022
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation
Xiaoyu Wang
Xiangyu Kong
Xiulian Peng
Yan Lu
17
6
0
04 Jul 2022
Distance-Based Sound Separation
K. Patterson
K. Wilson
Scott Wisdom
J. Hershey
19
21
0
01 Jul 2022
An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions
Yeonjong Choi
Chao Xie
T. Toda
DiffM
38
2
0
30 Jun 2022
ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Ishan Chatterjee
Maruchi Kim
V. Jayaram
Shyamnath Gollakota
Ira Kemelmacher-Shlizerman
Shwetak N. Patel
S. M. Seitz
27
24
0
27 Jun 2022
Challenges and Opportunities in Multi-device Speech Processing
G. Ciccarelli
Jarred Barber
A. Nair
Israel Cohen
Tao Zhang
35
5
0
27 Jun 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira
Tal Peer
Timo Gerkmann
21
20
0
23 Jun 2022
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement
Kristina Tesch
Nils-Hendrik Mohrmann
Timo Gerkmann
30
6
0
22 Jun 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
22
0
0
20 Jun 2022
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
45
16
0
19 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
24
7
0
18 Jun 2022
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Chi-Chang Lee
Cheng-Hung Hu
Yu-Chen Lin
Chu-Song Chen
Hsin-Min Wang
Yu Tsao
41
2
0
18 Jun 2022
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios
Bang Zeng
Weiqing Wang
Yuanyuan Bao
Ming Li
27
0
0
17 Jun 2022
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li
Xiaolin Hu
Yi Luo
20
16
0
15 Jun 2022