ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.02508
  4. Cited By
SDR - half-baked or well done?

SDR - half-baked or well done?

6 November 2018
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
ArXivPDFHTML

Papers citing "SDR - half-baked or well done?"

50 / 611 papers shown
Title
Boosting Unknown-number Speaker Separation with Transformer
  Decoder-based Attractor
Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor
Younglo Lee
Shukjae Choi
Byeonghak Kim
Zhong-Qiu Wang
Shinji Watanabe
MoE
16
9
0
23 Jan 2024
Resource-constrained stereo singing voice cancellation
Resource-constrained stereo singing voice cancellation
Clara Borrelli
James Rae
Dogac Basaran
Matt McVicar
M. Souden
Matthias Mauch
28
0
0
22 Jan 2024
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Jae-Yeol Im
Juhan Nam
DiffM
20
3
0
16 Jan 2024
Hyperbolic Distance-Based Speech Separation
Hyperbolic Distance-Based Speech Separation
Darius Petermann
Minje Kim
47
4
0
07 Jan 2024
Single-Microphone Speaker Separation and Voice Activity Detection in
  Noisy and Reverberant Environments
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments
Renana Opochinsky
Mordehay Moradi
Sharon Gannot
28
4
0
07 Jan 2024
Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise
  learning with Remixing
Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing
Li Li
Shogo Seki
33
2
0
28 Dec 2023
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for
  Enhanced Time-Domain Monaural Speech Separation
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Shengkui Zhao
Yukun Ma
Chongjia Ni
Chong Zhang
Hao Wang
Trung Hieu Nguyen
Kun Zhou
J. Yip
Dianwen Ng
Bin Ma
38
23
0
19 Dec 2023
A Refining Underlying Information Framework for Monaural Speech
  Enhancement
A Refining Underlying Information Framework for Monaural Speech Enhancement
Rui Cao
Tianrui Wang
Meng Ge
Longbiao Wang
Jianwu Dang
28
1
0
18 Dec 2023
Self-Supervised Disentangled Representation Learning for Robust Target
  Speech Extraction
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction
Zhaoxi Mu
Xinyu Yang
Sining Sun
Qing Yang
SSL
28
9
0
16 Dec 2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
Xueyao Zhang
Liumeng Xue
Yicheng Gu
Yuancheng Wang
Haorui He
...
Mingxuan Wang
Jun Han
Kai Chen
Haizhou Li
Zhizheng Wu
31
29
0
15 Dec 2023
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric
  Prediction for Speech Enhancement
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement
George Close
William Ravenscroft
Thomas Hain
Stefan Goetze
34
2
0
14 Dec 2023
Ultra Low Complexity Deep Learning Based Noise Suppression
Ultra Low Complexity Deep Learning Based Noise Suppression
Shrishti Saha Shetu
Soumitro Chakrabarty
Oliver Thiergart
E. Mabande
31
8
0
13 Dec 2023
NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint
  Auditory Attention Detection
NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection
Zexu Pan
Gordon Wichern
François Germain
Sameer Khurana
Jonathan Le Roux
36
8
0
12 Dec 2023
Binaural multichannel blind speaker separation with a causal low-latency
  and low-complexity approach
Binaural multichannel blind speaker separation with a causal low-latency and low-complexity approach
Nils L. Westhausen
Bernd T. Meyer
BDL
43
3
0
08 Dec 2023
Investigating the Design Space of Diffusion Models for Speech
  Enhancement
Investigating the Design Space of Diffusion Models for Speech Enhancement
Philippe Gonzalez
Zheng-Hua Tan
Jan Østergaard
Jesper Jensen
T. S. Alstrøm
Tobias May
DiffM
33
6
0
07 Dec 2023
Mixture of Dynamical Variational Autoencoders for Multi-Source
  Trajectory Modeling and Separation
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Xiaoyu Lin
Laurent Girin
Xavier Alameda-Pineda
24
2
0
07 Dec 2023
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker
  Verification Models
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models
Chi-Chang Lee
Hong-Wei Chen
Chu-Song Chen
Hsin-Min Wang
Tsung-Te Liu
Yu Tsao
30
1
0
28 Nov 2023
D4AM: A General Denoising Framework for Downstream Acoustic Models
D4AM: A General Denoising Framework for Downstream Acoustic Models
H. Wang
Yu Tsao
Hsin-Min Wang
Chu-Song Chen
21
4
0
28 Nov 2023
Self-Supervised Music Source Separation Using Vector-Quantized Source
  Category Estimates
Self-Supervised Music Source Separation Using Vector-Quantized Source Category Estimates
Marco Pasini
Stefan Lattner
George Fazekas
35
1
0
21 Nov 2023
HPCNeuroNet: Advancing Neuromorphic Audio Signal Processing with
  Transformer-Enhanced Spiking Neural Networks
HPCNeuroNet: Advancing Neuromorphic Audio Signal Processing with Transformer-Enhanced Spiking Neural Networks
Murat Isik
Hiruna Vishwamith
Kayode Inadagbo
I. C. Dikmen
25
6
0
21 Nov 2023
Improving Label Assignments Learning by Dynamic Sample Dropout Combined
  with Layer-wise Optimization in Speech Separation
Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech Separation
Chenyu Gao
Yue Gu
I. Marsic
30
0
0
20 Nov 2023
Semantic Hearing: Programming Acoustic Scenes with Binaural Hearables
Semantic Hearing: Programming Acoustic Scenes with Binaural Hearables
Bandhav Veluri
Malek Itani
Justin Chan
Takuya Yoshioka
Shyamnath Gollakota
31
15
0
01 Nov 2023
Seeing Through the Conversation: Audio-Visual Speech Separation based on
  Diffusion Model
Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model
Suyeon Lee
Chaeyoung Jung
Youngjoon Jang
Jaehun Kim
Joon Son Chung
35
7
0
30 Oct 2023
Generative Pre-training for Speech with Flow Matching
Generative Pre-training for Speech with Flow Matching
Alexander H. Liu
Matt Le
Apoorv Vyas
Bowen Shi
Andros Tjandra
Wei-Ning Hsu
27
31
0
25 Oct 2023
LC-TTFS: Towards Lossless Network Conversion for Spiking Neural Networks
  with TTFS Coding
LC-TTFS: Towards Lossless Network Conversion for Spiking Neural Networks with TTFS Coding
Qu Yang
Malu Zhang
Jibin Wu
Kay Chen Tan
Haizhou Li
32
9
0
23 Oct 2023
Real-time Speech Enhancement and Separation with a Unified Deep Neural
  Network for Single/Dual Talker Scenarios
Real-time Speech Enhancement and Separation with a Unified Deep Neural Network for Single/Dual Talker Scenarios
Kashyap Patel
A. Kovalyov
Issa Panahi
15
0
0
16 Oct 2023
CORN: Co-Trained Full- And No-Reference Speech Quality Assessment
CORN: Co-Trained Full- And No-Reference Speech Quality Assessment
Pranay Manocha
Donald Williamson
Adam Finkelstein
19
2
0
13 Oct 2023
A Single Speech Enhancement Model Unifying Dereverberation, Denoising,
  Speaker Counting, Separation, and Extraction
A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction
Kohei Saijo
Wangyou Zhang
Zhong-Qiu Wang
Shinji Watanabe
Tetsunori Kobayashi
Tetsuji Ogawa
VLM
28
6
0
12 Oct 2023
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker
  Extraction
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction
Xiang Hao
Jibin Wu
Jianwei Yu
Chenglin Xu
Kay Chen Tan
32
10
0
11 Oct 2023
On Time Domain Conformer Models for Monaural Speech Separation in Noisy
  Reverberant Acoustic Environments
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
William Ravenscroft
Stefan Goetze
Thomas Hain
38
7
0
09 Oct 2023
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction
Jiarui Hai
Helin Wang
Dongchao Yang
Karan Thakkar
Najim Dehak
Mounya Elhilali
DiffM
31
7
0
06 Oct 2023
MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice
  Enhancement
MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice Enhancement
Weiming Xu
Zhouxuan Chen
Zhili Tan
Shubo Lv
Ru Han
Wenjiang Zhou
Weifeng Zhao
Lei Xie
27
2
0
06 Oct 2023
GASS: Generalizing Audio Source Separation with Large-scale Data
GASS: Generalizing Audio Source Separation with Large-scale Data
Jordi Pons
Xiaoyu Liu
Santiago Pascual
Joan Serrà
18
12
0
29 Sep 2023
Toward Universal Speech Enhancement for Diverse Input Conditions
Toward Universal Speech Enhancement for Diverse Input Conditions
Wangyou Zhang
Kohei Saijo
Zhong-Qiu Wang
Shinji Watanabe
Yanmin Qian
VLM
32
19
0
29 Sep 2023
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual
  Speech Separation
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
Samuel Pegg
Kai Li
Xiaolin Hu
32
4
0
29 Sep 2023
Advancing Acoustic Howling Suppression through Recursive Training of
  Neural Networks
Advancing Acoustic Howling Suppression through Recursive Training of Neural Networks
Huatian Zhang
Yixuan Zhang
Meng Yu
Dong Yu
24
3
0
27 Sep 2023
Directional Source Separation for Robust Speech Recognition on Smart
  Glasses
Directional Source Separation for Robust Speech Recognition on Smart Glasses
Tiantian Feng
Ju Lin
Yiteng Huang
Weipeng He
Kaustubh Kalgaonkar
Niko Moritz
Liting Wan
Xin Lei
Ming Sun
Frank Seide
18
4
0
20 Sep 2023
Diffusion-based speech enhancement with a weighted generative-supervised
  learning loss
Diffusion-based speech enhancement with a weighted generative-supervised learning loss
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
DiffM
33
8
0
19 Sep 2023
Unsupervised speech enhancement with diffusion-based generative models
Unsupervised speech enhancement with diffusion-based generative models
Berné Nortier
Mostafa Sadeghi
Romain Serizel
DiffM
29
7
0
19 Sep 2023
Posterior sampling algorithms for unsupervised speech enhancement with
  recurrent variational autoencoder
Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder
Mostafa Sadeghi
Romain Serizel
BDL
13
0
0
19 Sep 2023
Single and Few-step Diffusion for Generative Speech Enhancement
Single and Few-step Diffusion for Generative Speech Enhancement
Bunlong Lay
Jean-Marie Lemercier
Julius Richter
Timo Gerkmann
DiffM
27
9
0
18 Sep 2023
Audio-Visual Active Speaker Extraction for Sparsely Overlapped
  Multi-talker Speech
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
35
5
0
15 Sep 2023
Two-Step Knowledge Distillation for Tiny Speech Enhancement
Two-Step Knowledge Distillation for Tiny Speech Enhancement
Rayan Daod Nathoo
M. Kegler
Marko Stamenovic
19
4
0
15 Sep 2023
AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised
  Features for Audio-Visual Speech Enhancement
AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
Ju-Chieh Chou
Chung-Ming Chien
Karen Livescu
DiffM
26
4
0
14 Sep 2023
VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple
  Guidance
VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance
Carlos Hernandez-Olivan
Koichi Saito
Naoki Murata
Chieh-Hsin Lai
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Yuki Mitsufuji
DiffM
25
8
0
13 Sep 2023
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Qinghua Liu
Meng Ge
Zhizheng Wu
Haizhou Li
29
0
0
13 Sep 2023
Assessing the Generalization Gap of Learning-Based Speech Enhancement
  Systems in Noisy and Reverberant Environments
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
Philippe Gonzalez
T. S. Alstrøm
Tobias May
28
13
0
12 Sep 2023
Addressing Feature Imbalance in Sound Source Separation
Addressing Feature Imbalance in Sound Source Separation
Jaechang Kim
Jeongyeon Hwang
Soheun Yi
Jaewoong Cho
Jungseul Ok
22
0
0
11 Sep 2023
Causal Signal-Based DCCRN with Overlapped-Frame Prediction for Online
  Speech Enhancement
Causal Signal-Based DCCRN with Overlapped-Frame Prediction for Online Speech Enhancement
Julitta Bartolewska
Stanisław Kacprzak
K. Kowalczyk
31
2
0
07 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source
  Separation
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Aaron J. Hipple. Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
32
5
0
05 Sep 2023
Previous
12345...111213
Next