ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.02508
  4. Cited By
SDR - half-baked or well done?

SDR - half-baked or well done?

6 November 2018
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
ArXivPDFHTML

Papers citing "SDR - half-baked or well done?"

50 / 614 papers shown
Title
On the Design and Training Strategies for RNN-based Online Neural Speech
  Separation Systems
On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems
Kai Li
Yi Luo
29
13
0
15 Jun 2022
Sampling Frequency Independent Dialogue Separation
Sampling Frequency Independent Dialogue Separation
Jouni Paulus
Matteo Torcoli
22
12
0
05 Jun 2022
SepIt: Approaching a Single Channel Speech Separation Bound
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
46
27
0
24 May 2022
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified
  Acoustic Echo Suppression And Speech Enhancement
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Meng Yu
Yong-mei Xu
Chunlei Zhang
Shizhong Zhang
Dong Yu
28
11
0
20 May 2022
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for
  Monaural Speech Dereverberation
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation
William Ravenscroft
Stefan Goetze
Thomas Hain
29
6
0
17 May 2022
A deep representation learning speech enhancement method using
  $β$-VAE
A deep representation learning speech enhancement method using βββ-VAE
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
27
2
0
11 May 2022
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level
  Quality
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Xu Tan
Jiawei Chen
Haohe Liu
Jian Cong
Chen Zhang
...
Lei He
Frank Soong
Tao Qin
Sheng Zhao
Tie-Yan Liu
46
213
0
09 May 2022
Acoustic echo suppression using a learning-based multi-frame minimum
  variance distortionless response filter
Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter
Y.-R. Tsai
Yicheng Hsu
M. Bai
21
1
0
07 May 2022
Mask-based Neural Beamforming for Moving Speakers with
  Self-Attention-based Tracking
Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
S. Araki
14
20
0
07 May 2022
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller
  Optimized for ASR Accuracy
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy
S. Panchapagesan
A. Narayanan
T. Shabestary
Shuai Shao
N. Howard
Alex Park
James Walker
A. Gruenstein
29
3
0
06 May 2022
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural
  Speech Enhancement
Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Andong Li
Shan You
Guochen Yu
C. Zheng
Xiaodong Li
38
26
0
30 Apr 2022
Meta-AF: Meta-Learning for Adaptive Filters
Meta-AF: Meta-Learning for Adaptive Filters
Jonah Casebeer
Nicholas J. Bryan
Paris Smaragdis
175
28
0
25 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of
  Unsupervised Speech Separation
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
28
6
0
23 Apr 2022
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency
Zhong-Qiu Wang
Gordon Wichern
Shinji Watanabe
Jonathan Le Roux
25
36
0
21 Apr 2022
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker
  Extraction
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Zifeng Zhao
Rongzhi Gu
Dongchao Yang
Jinchuan Tian
Yuexian Zou
33
2
0
15 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation
  System
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
27
20
0
14 Apr 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
Haohe Liu
Xubo Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
21
51
0
12 Apr 2022
Text-Driven Separation of Arbitrary Sounds
Text-Driven Separation of Arbitrary Sounds
Kevin Kilgour
Beat Gfeller
Qingqing Huang
A. Jansen
Scott Wisdom
Marco Tagliasacchi
30
30
0
12 Apr 2022
Listen only to me! How well can target speech extraction handle false
  alarms?
Listen only to me! How well can target speech extraction handle false alarms?
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Kateřina Žmolíková
Hiroshi Sato
Tomohiro Nakatani
34
15
0
11 Apr 2022
Multichannel Speech Separation with Narrow-band Conformer
Multichannel Speech Separation with Narrow-band Conformer
Changsheng Quan
Xiaofei Li
31
12
0
09 Apr 2022
Heterogeneous Target Speech Separation
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
51
26
0
07 Apr 2022
FFC-SE: Fast Fourier Convolution for Speech Enhancement
FFC-SE: Fast Fourier Convolution for Speech Enhancement
Ivan Shchekotov
Pavel Andreev
Oleg Ivanov
Aibek Alanov
Dmitry Vetrov
40
23
0
06 Apr 2022
Expression-preserving face frontalization improves visually assisted
  speech processing
Expression-preserving face frontalization improves visually assisted speech processing
Zhiqi Kang
M. Sadeghi
Radu Horaud
Xavier Alameda-Pineda
CVBM
28
8
0
06 Apr 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Tianyi Zhou
40
31
0
04 Apr 2022
Target Confusion in End-to-end Speaker Extraction: Analysis and
  Approaches
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Zifeng Zhao
Dongchao Yang
Rongzhi Gu
Haoran Zhang
Yuexian Zou
30
16
0
04 Apr 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech
  Separation for Flexible Number of Speakers
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Soumi Maiti
Yushi Ueda
Shinji Watanabe
Chunlei Zhang
Meng Yu
Shi-Xiong Zhang
Yong-mei Xu
42
33
0
31 Mar 2022
Speech Enhancement with Score-Based Generative Models in the Complex
  STFT Domain
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
33
110
0
31 Mar 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain
  Target Speaker Extraction
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Zexu Pan
Meng Ge
Haizhou Li
31
17
0
31 Mar 2022
Speaker Extraction with Co-Speech Gestures Cue
Speaker Extraction with Co-Speech Gestures Cue
Zexu Pan
Xinyuan Qian
Haizhou Li
SLR
31
27
0
31 Mar 2022
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length
Tal Peer
Timo Gerkmann
22
21
0
30 Mar 2022
Separate What You Describe: Language-Queried Audio Source Separation
Separate What You Describe: Language-Queried Audio Source Separation
Xubo Liu
Haohe Liu
Qiuqiang Kong
Xinhao Mei
Jinzheng Zhao
Qiushi Huang
Mark D. Plumbley
Wenwu Wang
42
58
0
28 Mar 2022
Improving Source Separation by Explicitly Modeling Dependencies Between
  Sources
Improving Source Separation by Explicitly Modeling Dependencies Between Sources
Ethan Manilow
Curtis Hawthorne
Cheng-Zhi Anna Huang
Bryan Pardo
Jesse Engel
BDL
28
7
0
28 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of
  Convolutional Network for Speaker-Independent Speech Separation
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Xue Yang
C. Bao
27
3
0
25 Mar 2022
HiFi++: a Unified Framework for Bandwidth Extension and Speech
  Enhancement
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement
Pavel Andreev
Aibek Alanov
Oleg Ivanov
Dmitry Vetrov
38
38
0
24 Mar 2022
Upmixing via style transfer: a variational autoencoder for disentangling
  spatial images and musical content
Upmixing via style transfer: a variational autoencoder for disentangling spatial images and musical content
Haici Yang
Sanna Wager
S. Russell
Mike Luo
Minje Kim
Wontak Kim
8
2
0
22 Mar 2022
RoSS: Utilizing Robotic Rotation for Audio Source Separation
RoSS: Utilizing Robotic Rotation for Audio Source Separation
Hyungjoo Seo
Sahil Bhandary Karnoor
Romit Roy Choudhury
28
0
0
18 Mar 2022
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel
  Speech Enhancement from Taylor's Approximation Theory
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory
Andong Li
Guochen Yu
C. Zheng
Xiaodong Li
17
10
0
14 Mar 2022
MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Andong Li
C. Zheng
Ziyang Zhang
Xiaodong Li
29
3
0
14 Mar 2022
Single microphone speaker extraction using unified time-frequency
  Siamese-Unet
Single microphone speaker extraction using unified time-frequency Siamese-Unet
Aviad Eisenberg
Sharon Gannot
Shlomo E. Chazan
32
3
0
06 Mar 2022
Integrating Statistical Uncertainty into Neural Network-Based Speech
  Enhancement
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement
Hu Fang
Tal Peer
S. Wermter
Timo Gerkmann
31
6
0
04 Mar 2022
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE
  Submission to The L3DAS22 Challenge
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Yen-Ju Lu
Samuele Cornell
Xuankai Chang
Wangyou Zhang
Chenda Li
Zhaoheng Ni
Zhong-Qiu Wang
Shinji Watanabe
19
28
0
24 Feb 2022
RemixIT: Continual self-training of speech enhancement models via
  bootstrapped remixing
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
30
54
0
17 Feb 2022
On loss functions and evaluation metrics for music source separation
On loss functions and evaluation metrics for music source separation
Enric Gusó
Jordi Pons
Santiago Pascual
Joan Serrà
22
19
0
16 Feb 2022
A Novel Speech Intelligibility Enhancement Model based on
  CanonicalCorrelation and Deep Learning
A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning
Tassadaq Hussain
Muhammad Diyan
M. Gogate
K. Dashtipour
Ahsan Adeel
Yu Tsao
Amir Hussain
AuLLM
21
3
0
11 Feb 2022
A Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning for Hearing-Assistive Technologies
Tassadaq Hussain
Muhammad Diyan
M. Gogate
K. Dashtipour
Ahsan Adeel
Yu Tsao
Amir Hussain
AuLLM
22
2
0
08 Feb 2022
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation
  Invariant Training
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training
Ertuğ Karamatlı
S. Kırbız
SSL
36
9
0
08 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
Exploring Self-Attention Mechanisms for Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
40
23
0
06 Feb 2022
Distortion Audio Effects: Learning How to Recover the Clean Signal
Distortion Audio Effects: Learning How to Recover the Clean Signal
Johannes Imort
Giorgio Fabbro
Marco A. Martínez-Ramírez
Stefan Uhlich
Yuichiro Koyama
Yuki Mitsufuji
11
10
0
03 Feb 2022
Active Audio-Visual Separation of Dynamic Sound Sources
Active Audio-Visual Separation of Dynamic Sound Sources
Sagnik Majumder
Kristen Grauman
29
21
0
02 Feb 2022
New Insights on Target Speaker Extraction
New Insights on Target Speaker Extraction
Mohamed Elminshawi
Wolfgang Mack
Srikanth Raj Chetupalli
Soumitro Chakrabarty
Emanuel Habets
19
18
0
01 Feb 2022
Previous
123...789...111213
Next