Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.02508
Cited By
SDR - half-baked or well done?
6 November 2018
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDR - half-baked or well done?"
50 / 611 papers shown
Title
Variable Bitrate Residual Vector Quantization for Audio Coding
Yunkee Chae
Woosung Choi
Yuhta Takida
Junghyun Koo
Yukara Ikemiya
...
K. Cheuk
Marco A. Martínez-Ramírez
Kyogu Lee
Wei-Hsiang Liao
Yuki Mitsufuji
91
0
0
08 Oct 2024
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Xiang Hao
Chenxiang Ma
Qu Yang
Jibin Wu
Kay Chen Tan
28
0
0
07 Oct 2024
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
30
0
0
04 Oct 2024
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
Hsin-Tien Chiang
Hao Zhang
Yong Xu
Meng Yu
Dong Yu
33
1
0
02 Oct 2024
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Mohan Xu
Kai Li
Guo Chen
Xiaolin Hu
51
0
0
02 Oct 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
39
2
0
02 Oct 2024
Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions
Jinyi Mi
Xiaohan Shi
D. Ma
Jiajun He
Takuya Fujimura
Tomoki Toda
28
0
0
29 Sep 2024
The IEEE-IS2 2024 Music Packet Loss Concealment Challenge
Alessandro Ilic Mezza
Alberto Bernardini
19
0
0
27 Sep 2024
MC-SEMamba: A Simple Multi-channel Extension of SEMamba
Wen-Yuan Ting
Wenze Ren
Rong-Yu Chao
Hsin-Yi Lin
Yu Tsao
Fan-Gang Zeng
Mamba
42
0
0
26 Sep 2024
Towards Sub-millisecond Latency Real-Time Speech Enhancement Models on Hearables
Artem Dementyev
Chandan K. A. Reddy
Scott Wisdom
Navin Chatlani
J. Hershey
R. Lyon
20
0
0
26 Sep 2024
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration
Pin-Jui Ku
Alexander H. Liu
Roman Korostik
Sung-Feng Huang
Szu-Wei Fu
Ante Jukić
44
2
0
24 Sep 2024
Leveraging Audio-Only Data for Text-Queried Target Sound Extraction
Kohei Saijo
Janek Ebbers
François Germain
Sameer Khurana
Gordon Wichern
Jonathan Le Roux
44
1
0
20 Sep 2024
NDVQ: Robust Neural Audio Codec with Normal Distribution-Based Vector Quantization
Zhikang Niu
Sanyuan Chen
Long Zhou
Ziyang Ma
Xie Chen
Shujie Liu
29
2
0
19 Sep 2024
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement
Keying Zuo
Qingtian Xu
Jie Zhang
Zhenhua Ling
39
0
0
19 Sep 2024
Multichannel-to-Multichannel Target Sound Extraction Using Direction and Timestamp Clues
Dayun Choi
Jung-Woo Choi
42
0
0
19 Sep 2024
Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference
Edresson Casanova
Ryan Langman
Paarth Neekhara
Shehzeen Samarah Hussain
Jason Chun Lok Li
Subhankar Ghosh
Ante Jukić
Sang-gil Lee
AuLLM
44
2
0
18 Sep 2024
Learning Source Disentanglement in Neural Audio Codec
Xiaoyu Bie
Xubo Liu
Gaël Richard
34
1
0
17 Sep 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
51
0
0
15 Sep 2024
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction
Junjie Li
Ke Zhang
Shuai Wang
Haizhou Li
Man-Wai Mak
Kong Aik Lee
35
1
0
15 Sep 2024
Language-Queried Target Sound Extraction Without Parallel Training Data
Hao Ma
Zhiyuan Peng
Xu Li
Yukai Li
Mingjie Shao
Qiuqiang Kong
Xuelong Li
VLM
80
1
0
14 Sep 2024
Attention-Based Beamformer For Multi-Channel Speech Enhancement
Jinglin Bai
Hao Li
Xueliang Zhang
Fei Chen
25
0
0
10 Sep 2024
DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing
Kuang Yuan
Shuo Han
Swarun Kumar
Bhiksha Raj
37
2
0
10 Sep 2024
Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule
Siyi Wang
Siyi Liu
Andrew Harper
Paul Kendrick
Mathieu Salzmann
Milos Cernak
DiffM
40
2
0
08 Sep 2024
NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention
Dashanka De Silva
Siqi Cai
Saurav Pahuja
Tanja Schultz
Haizhou Li
41
0
0
04 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
Bang Zeng
Ming Li
45
3
0
04 Sep 2024
Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement
Tathagata Bandyopadhyay
ViT
20
0
0
02 Sep 2024
Progressive Residual Extraction based Pre-training for Speech Representation Learning
Tianrui Wang
Jin Li
Ziyang Ma
Rui Cao
Xie Chen
...
Meng Ge
Xiaobao Wang
Yuguang Wang
Jianwu Dang
Nyima Tashi
SSL
45
0
0
31 Aug 2024
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
K. Chen
Jiaqi Su
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Zeyu Jin
45
1
0
28 Aug 2024
A Hybrid Approach for Low-Complexity Joint Acoustic Echo and Noise Reduction
Shrishti Saha Shetu
Naveen Kumar Desiraju
Jose Miguel Martinez Aponte
Emanuël A. P. Habets
Edwin Mabande
39
2
0
28 Aug 2024
Comparative Analysis Of Discriminative Deep Learning-Based Noise Reduction Methods In Low SNR Scenarios
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
47
2
0
26 Aug 2024
Efficient Area-based and Speaker-Agnostic Source Separation
Martin Strauss
Okan Kopuklu
34
3
0
19 Aug 2024
Unsupervised Composable Representations for Audio
Giovanni Bindi
P. Esling
DiffM
OCL
CoGe
37
0
0
19 Aug 2024
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
Tao Sun
Sander Bohté
28
2
0
14 Aug 2024
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
Jean-Marie Lemercier
Eloi Moliner
Simon Welker
Vesa Valimaki
Timo Gerkmann
54
2
0
14 Aug 2024
Music2Latent: Consistency Autoencoders for Latent Audio Compression
Marco Pasini
Stefan Lattner
George Fazekas
24
7
0
12 Aug 2024
FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses
Zhongweiyang Xu
Ali Aroudi
Ke Tan
Ashutosh Pandey
Jung-Suk Lee
Buye Xu
Francesco Nesta
37
1
0
12 Aug 2024
Source Separation of Multi-source Raw Music using a Residual Quantized Variational Autoencoder
Leonardo Berti
DRL
40
0
0
12 Aug 2024
Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement
Runduo Han
Weiming Xu
Zihan Zhang
Mingshuai Liu
Lei Xie
40
1
0
08 Aug 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Kohei Saijo
Gordon Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
46
8
0
06 Aug 2024
Interaural time difference loss for binaural target sound extraction
Carlos Hernandez-Olivan
Marc Delcroix
Tsubasa Ochiai
Naohiro Tawara
Tomohiro Nakatani
Shoko Araki
26
1
0
01 Aug 2024
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement
Zhong-Qiu Wang
30
1
0
28 Jul 2024
RAVSS: Robust Audio-Visual Speech Separation in Multi-Speaker Scenarios with Missing Visual Cues
Tianrui Pan
Jie Liu
Bohan Wang
Jie Tang
Gangshan Wu
40
2
0
27 Jul 2024
Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Ying-Shuo Lee
Yueh-Po Peng
Jui-Te Wu
Ming Cheng
Li Su
Yi-Hsuan Yang
35
0
0
23 Jul 2024
Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
Robert Sutherland
George Close
Thomas Hain
Stefan Goetze
Jon Barker
36
1
0
18 Jul 2024
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis
Xilin Jiang
Yinghao Aaron Li
Adrian Nicolas Florea
Cong Han
N. Mesgarani
Mamba
51
10
0
13 Jul 2024
Speech dereverberation constrained on room impulse response characteristics
Louis Bahrman
Mathieu Fontaine
Jonathan Le Roux
Gaël Richard
44
1
0
10 Jul 2024
Knowledge boosting during low-latency inference
Vidya Srinivas
Malek Itani
Tuochao Chen
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
32
2
0
09 Jul 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
32
1
0
09 Jul 2024
Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
J. Lee
Jaehyun Park
Min Jun Choi
Kyogu Lee
42
2
0
07 Jul 2024
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Feiyang Xiao
Jian Guan
Qiaoxi Zhu
Xubo Liu
Wenbo Wang
Shuhan Qi
Kejia Zhang
Jianyuan Sun
Wenwu Wang
30
4
0
06 Jul 2024
Previous
1
2
3
4
5
...
11
12
13
Next