SDR - half-baked or well done?

6 November 2018
Jonathan Le Roux
Scott Wisdom
Hakan Erdogan
J. Hershey

Papers citing "SDR - half-baked or well done?"

50 / 611 papers shown
Title
Variable Bitrate Residual Vector Quantization for Audio Coding
Yunkee Chae
Woosung Choi
Yuhta Takida
Junghyun Koo
Yukara Ikemiya
...
K. Cheuk
Marco A. Martínez-Ramírez
Kyogu Lee
Wei-Hsiang Liao
Yuki Mitsufuji
91
0
0
08 Oct 2024
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Xiang Hao
Chenxiang Ma
Qu Yang
Jibin Wu
Kay Chen Tan
28
0
0
07 Oct 2024
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
30
0
0
04 Oct 2024
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
Hsin-Tien Chiang
Hao Zhang
Yong Xu
Meng Yu
Dong Yu
33
1
0
02 Oct 2024
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Mohan Xu
Kai Li
Guo Chen
Xiaolin Hu
51
0
0
02 Oct 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
39
2
0
02 Oct 2024
Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions
Jinyi Mi
Xiaohan Shi
D. Ma
Jiajun He
Takuya Fujimura
Tomoki Toda
28
0
0
29 Sep 2024
The IEEE-IS2 2024 Music Packet Loss Concealment Challenge
Alessandro Ilic Mezza
Alberto Bernardini
19
0
0
27 Sep 2024
MC-SEMamba: A Simple Multi-channel Extension of SEMamba
Wen-Yuan Ting
Wenze Ren
Rong-Yu Chao
Hsin-Yi Lin
Yu Tsao
Fan-Gang Zeng
Mamba
42
0
0
26 Sep 2024
Towards Sub-millisecond Latency Real-Time Speech Enhancement Models on Hearables
Artem Dementyev
Chandan K. A. Reddy
Scott Wisdom
Navin Chatlani
J. Hershey
R. Lyon
20
0
0
26 Sep 2024
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration
Pin-Jui Ku
Alexander H. Liu
Roman Korostik
Sung-Feng Huang
Szu-Wei Fu
Ante Jukić
44
2
0
24 Sep 2024
Leveraging Audio-Only Data for Text-Queried Target Sound Extraction
Kohei Saijo
Janek Ebbers
François Germain
Sameer Khurana
Gordon Wichern
Jonathan Le Roux
44
1
0
20 Sep 2024
NDVQ: Robust Neural Audio Codec with Normal Distribution-Based Vector Quantization
Zhikang Niu
Sanyuan Chen
Long Zhou
Ziyang Ma
Xie Chen
Shujie Liu
29
2
0
19 Sep 2024
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement
Keying Zuo
Qingtian Xu
Jie Zhang
Zhenhua Ling
39
0
0
19 Sep 2024
Multichannel-to-Multichannel Target Sound Extraction Using Direction and Timestamp Clues
Dayun Choi
Jung-Woo Choi
42
0
0
19 Sep 2024
Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference
Edresson Casanova
Ryan Langman
Paarth Neekhara
Shehzeen Samarah Hussain
Jason Chun Lok Li
Subhankar Ghosh
Ante Jukić
Sang-gil Lee
AuLLM
44
2
0
18 Sep 2024
Learning Source Disentanglement in Neural Audio Codec
Xiaoyu Bie
Xubo Liu
Gaël Richard
34
1
0
17 Sep 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
51
0
0
15 Sep 2024
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction
Junjie Li
Ke Zhang
Shuai Wang
Haizhou Li
Man-Wai Mak
Kong Aik Lee
35
1
0
15 Sep 2024
Language-Queried Target Sound Extraction Without Parallel Training Data
Hao Ma
Zhiyuan Peng
Xu Li
Yukai Li
Mingjie Shao
Qiuqiang Kong
Xuelong Li
VLM
80
1
0
14 Sep 2024
Attention-Based Beamformer For Multi-Channel Speech Enhancement
Jinglin Bai
Hao Li
Xueliang Zhang
Fei Chen
25
0
0
10 Sep 2024
DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing
Kuang Yuan
Shuo Han
Swarun Kumar
Bhiksha Raj
37
2
0
10 Sep 2024
Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule
Siyi Wang
Siyi Liu
Andrew Harper
Paul Kendrick
Mathieu Salzmann
Milos Cernak
DiffM
40
2
0
08 Sep 2024
NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention
Dashanka De Silva
Siqi Cai
Saurav Pahuja
Tanja Schultz
Haizhou Li
41
0
0
04 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
Bang Zeng
Ming Li
45
3
0
04 Sep 2024
Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement
Tathagata Bandyopadhyay
ViT
20
0
0
02 Sep 2024
Progressive Residual Extraction based Pre-training for Speech Representation Learning
Tianrui Wang
Jin Li
Ziyang Ma
Rui Cao
Xie Chen
...
Meng Ge
Xiaobao Wang
Yuguang Wang
Jianwu Dang
Nyima Tashi
SSL
45
0
0
31 Aug 2024
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
K. Chen
Jiaqi Su
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Zeyu Jin
45
1
0
28 Aug 2024
A Hybrid Approach for Low-Complexity Joint Acoustic Echo and Noise Reduction
Shrishti Saha Shetu
Naveen Kumar Desiraju
Jose Miguel Martinez Aponte
Emanuël A. P. Habets
Edwin Mabande
39
2
0
28 Aug 2024
Comparative Analysis Of Discriminative Deep Learning-Based Noise Reduction Methods In Low SNR Scenarios
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
47
2
0
26 Aug 2024
Efficient Area-based and Speaker-Agnostic Source Separation
Martin Strauss
Okan Kopuklu
34
3
0
19 Aug 2024
Unsupervised Composable Representations for Audio
Giovanni Bindi
P. Esling
DiffM
OCL
CoGe
37
0
0
19 Aug 2024
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
Tao Sun
Sander Bohté
28
2
0
14 Aug 2024
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
Jean-Marie Lemercier
Eloi Moliner
Simon Welker
Vesa Valimaki
Timo Gerkmann
54
2
0
14 Aug 2024
Music2Latent: Consistency Autoencoders for Latent Audio Compression
Marco Pasini
Stefan Lattner
George Fazekas
24
7
0
12 Aug 2024
FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses
Zhongweiyang Xu
Ali Aroudi
Ke Tan
Ashutosh Pandey
Jung-Suk Lee
Buye Xu
Francesco Nesta
37
1
0
12 Aug 2024
Source Separation of Multi-source Raw Music using a Residual Quantized Variational Autoencoder
Leonardo Berti
DRL
40
0
0
12 Aug 2024
Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement
Runduo Han
Weiming Xu
Zihan Zhang
Mingshuai Liu
Lei Xie
40
1
0
08 Aug 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Kohei Saijo
Gordon Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
46
8
0
06 Aug 2024
Interaural time difference loss for binaural target sound extraction
Carlos Hernandez-Olivan
Marc Delcroix
Tsubasa Ochiai
Naohiro Tawara
Tomohiro Nakatani
Shoko Araki
26
1
0
01 Aug 2024
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement
Zhong-Qiu Wang
30
1
0
28 Jul 2024
RAVSS: Robust Audio-Visual Speech Separation in Multi-Speaker Scenarios with Missing Visual Cues
Tianrui Pan
Jie Liu
Bohan Wang
Jie Tang
Gangshan Wu
40
2
0
27 Jul 2024
Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Ying-Shuo Lee
Yueh-Po Peng
Jui-Te Wu
Ming Cheng
Li Su
Yi-Hsuan Yang
35
0
0
23 Jul 2024
Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
Robert Sutherland
George Close
Thomas Hain
Stefan Goetze
Jon Barker
36
1
0
18 Jul 2024
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis
Xilin Jiang
Yinghao Aaron Li
Adrian Nicolas Florea
Cong Han
N. Mesgarani
Mamba
51
10
0
13 Jul 2024
Speech dereverberation constrained on room impulse response characteristics
Louis Bahrman
Mathieu Fontaine
Jonathan Le Roux
Gaël Richard
44
1
0
10 Jul 2024
Knowledge boosting during low-latency inference
Vidya Srinivas
Malek Itani
Tuochao Chen
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
32
2
0
09 Jul 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
32
1
0
09 Jul 2024
Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
J. Lee
Jaehyun Park
Min Jun Choi
Kyogu Lee
42
2
0
07 Jul 2024
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Feiyang Xiao
Jian Guan
Qiaoxi Zhu
Xubo Liu
Wenbo Wang
Shuhan Qi
Kejia Zhang
Jianyuan Sun
Wenwu Wang
30
4
0
06 Jul 2024