arXiv:2211.12433
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
22 November 2022
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
Papers citing
"TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation"
50 / 68 papers shown
Unsupervised Blind Speech Separation with a Diffusion Prior
Zhongweiyang Xu
Xulin Fan
Zhong-Qiu Wang
Xilin Jiang
Romit Roy Choudhury
DiffM
38
0
0
08 May 2025
Listen to Extract: Onset-Prompted Target Speaker Extraction
Pengjie Shen
Kangrui Chen
Shulin He
Pengru Chen
Shuqi Yuan
He Kong
Xueliang Zhang
Z. Wang
46
0
0
08 May 2025
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
Kohei Saijo
Tetsuji Ogawa
45
1
0
28 Apr 2025
SonicSieve: Bringing Directional Speech Extraction to Smartphones Using Acoustic Microstructures
Kuang Yuan
Yifeng Wang
Xiyuxing Zhang
Chengyi Shen
Swarun Kumar
Justin Chan
26
0
0
15 Apr 2025
Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition
Yufeng Yang
H. Taherian
Vahid Ahmadi Kalkhorani
DeLiang Wang
37
0
0
23 Mar 2025
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
Boyi Kang
Xinfa Zhu
Zihan Zhang
Zhen Ye
Mingshuai Liu
...
Jun Chen
Longshuai Xiao
Chao Weng
Wei Xue
Lei Xie
AuLLM
55
3
0
01 Mar 2025
Improving Speech Enhancement by Cross- and Sub-band Processing with State Space Model
Jizhen Li
Weiping Tu
Yuhong Yang
Xinmeng Xu
Yiqun Zhang
Yanzhen Ren
Mamba
38
0
0
22 Feb 2025
Summary of the NOTSOFAR-1 Challenge: Highlights and Learnings
Igor Abramovski
Alon Vinnikov
Shalev Shaer
Naoyuki Kanda
Xiaofei Wang
Amir Ivry
Eyal Krupka
34
0
0
28 Jan 2025
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
Junan Zhang
Jing Yang
Zihao Fang
Y. Wang
Zehua Zhang
Zhuo Wang
Fan Fan
Z. Wu
39
2
0
26 Jan 2025
30+ Years of Source Separation Research: Achievements and Future Challenges
S. Araki
N. Ito
Reinhold Haeb-Umbach
G. Wichern
Zhong-Qiu Wang
Yuki Mitsufuji
AI4TS
34
0
0
21 Jan 2025
Microphone Array Signal Processing and Deep Learning for Speech Enhancement
Reinhold Haeb-Umbach
Tomohiro Nakatani
Marc Delcroix
Christoph Boeddeker
Tsubasa Ochiai
35
0
0
13 Jan 2025
Distance Based Single-Channel Target Speech Extraction
Runwu Shi
Benjamin Yen
Kazuhiro Nakadai
28
0
0
31 Dec 2024
Multiple Choice Learning for Efficient Speech Separation with Many Speakers
David Perera
François Derrida
Théo Mariotte
Gaël Richard
S. Essid
57
0
0
27 Nov 2024
Task-Aware Unified Source Separation
Kohei Saijo
Janek Ebbers
François G. Germain
G. Wichern
Jonathan Le Roux
32
2
0
31 Oct 2024
SepMamba: State-space models for speaker separation using Mamba
Thor Højhus Avenstrup
Boldizsár Elek
István László Mádi
András Bence Schin
Morten Mørup
Bjørn Sand Jensen
Kenny Falkær Olsen
Mamba
21
0
0
28 Oct 2024
STCON System for the CHiME-8 Challenge
Anton Mitrofanov
Tatiana Prisyach
Tatiana Timofeeva
Sergei Novoselov
M. Korenevsky
...
Dmitriy Miroshnichenko
Nikita Mamaev
Ilya Odegov
Olga Rudnitskaya
A. Romanenko
26
1
0
17 Oct 2024
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Xiang Hao
Chenxiang Ma
Qu Yang
Jibin Wu
Kay Chen Tan
18
0
0
07 Oct 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Kai Li
Wendi Sang
Chang Zeng
Runxuan Yang
Guo Chen
Xiaolin Hu
26
2
0
02 Oct 2024
An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement
Pin-Jui Ku
Chun-Wei Ho
Hao Yen
Sabato Marco Siniscalchi
Chin-Hui Lee
19
0
0
24 Sep 2024
Speech-Declipping Transformer with Complex Spectrogram and Learnerble Temporal Features
Younghoo Kwon
Jung-Woo Choi
30
2
0
19 Sep 2024
Multichannel-to-Multichannel Target Sound Extraction Using Direction and Timestamp Clues
Dayun Choi
Jung-Woo Choi
27
0
0
19 Sep 2024
DeFT-Mamba: Universal Multichannel Sound Separation and Polyphonic Audio Classification
Dongheon Lee
Jung-Woo Choi
Mamba
24
1
0
19 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
Bang Zeng
Ming Li
24
2
0
04 Sep 2024
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
K. Chen
Jiaqi Su
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Zeyu Jin
26
0
0
28 Aug 2024
ConcateNet: Dialogue Separation Using Local And Global Feature Concatenation
Mhd Modar Halimeh
Matteo Torcoli
Emanuel Habets
33
0
0
16 Aug 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Kohei Saijo
G. Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
34
7
0
06 Aug 2024
Enhanced Reverberation as Supervision for Unsupervised Speech Separation
Kohei Saijo
G. Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
16
1
0
06 Aug 2024
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement
Zhong-Qiu Wang
20
1
0
28 Jul 2024
Target conversation extraction: Source separation using turn-taking dynamics
Tuochao Chen
Qirui Wang
Bohan Wu
Malek Itani
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
20
4
0
15 Jul 2024
Speech dereverberation constrained on room impulse response characteristics
Louis Bahrman
Mathieu Fontaine
Jonathan Le Roux
Gaël Richard
23
1
0
10 Jul 2024
Knowledge boosting during low-latency inference
Vidya Srinivas
Malek Itani
Tuochao Chen
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
19
2
0
09 Jul 2024
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Bing Yang
Changsheng Quan
Yabo Wang
Pengyu Wang
Yujie Yang
Ying Fang
Nian Shao
Hui Bu
Xin Xu
Xiaofei Li
38
5
0
28 Jun 2024
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling
Vahid Ahmadi Kalkhorani
Cheng Yu
Anurag Kumar
Ke Tan
Buye Xu
DeLiang Wang
32
0
0
17 Jun 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Wangyou Zhang
Robin Scheibler
Kohei Saijo
Samuele Cornell
Chenda Li
...
Jan Pirklbauer
Marvin Sach
Shinji Watanabe
Tim Fingscheidt
Yanmin Qian
VLM
32
6
0
07 Jun 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
Wangyou Zhang
Kohei Saijo
Jee-weon Jung
Chenda Li
Shinji Watanabe
Yanmin Qian
30
4
0
06 Jun 2024
Cross-Talk Reduction
Zhong-Qiu Wang
Anurag Kumar
Shinji Watanabe
16
1
0
30 May 2024
Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Changsheng Quan
Xiaofei Li
39
23
0
12 Mar 2024
CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional Encoding for Single- and Multi-Channel Speaker Separation
Vahid Ahmadi Kalkhorani
DeLiang Wang
28
3
0
06 Mar 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
16
8
0
06 Feb 2024
Binaural Angular Separation Network
Yang Yang
George Sung
Shao-fu Shih
Hakan Erdogan
Chehung Lee
Matthias Grundmann
22
2
0
16 Jan 2024
On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement
Tsun-An Hsieh
Jacob Donley
Daniel D. E. Wong
Buye Xu
Ashutosh Pandey
15
2
0
15 Jan 2024
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
Ashutosh Pandey
Buye Xu
40
1
0
15 Jan 2024
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments
Renana Opochinsky
Mordehay Moradi
Sharon Gannot
13
4
0
07 Jan 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
13
3
0
11 Dec 2023
Multi-channel Conversational Speaker Separation via Neural Diarization
H. Taherian
DeLiang Wang
BDL
20
16
0
15 Nov 2023
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
William Ravenscroft
Stefan Goetze
Thomas Hain
25
7
0
09 Oct 2023
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Thilo von Neumann
Christoph Boeddeker
Tobias Cord-Landwehr
Marc Delcroix
Reinhold Haeb-Umbach
16
7
0
28 Sep 2023
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
Peter Vieting
Simon Berger
Thilo von Neumann
Christoph Boeddeker
Ralf Schlüter
Reinhold Haeb-Umbach
19
0
0
15 Sep 2023
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures
J. Yip
Dianwen Ng
Bin Ma
Chng Eng Siong
16
0
0
14 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
22
5
0
05 Sep 2023