One model to enhance them all: array geometry agnostic multi-channel
personalized speech enhancement

20 October 2021

Sefik Emre Eskimez

Takuya Yoshioka

Papers citing "One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement"

Title | Authors | Tag | Counts (unlabeled in source) | Date
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior | Zhongweiyang Xu, Xulin Fan, Zhong-Qiu Wang, Xilin Jiang, Romit Roy Choudhury | DiffM | 46 0 0 | 20 May 2025
Efficient Area-based and Speaker-Agnostic Source Separation | Martin Strauss, Okan Kopuklu | - | 26 3 0 | 19 Aug 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition | Guinan Li, Jiajun Deng, Youjun Chen, Mengzhe Geng, Shujie Hu, ..., Zengrui Jin, Tianzi Wang, Xurong Xie, Helen Meng, Xunying Liu | VLM | 29 0 0 | 14 Jun 2024
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization | Ya-Bin Wang, Bing Yang, Xiaofei Li | - | 40 1 0 | 11 May 2024
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition | Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide | - | 30 4 0 | 18 Jan 2024
Multi-channel Conversational Speaker Separation via Neural Diarization | H. Taherian, DeLiang Wang | BDL | 34 16 0 | 15 Nov 2023
Multichannel Voice Trigger Detection Based on Transform-average-concatenate | Takuya Higuchi, Avamarie Brueggeman, Han Fang, Stephen Shum | - | 16 0 0 | 27 Sep 2023
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling | Zhongqiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeonghak Kim, Shinji Watanabe | AI4TS | 15 10 0 | 18 Apr 2023
Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence | Yicheng Hsu, Yonghan Lee, M. Bai | - | 19 2 0 | 16 Nov 2022
Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation | H. Taherian, Sefik Emre Eskimez, Takuya Yoshioka | - | 16 1 0 | 05 Nov 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Xiaoyu Liu, Xu Li, Joan Serra | - | 42 9 0 | 23 Oct 2022
Multi-channel target speech enhancement based on ERB-scaled spatial coherence features | Yicheng Hsu, Yonghan Lee, M. Bai | - | 8 1 0 | 17 Jul 2022
Challenges and Opportunities in Multi-device Speech Processing | G. Ciccarelli, Jarred Barber, A. Nair, Israel Cohen, Tao Zhang | - | 25 5 0 | 27 Jun 2022
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation | Manthan Thakker, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang | - | 14 28 0 | 02 Apr 2022
VarArray: Array-Geometry-Agnostic Continuous Speech Separation | Takuya Yoshioka, Xiaofei Wang, Dongmei Wang, M. Tang, Zirun Zhu, Zhuo Chen, Naoyuki Kanda | - | 17 37 0 | 12 Oct 2021
Interspeech 2021 Deep Noise Suppression Challenge | Chandan K. A. Reddy, Harishchandra Dubey, K. Koishida, A. Nair, Vishak Gopal, Ross Cutler, Sebastian Braun, H. Gamper, R. Aichner, Sriram Srinivasan | AI4CE | 74 160 0 | 06 Jan 2021