A scalable noisy speech dataset and online subjective test framework

Interspeech (Interspeech), 2019

17 September 2019

Papers citing "A scalable noisy speech dataset and online subjective test framework"

50 / 68 papers shown

Title
DHAuDS: A Dynamic and Heterogeneous Audio Benchmark for Test-Time Adaptation Weichuang Shao I. Liao Tomas Henrique Bode Maul T. Chandesa TTA 108 0 0 23 Nov 2025
Pretrained Conformers for Audio Fingerprinting and Retrieval Kemal Altwlkany Elmedin Selmanovic Sead Delalic 84 0 0 15 Aug 2025
Tiny Noise-Robust Voice Activity Detector for Voice Assistants Hamed Jafarzadeh Asl Mahsa Ghazvini Nejad Amin Edraki M. Asgharian Vahid Partovi Nia 80 1 0 29 Jul 2025
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation Haris Khan Shumaila Asif Hassan Nasir Kamran Aziz Bhatti Shahzad Amin Sheikh 102 1 0 25 Jun 2025
SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR Wei-Ping Huang Guan-Ting Lin Hung-yi Lee KELM 75 0 0 10 Jun 2025
Training-Free Multi-Step Audio Source Separation Yongyi Zang Jingyi Li Qiuqiang Kong 370 0 0 26 May 2025
AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation Yang Xiao Tianyi Peng Yanghao Zhou Rohan Kumar Das TTA 150 0 0 20 May 2025
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users Suyu Ye Haojun Shi Darren Shih Hyokun Yun Tanya Roosta Tianmin Shu 265 11 0 14 Apr 2025
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 Myeonghoon Ryu June-Woo Kim Minseok Oh Suji Lee Han Park 243 1 0 20 Jan 2025
Roadmap towards Superhuman Speech Understanding using Large Language Models Fan Bu Yuhao Zhang Xiang Wang Benyou Wang Qiang Liu Haoyang Li LM&MA ELM AuLLM 677 2 0 17 Oct 2024
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind Lance Ying Xinyi Li Shivam Aarya Yizirui Fang Stefanie Tellex J. Tenenbaum Tianmin Shu Joshua B. Tenenbaum Tianmin Shu LM&Ro 255 3 0 17 Sep 2024
The VoiceMOS Challenge 2024: Beyond Speech Quality PredictionSpoken Language Technology Workshop (SLT), 2024 Wen-Chin Huang Szu-Wei Fu Erica Cooper Ryandhimas E. Zezario Tomoki Toda Hsin-Min Wang Junichi Yamagishi Yu Tsao 194 31 0 11 Sep 2024
Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time Yue Li Koen V. Hindriks Florian A. Kunneman 172 1 0 10 Sep 2024
LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech RecognitionInterspeech (Interspeech), 2024 Eunseop Yoon Hee Suk Yoon John Harvill M. Hasegawa-Johnson Chang D. Yoo TTA VLM 190 1 0 11 Aug 2024
Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps Mattias Nilsson Riccardo Miccini Clément Laroche Tobias Piechowiak Friedemann Zenke MQ 130 2 0 05 Jul 2024
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Guan-Ting Lin Wei-Ping Huang Hung-yi Lee VLM TTA 145 7 0 16 Jun 2024
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning Brandon Colelough Andrew Zheng 258 1 0 30 May 2024
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks Alexandre Bittar Philip N. Garner 139 1 0 22 Apr 2024
Test-Time Training for Depression Detection Sri Harsha Dumpala Chandramouli Shama Sastry Rudolf Uher Sageev Oore 184 1 0 07 Apr 2024
Speech Robust Bench: A Robustness Benchmark For Speech RecognitionInternational Conference on Learning Representations (ICLR), 2024 Muhammad A. Shah David Solans Noguero Mikko A. Heikkilä Nicolas Kourtellis 176 11 0 08 Mar 2024
SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech Adam Sabra C. Wronka Michelle Mao Samer Hijazi 95 4 0 19 Feb 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning Xincheng Yu Dongyue Guo Jianwei Zhang Yi Lin 160 6 0 11 Dec 2023
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality AssessmentIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Alessandro Ragano Jan Skoglund Andrew Hines 257 15 0 28 Sep 2023
A Two-Step Approach for Narrowband Source Localization in Reverberant Rooms W. Lai L. Birnie T. Abhayapala Amy Bastine Shaoheng Xu P. Samarasinghe 55 1 0 25 Sep 2023
Test-Time Training for Speech Sri Harsha Dumpala Chandramouli Shama Sastry Sageev Oore 263 3 0 19 Sep 2023
Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Alexandre Bittar Paul Dixon Mohammad Samragh K. Nishu Devang Naik 200 6 0 31 Aug 2023
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined KeywordsInterspeech (Interspeech), 2023 Yong-Hyeok Lee Namhyun Cho 156 25 0 31 Aug 2023
Fixed Inter-Neuron Covariability Induces Adversarial RobustnessIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Muhammad Ahmed Shah Bhiksha Raj AAML 56 0 0 07 Aug 2023
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages Shivam Mhaskar Vineet Bhat Akshay Batheja S. Deoghare Paramveer Choudhary P. Bhattacharyya 143 7 0 21 May 2023
Improving the Intent Classification accuracy in Noisy Environment Mohamed Nabih Ali Alessio Brutti Daniele Falavigna 82 1 0 12 Mar 2023
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech RecognitionNeural Networks (Neural Netw.), 2023 Leyuan Qu C. Weber S. Wermter 132 12 0 20 Feb 2023
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Muqiao Yang Joseph Konan David Bick YUNYANG ZENG Shuo Han Anurag Kumar Shinji Watanabe Bhiksha Raj 142 5 0 16 Feb 2023
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 YUNYANG ZENG Joseph Konan Shuo Han David Bick Muqiao Yang Anurag Kumar Shinji Watanabe Bhiksha Raj 139 11 0 16 Feb 2023
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial TrainingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022 Yang Xiang Jesper Lisby Højvang M. Rasmussen M. G. Christensen DRL 160 7 0 16 Nov 2022
Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization Manuele Rusci Marco Fariselli Martin Croome Francesco Paci Eric Flamand MQ 122 16 0 14 Oct 2022
Improving Speech Enhancement through Fine-Grained Speech CharacteristicsInterspeech (Interspeech), 2022 Muqiao Yang Joseph Konan David Bick Anurag Kumar Shinji Watanabe Bhiksha Raj 125 11 0 01 Jul 2022
Learning Audio-Text Agreement for Open-vocabulary Keyword SpottingInterspeech (Interspeech), 2022 Hyeon-Kyeong Shin Hyewon Han Doyeon Kim Soo-Whan Chung Hong-Goo Kang 215 43 0 30 Jun 2022
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixerInterspeech (Interspeech), 2022 Jinmiao Huang W. Gharbieh Qianhui Wan Han Suk Shim Chul Lee 105 10 0 23 Jun 2022
A Systematic Comparison of Phonetic Aware Techniques for Speech EnhancementInterspeech (Interspeech), 2022 Or Tal Moshe Mandel Felix Kreuk Yossi Adi AAML 190 10 0 22 Jun 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets J. Valin Ritwik Giri Shrikant Venkataramani Umut Isik A. Krishnaswamy 111 2 0 16 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingInternational Conference on Learning Representations (ICLR), 2022 Sang-gil Lee Ming-Yu Liu Boris Ginsburg Bryan Catanzaro Sung-Hoon Yoon 259 368 0 09 Jun 2022
GWA: A Large High-Quality Acoustic Dataset for Audio ProcessingInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2022 Zhenyu Tang R. Aralikatti Anton Ratnarajah Tianyi Zhou 269 44 0 04 Apr 2022
Spiking Cochlea with System-level Local Automatic Gain ControlIEEE Transactions on Circuits and Systems Part 1: Regular Papers (TCAS I), 2022 Ilya Kiselev Chang Gao Shih-Chii Liu 139 13 0 14 Feb 2022
Hybrid Neural Networks for On-device Directional Hearing Anran Wang Maruchi Kim Hao Zhang Shyamnath Gollakota 139 18 0 11 Dec 2021
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Yihui Fu Yun Liu Jingdong Li Dawei Luo Shubo Lv Yukai Jv Lei Xie 298 63 0 11 Nov 2021
InQSS: a speech intelligibility and quality assessment model using a multi-task learning networkInterspeech (Interspeech), 2021 Yu-Wen Chen Yu Tsao 276 24 0 04 Nov 2021
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition Boris Bergsma Minhao Yang Milos Cernak 152 4 0 07 Oct 2021
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors Chandan K. A. Reddy Vishak Gopal Ross Cutler 428 319 0 05 Oct 2021
DDS: A new device-degraded speech dataset for speech enhancement Haoyu Li Junichi Yamagishi 163 10 0 16 Sep 2021
Objective Metrics to Evaluate Residual-Echo Suppression During Double-TalkIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021 Amir Ivry Israel Cohen B. Berdugo 128 8 0 15 Jul 2021