Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.08050
Cited By
A scalable noisy speech dataset and online subjective test framework
Interspeech (Interspeech), 2019
17 September 2019
Chandan K. A. Reddy
Ebrahim Beyrami
Jamie Pool
Ross Cutler
Sriram Srinivasan
J. Gehrke
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A scalable noisy speech dataset and online subjective test framework"
50 / 68 papers shown
Title
DHAuDS: A Dynamic and Heterogeneous Audio Benchmark for Test-Time Adaptation
Weichuang Shao
I. Liao
Tomas Henrique Bode Maul
T. Chandesa
TTA
108
0
0
23 Nov 2025
Pretrained Conformers for Audio Fingerprinting and Retrieval
Kemal Altwlkany
Elmedin Selmanovic
Sead Delalic
84
0
0
15 Aug 2025
Tiny Noise-Robust Voice Activity Detector for Voice Assistants
Hamed Jafarzadeh Asl
Mahsa Ghazvini Nejad
Amin Edraki
M. Asgharian
Vahid Partovi Nia
80
1
0
29 Jul 2025
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
Haris Khan
Shumaila Asif
Hassan Nasir
Kamran Aziz Bhatti
Shahzad Amin Sheikh
102
1
0
25 Jun 2025
SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR
Wei-Ping Huang
Guan-Ting Lin
Hung-yi Lee
KELM
75
0
0
10 Jun 2025
Training-Free Multi-Step Audio Source Separation
Yongyi Zang
Jingyi Li
Qiuqiang Kong
370
0
0
26 May 2025
AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation
Yang Xiao
Tianyi Peng
Yanghao Zhou
Rohan Kumar Das
TTA
150
0
0
20 May 2025
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users
Suyu Ye
Haojun Shi
Darren Shih
Hyokun Yun
Tanya Roosta
Tianmin Shu
265
11
0
14 Apr 2025
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Myeonghoon Ryu
June-Woo Kim
Minseok Oh
Suji Lee
Han Park
243
1
0
20 Jan 2025
Roadmap towards Superhuman Speech Understanding using Large Language Models
Fan Bu
Yuhao Zhang
Xiang Wang
Benyou Wang
Qiang Liu
Haoyang Li
LM&MA
ELM
AuLLM
677
2
0
17 Oct 2024
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind
Lance Ying
Xinyi Li
Shivam Aarya
Yizirui Fang
Stefanie Tellex
J. Tenenbaum
Tianmin Shu
Joshua B. Tenenbaum
Tianmin Shu
LM&Ro
255
3
0
17 Sep 2024
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction
Spoken Language Technology Workshop (SLT), 2024
Wen-Chin Huang
Szu-Wei Fu
Erica Cooper
Ryandhimas E. Zezario
Tomoki Toda
Hsin-Min Wang
Junichi Yamagishi
Yu Tsao
194
31
0
11 Sep 2024
Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time
Yue Li
Koen V. Hindriks
Florian A. Kunneman
172
1
0
10 Sep 2024
LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition
Interspeech (Interspeech), 2024
Eunseop Yoon
Hee Suk Yoon
John Harvill
M. Hasegawa-Johnson
Chang D. Yoo
TTA
VLM
190
1
0
11 Aug 2024
Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
Mattias Nilsson
Riccardo Miccini
Clément Laroche
Tobias Piechowiak
Friedemann Zenke
MQ
130
2
0
05 Jul 2024
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech
Guan-Ting Lin
Wei-Ping Huang
Hung-yi Lee
VLM
TTA
145
7
0
16 Jun 2024
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
Brandon Colelough
Andrew Zheng
258
1
0
30 May 2024
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks
Alexandre Bittar
Philip N. Garner
139
1
0
22 Apr 2024
Test-Time Training for Depression Detection
Sri Harsha Dumpala
Chandramouli Shama Sastry
Rudolf Uher
Sageev Oore
184
1
0
07 Apr 2024
Speech Robust Bench: A Robustness Benchmark For Speech Recognition
International Conference on Learning Representations (ICLR), 2024
Muhammad A. Shah
David Solans Noguero
Mikko A. Heikkilä
Nicolas Kourtellis
176
11
0
08 Mar 2024
SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech
Adam Sabra
C. Wronka
Michelle Mao
Samer Hijazi
95
4
0
19 Feb 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
160
6
0
11 Dec 2023
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Alessandro Ragano
Jan Skoglund
Andrew Hines
257
15
0
28 Sep 2023
A Two-Step Approach for Narrowband Source Localization in Reverberant Rooms
W. Lai
L. Birnie
T. Abhayapala
Amy Bastine
Shaoheng Xu
P. Samarasinghe
55
1
0
25 Sep 2023
Test-Time Training for Speech
Sri Harsha Dumpala
Chandramouli Shama Sastry
Sageev Oore
263
3
0
19 Sep 2023
Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Alexandre Bittar
Paul Dixon
Mohammad Samragh
K. Nishu
Devang Naik
200
6
0
31 Aug 2023
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords
Interspeech (Interspeech), 2023
Yong-Hyeok Lee
Namhyun Cho
156
25
0
31 Aug 2023
Fixed Inter-Neuron Covariability Induces Adversarial Robustness
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muhammad Ahmed Shah
Bhiksha Raj
AAML
56
0
0
07 Aug 2023
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages
Shivam Mhaskar
Vineet Bhat
Akshay Batheja
S. Deoghare
Paramveer Choudhary
P. Bhattacharyya
143
7
0
21 May 2023
Improving the Intent Classification accuracy in Noisy Environment
Mohamed Nabih Ali
Alessio Brutti
Daniele Falavigna
82
1
0
12 Mar 2023
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Neural Networks (Neural Netw.), 2023
Leyuan Qu
C. Weber
S. Wermter
132
12
0
20 Feb 2023
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muqiao Yang
Joseph Konan
David Bick
YUNYANG ZENG
Shuo Han
Anurag Kumar
Shinji Watanabe
Bhiksha Raj
142
5
0
16 Feb 2023
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
YUNYANG ZENG
Joseph Konan
Shuo Han
David Bick
Muqiao Yang
Anurag Kumar
Shinji Watanabe
Bhiksha Raj
139
11
0
16 Feb 2023
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
160
7
0
16 Nov 2022
Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization
Manuele Rusci
Marco Fariselli
Martin Croome
Francesco Paci
Eric Flamand
MQ
122
16
0
14 Oct 2022
Improving Speech Enhancement through Fine-Grained Speech Characteristics
Interspeech (Interspeech), 2022
Muqiao Yang
Joseph Konan
David Bick
Anurag Kumar
Shinji Watanabe
Bhiksha Raj
125
11
0
01 Jul 2022
Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Interspeech (Interspeech), 2022
Hyeon-Kyeong Shin
Hyewon Han
Doyeon Kim
Soo-Whan Chung
Hong-Goo Kang
215
43
0
30 Jun 2022
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
Interspeech (Interspeech), 2022
Jinmiao Huang
W. Gharbieh
Qianhui Wan
Han Suk Shim
Chul Lee
105
10
0
23 Jun 2022
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Interspeech (Interspeech), 2022
Or Tal
Moshe Mandel
Felix Kreuk
Yossi Adi
AAML
190
10
0
22 Jun 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
J. Valin
Ritwik Giri
Shrikant Venkataramani
Umut Isik
A. Krishnaswamy
111
2
0
16 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
International Conference on Learning Representations (ICLR), 2022
Sang-gil Lee
Ming-Yu Liu
Boris Ginsburg
Bryan Catanzaro
Sung-Hoon Yoon
259
368
0
09 Jun 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2022
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Tianyi Zhou
269
44
0
04 Apr 2022
Spiking Cochlea with System-level Local Automatic Gain Control
IEEE Transactions on Circuits and Systems Part 1: Regular Papers (TCAS I), 2022
Ilya Kiselev
Chang Gao
Shih-Chii Liu
139
13
0
14 Feb 2022
Hybrid Neural Networks for On-device Directional Hearing
Anran Wang
Maruchi Kim
Hao Zhang
Shyamnath Gollakota
139
18
0
11 Dec 2021
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
298
63
0
11 Nov 2021
InQSS: a speech intelligibility and quality assessment model using a multi-task learning network
Interspeech (Interspeech), 2021
Yu-Wen Chen
Yu Tsao
276
24
0
04 Nov 2021
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition
Boris Bergsma
Minhao Yang
Milos Cernak
152
4
0
07 Oct 2021
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
428
319
0
05 Oct 2021
DDS: A new device-degraded speech dataset for speech enhancement
Haoyu Li
Junichi Yamagishi
163
10
0
16 Sep 2021
Objective Metrics to Evaluate Residual-Echo Suppression During Double-Talk
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Amir Ivry
Israel Cohen
B. Berdugo
128
8
0
15 Jul 2021
1
2
Next