ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.08050
  4. Cited By
A scalable noisy speech dataset and online subjective test framework

A scalable noisy speech dataset and online subjective test framework

Interspeech (Interspeech), 2019
17 September 2019
Chandan K. A. Reddy
Ebrahim Beyrami
Jamie Pool
Ross Cutler
Sriram Srinivasan
J. Gehrke
ArXiv (abs)PDFHTML

Papers citing "A scalable noisy speech dataset and online subjective test framework"

50 / 68 papers shown
Title
DHAuDS: A Dynamic and Heterogeneous Audio Benchmark for Test-Time Adaptation
DHAuDS: A Dynamic and Heterogeneous Audio Benchmark for Test-Time Adaptation
Weichuang Shao
I. Liao
Tomas Henrique Bode Maul
T. Chandesa
TTA
108
0
0
23 Nov 2025
Pretrained Conformers for Audio Fingerprinting and Retrieval
Pretrained Conformers for Audio Fingerprinting and Retrieval
Kemal Altwlkany
Elmedin Selmanovic
Sead Delalic
84
0
0
15 Aug 2025
Tiny Noise-Robust Voice Activity Detector for Voice Assistants
Tiny Noise-Robust Voice Activity Detector for Voice Assistants
Hamed Jafarzadeh Asl
Mahsa Ghazvini Nejad
Amin Edraki
M. Asgharian
Vahid Partovi Nia
80
1
0
29 Jul 2025
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
Haris Khan
Shumaila Asif
Hassan Nasir
Kamran Aziz Bhatti
Shahzad Amin Sheikh
102
1
0
25 Jun 2025
SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR
SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR
Wei-Ping Huang
Guan-Ting Lin
Hung-yi Lee
KELM
75
0
0
10 Jun 2025
Training-Free Multi-Step Audio Source Separation
Training-Free Multi-Step Audio Source Separation
Yongyi Zang
Jingyi Li
Qiuqiang Kong
370
0
0
26 May 2025
AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation
AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation
Yang Xiao
Tianyi Peng
Yanghao Zhou
Rohan Kumar Das
TTA
150
0
0
20 May 2025
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users
Suyu Ye
Haojun Shi
Darren Shih
Hyokun Yun
Tanya Roosta
Tianmin Shu
265
11
0
14 Apr 2025
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Myeonghoon Ryu
June-Woo Kim
Minseok Oh
Suji Lee
Han Park
243
1
0
20 Jan 2025
Roadmap towards Superhuman Speech Understanding using Large Language
  Models
Roadmap towards Superhuman Speech Understanding using Large Language Models
Fan Bu
Yuhao Zhang
Xiang Wang
Benyou Wang
Qiang Liu
Haoyang Li
LM&MAELMAuLLM
677
2
0
17 Oct 2024
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind
Lance Ying
Xinyi Li
Shivam Aarya
Yizirui Fang
Stefanie Tellex
J. Tenenbaum
Tianmin Shu
Joshua B. Tenenbaum
Tianmin Shu
LM&Ro
255
3
0
17 Sep 2024
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction
The VoiceMOS Challenge 2024: Beyond Speech Quality PredictionSpoken Language Technology Workshop (SLT), 2024
Wen-Chin Huang
Szu-Wei Fu
Erica Cooper
Ryandhimas E. Zezario
Tomoki Toda
Hsin-Min Wang
Junichi Yamagishi
Yu Tsao
194
31
0
11 Sep 2024
Spectral oversubtraction? An approach for speech enhancement after robot
  ego speech filtering in semi-real-time
Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time
Yue Li
Koen V. Hindriks
Florian A. Kunneman
172
1
0
10 Sep 2024
LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech
  Recognition
LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech RecognitionInterspeech (Interspeech), 2024
Eunseop Yoon
Hee Suk Yoon
John Harvill
M. Hasegawa-Johnson
Chang D. Yoo
TTAVLM
190
1
0
11 Aug 2024
Resource-Efficient Speech Quality Prediction through Quantization Aware
  Training and Binary Activation Maps
Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
Mattias Nilsson
Riccardo Miccini
Clément Laroche
Tobias Piechowiak
Friedemann Zenke
MQ
130
2
0
05 Jul 2024
Continual Test-time Adaptation for End-to-end Speech Recognition on
  Noisy Speech
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech
Guan-Ting Lin
Wei-Ping Huang
Hung-yi Lee
VLMTTA
145
7
0
16 Jun 2024
Effects of Dataset Sampling Rate for Noise Cancellation through Deep
  Learning
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
Brandon Colelough
Andrew Zheng
258
1
0
30 May 2024
Exploring neural oscillations during speech perception via surrogate
  gradient spiking neural networks
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks
Alexandre Bittar
Philip N. Garner
139
1
0
22 Apr 2024
Test-Time Training for Depression Detection
Test-Time Training for Depression Detection
Sri Harsha Dumpala
Chandramouli Shama Sastry
Rudolf Uher
Sageev Oore
184
1
0
07 Apr 2024
Speech Robust Bench: A Robustness Benchmark For Speech Recognition
Speech Robust Bench: A Robustness Benchmark For Speech RecognitionInternational Conference on Learning Representations (ICLR), 2024
Muhammad A. Shah
David Solans Noguero
Mikko A. Heikkilä
Nicolas Kourtellis
176
11
0
08 Mar 2024
SECP: A Speech Enhancement-Based Curation Pipeline For Scalable
  Acquisition Of Clean Speech
SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech
Adam Sabra
C. Wronka
Michelle Mao
Samer Hijazi
95
4
0
19 Feb 2024
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic
  Control Using Multi-Objective Learning
ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Xincheng Yu
Dongyue Guo
Jianwei Zhang
Yi Lin
160
6
0
11 Dec 2023
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech
  Enhancement and Non-matching Reference Audio Quality Assessment
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality AssessmentIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Alessandro Ragano
Jan Skoglund
Andrew Hines
257
15
0
28 Sep 2023
A Two-Step Approach for Narrowband Source Localization in Reverberant
  Rooms
A Two-Step Approach for Narrowband Source Localization in Reverberant Rooms
W. Lai
L. Birnie
T. Abhayapala
Amy Bastine
Shaoheng Xu
P. Samarasinghe
55
1
0
25 Sep 2023
Test-Time Training for Speech
Test-Time Training for Speech
Sri Harsha Dumpala
Chandramouli Shama Sastry
Sageev Oore
263
3
0
19 Sep 2023
Improving vision-inspired keyword spotting using dynamic module skipping
  in streaming conformer encoder
Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Alexandre Bittar
Paul Dixon
Mohammad Samragh
K. Nishu
Devang Naik
200
6
0
31 Aug 2023
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined
  Keywords
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined KeywordsInterspeech (Interspeech), 2023
Yong-Hyeok Lee
Namhyun Cho
156
25
0
31 Aug 2023
Fixed Inter-Neuron Covariability Induces Adversarial Robustness
Fixed Inter-Neuron Covariability Induces Adversarial RobustnessIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muhammad Ahmed Shah
Bhiksha Raj
AAML
56
0
0
07 Aug 2023
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select
  Indic Languages
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages
Shivam Mhaskar
Vineet Bhat
Akshay Batheja
S. Deoghare
Paramveer Choudhary
P. Bhattacharyya
143
7
0
21 May 2023
Improving the Intent Classification accuracy in Noisy Environment
Improving the Intent Classification accuracy in Noisy Environment
Mohamed Nabih Ali
Alessio Brutti
Daniele Falavigna
82
1
0
12 Mar 2023
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End
  Speech Recognition
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech RecognitionNeural Networks (Neural Netw.), 2023
Leyuan Qu
C. Weber
S. Wermter
132
12
0
20 Feb 2023
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech
  Enhancement
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muqiao Yang
Joseph Konan
David Bick
YUNYANG ZENG
Shuo Han
Anurag Kumar
Shinji Watanabe
Bhiksha Raj
142
5
0
16 Feb 2023
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
YUNYANG ZENG
Joseph Konan
Shuo Han
David Bick
Muqiao Yang
Anurag Kumar
Shinji Watanabe
Bhiksha Raj
139
11
0
16 Feb 2023
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method
  Using Variational Autoencoder and Adversarial Training
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial TrainingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
160
7
0
16 Nov 2022
Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed
  FP16-INT8 Post-Training Quantization
Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization
Manuele Rusci
Marco Fariselli
Martin Croome
Francesco Paci
Eric Flamand
MQ
122
16
0
14 Oct 2022
Improving Speech Enhancement through Fine-Grained Speech Characteristics
Improving Speech Enhancement through Fine-Grained Speech CharacteristicsInterspeech (Interspeech), 2022
Muqiao Yang
Joseph Konan
David Bick
Anurag Kumar
Shinji Watanabe
Bhiksha Raj
125
11
0
01 Jul 2022
Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Learning Audio-Text Agreement for Open-vocabulary Keyword SpottingInterspeech (Interspeech), 2022
Hyeon-Kyeong Shin
Hyewon Han
Doyeon Kim
Soo-Whan Chung
Hong-Goo Kang
215
43
0
30 Jun 2022
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using
  MLPMixer
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixerInterspeech (Interspeech), 2022
Jinmiao Huang
W. Gharbieh
Qianhui Wan
Han Suk Shim
Chul Lee
105
10
0
23 Jun 2022
A Systematic Comparison of Phonetic Aware Techniques for Speech
  Enhancement
A Systematic Comparison of Phonetic Aware Techniques for Speech EnhancementInterspeech (Interspeech), 2022
Or Tal
Moshe Mandel
Felix Kreuk
Yossi Adi
AAML
190
10
0
22 Jun 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time
  Dereverberation Targets
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
J. Valin
Ritwik Giri
Shrikant Venkataramani
Umut Isik
A. Krishnaswamy
111
2
0
16 Jun 2022
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingInternational Conference on Learning Representations (ICLR), 2022
Sang-gil Lee
Ming-Yu Liu
Boris Ginsburg
Bryan Catanzaro
Sung-Hoon Yoon
259
368
0
09 Jun 2022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
GWA: A Large High-Quality Acoustic Dataset for Audio ProcessingInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2022
Zhenyu Tang
R. Aralikatti
Anton Ratnarajah
Tianyi Zhou
269
44
0
04 Apr 2022
Spiking Cochlea with System-level Local Automatic Gain Control
Spiking Cochlea with System-level Local Automatic Gain ControlIEEE Transactions on Circuits and Systems Part 1: Regular Papers (TCAS I), 2022
Ilya Kiselev
Chang Gao
Shih-Chii Liu
139
13
0
14 Feb 2022
Hybrid Neural Networks for On-device Directional Hearing
Hybrid Neural Networks for On-device Directional Hearing
Anran Wang
Maruchi Kim
Hao Zhang
Shyamnath Gollakota
139
18
0
11 Dec 2021
Uformer: A Unet based dilated complex & real dual-path conformer network
  for simultaneous speech enhancement and dereverberation
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
298
63
0
11 Nov 2021
InQSS: a speech intelligibility and quality assessment model using a
  multi-task learning network
InQSS: a speech intelligibility and quality assessment model using a multi-task learning networkInterspeech (Interspeech), 2021
Yu-Wen Chen
Yu Tsao
276
24
0
04 Nov 2021
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio
  Recognition
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition
Boris Bergsma
Minhao Yang
Milos Cernak
152
4
0
07 Oct 2021
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric
  to Evaluate Noise Suppressors
DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
428
319
0
05 Oct 2021
DDS: A new device-degraded speech dataset for speech enhancement
DDS: A new device-degraded speech dataset for speech enhancement
Haoyu Li
Junichi Yamagishi
163
10
0
16 Sep 2021
Objective Metrics to Evaluate Residual-Echo Suppression During
  Double-Talk
Objective Metrics to Evaluate Residual-Echo Suppression During Double-TalkIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Amir Ivry
Israel Cohen
B. Berdugo
128
8
0
15 Jul 2021
12
Next