Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.03209
Cited By
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
9 April 2018
Pete Warden
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition"
25 / 25 papers shown
Title
X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance
Junbo Zhang
Heinrich Dinkel
Yadong Niu
Chenyu Liu
Si Cheng
Anbei Zhao
Jian Luan
72
0
0
22 May 2025
FedFetch: Faster Federated Learning with Adaptive Downstream Prefetching
Qifan Yan
Andrew Liu
Shiqi He
Mathias Lécuyer
Ivan Beschastnikh
FedML
91
0
0
21 Apr 2025
DRIP: DRop unImportant data Points -- Enhancing Machine Learning Efficiency with Grad-CAM-Based Real-Time Data Prioritization for On-Device Training
Marcus Rüb
Daniel Konegen
Patrick Selle
Axel Sikora
Daniel Mueller-Gritschneder
37
0
0
11 Apr 2025
Three-Factor Learning in Spiking Neural Networks: An Overview of Methods and Trends from a Machine Learning Perspective
Szymon Mazurek
Jakub Caputa
Jan K. Argasiñski
Maciej Wielgosz
37
0
0
06 Apr 2025
Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems
Rachmad Vidya Wicaksana Putra
Pasindu Wickramasinghe
Mohamed Bennai
61
0
0
01 Apr 2025
Moss: Proxy Model-based Full-Weight Aggregation in Federated Learning with Heterogeneous Models
Y. Cai
Ziqi Zhang
Ding Li
Yao Guo
Xiangqun Chen
98
0
0
13 Mar 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
135
8
0
24 Feb 2025
Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer
Hu Hu
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Chin-Hui Lee
95
0
0
28 Jan 2025
How Redundant Is the Transformer Stack in Speech Representation Models?
Teresa Dorszewski
Albert Kjøller Jacobsen
Lenka Tětková
Lars Kai Hansen
121
0
0
20 Jan 2025
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks
Simon Rampp
Andreas Triantafyllopoulos
M. Milling
Björn Schuller
156
0
0
16 Dec 2024
Layer-Adaptive State Pruning for Deep State Space Models
Minseon Gwak
Seongrok Moon
Joohwan Ko
PooGyeon Park
73
0
0
05 Nov 2024
Differentiable Weightless Neural Networks
Alan T. L. Bacellar
Zachary Susskind
Mauricio Breternitz Jr.
E. John
L. John
P. Lima
F. M. G. França
59
4
0
14 Oct 2024
Improving Semantic Understanding in Speech Language Models via Brain-tuning
Omer Moussa
Dietrich Klakow
Mariya Toneva
59
5
0
11 Oct 2024
Audio xLSTMs: Learning Self-Supervised Audio Representations with xLSTMs
Sarthak Yadav
Sergios Theodoridis
Zheng-Hua Tan
68
3
0
29 Aug 2024
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Siavash Shams
Sukru Samet Dindar
Xilin Jiang
N. Mesgarani
Mamba
74
20
0
20 May 2024
FedSPU: Personalized Federated Learning for Resource-constrained Devices with Stochastic Parameter Update
Ziru Niu
Hai Dong
•. A. K. Qin
75
2
0
18 Mar 2024
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
114
31
0
27 Aug 2023
Loss shaping enhances exact gradient learning with Eventprop in spiking neural networks
Thomas Nowotny
J. Turner
James C. Knight
46
15
0
02 Dec 2022
Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
N. Benjamin Erichson
Soon Hoe Lim
Michael W. Mahoney
44
6
0
01 Dec 2022
Speech Recognition: Keyword Spotting Through Image Recognition
Sanjay Krishna Gouda
S. Kanetkar
David J. Harrison
Manfred K. Warmuth
39
22
0
10 Mar 2018
CMSIS-NN: Efficient Neural Network Kernels for Arm Cortex-M CPUs
Liangzhen Lai
Naveen Suda
Vikas Chandra
39
378
0
19 Jan 2018
Did you hear that? Adversarial Examples Against Automatic Speech Recognition
M. Alzantot
Bharathan Balaji
Mani B. Srivastava
AAML
46
251
0
02 Jan 2018
Raw Waveform-based Audio Classification Using Sample-level CNN Architectures
Jongpil Lee
Taejun Kim
Jiyoung Park
Juhan Nam
16
67
0
04 Dec 2017
Deep Residual Learning for Small-Footprint Keyword Spotting
Raphael Tang
Jimmy J. Lin
42
237
0
28 Oct 2017
Listening to the World Improves Speech Command Recognition
B. McMahan
D. Rao
82
38
0
23 Oct 2017
1