2104.00769
Keyword Transformer: A Self-Attention Model for Keyword Spotting
1 April 2021
Axel Berg
Mark O'Connor
M. T. Cruz
Papers citing
"Keyword Transformer: A Self-Attention Model for Keyword Spotting"
28 / 28 papers shown
Title
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
Myeonghoon Ryu
June-Woo Kim
Minseok Oh
Suji Lee
Han Park
36
0
0
20 Jan 2025
Effective Integration of KAN for Keyword Spotting
Anfeng Xu
Biqiao Zhang
Shuyu Kong
Yiteng Huang
Zhaojun Yang
Sangeeta Srivastava
Ming Sun
29
5
0
13 Sep 2024
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition
Wenhan Yao
Jiangkun Yang
Yongqiang He
Jia Liu
Weiping Wen
34
1
0
16 Jun 2024
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
Zhuojiang Cai
Yuhan Ma
Feng Lu
22
0
0
26 Jan 2024
CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
R. S. Srinivasa
Jaejin Cho
Chouchang Yang
Yashas Malur Saidutta
Ching Hua Lee
Yilin Shen
Hongxia Jin
VLM
21
8
0
26 Sep 2023
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords
Yong-Hyeok Lee
Namhyun Cho
24
18
0
31 Aug 2023
Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound
Hanbo Cai
Pengcheng Zhang
Hai Dong
Yan Xiao
Stefanos Koffas
Yiming Li
AAML
21
28
0
17 Jul 2023
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration
Ahmed F. AbouElhamayed
Angela Cui
Javier Fernandez-Marques
Nicholas D. Lane
Mohamed S. Abdelfattah
MQ
16
4
0
25 May 2023
Differentially Private Adapters for Parameter Efficient Acoustic Modeling
Chun-Wei Ho
Chao-Han Huck Yang
Sabato Marco Siniscalchi
16
1
0
19 May 2023
Contrastive Speech Mixup for Low-resource Keyword Spotting
Dianwen Ng
Ruixi Zhang
J. Yip
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
E. Chng
B. Ma
30
10
0
02 May 2023
Small-footprint slimmable networks for keyword spotting
Zuhaib Akhtar
Mohammad Omar Khursheed
Dongsu Du
Yuzong Liu
24
2
0
21 Apr 2023
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
27
4
0
03 Mar 2023
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance
Haotong Qin
Xudong Ma
Yifu Ding
X. Li
Yang Zhang
Zejun Ma
Jiakai Wang
Jie Luo
Xianglong Liu
MQ
38
20
0
13 Nov 2022
Metric Learning for User-defined Keyword Spotting
Jaemin Jung
You-kyong Kim
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Youngjoon Jang
Joon Son Chung
32
9
0
01 Nov 2022
Taxonomic Classification of IoT Smart Home Voice Control
M. Hewitt
H. Cunningham
11
1
0
24 Oct 2022
UniKW-AT: Unified Keyword Spotting and Audio Tagging
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
37
3
0
23 Sep 2022
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization
Dianwen Ng
J. Yip
Tanmay Surana
Zhao Yang
Chong Zhang
Yukun Ma
Chongjia Ni
Chng Eng Siong
B. Ma
27
6
0
14 Sep 2022
Fall Detection from Audios with Audio Transformers
Prabhjot Kaur
Qifan Wang
Weisong Shi
8
16
0
23 Aug 2022
Boosting Tail Neural Network for Realtime Custom Keyword Spotting
Sihao Xue
Qianyao Shen
Guoqing Li
21
0
0
24 May 2022
Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition
Axel Berg
Magnus Oskarsson
Mark O'Connor
3DPC
ViT
19
26
0
08 Apr 2022
Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention
Zuzana Jelčicová
Marian Verhelst
26
5
0
20 Mar 2022
Learning Audio Representations with MLPs
Mashrur M. Morshed
Ahmad Omar Ahsan
H. Mahmud
Md. Kamrul Hasan
19
4
0
16 Mar 2022
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Mario Esparza
17
0
0
21 Feb 2022
Exploiting Hybrid Models of Tensor-Train Networks for Spoken Command Recognition
Jun Qi
Javier Tejedor
19
4
0
11 Jan 2022
SSAST: Self-Supervised Audio Spectrogram Transformer
Yuan Gong
Cheng-I Jeff Lai
Yu-An Chung
James R. Glass
ViT
30
268
0
19 Oct 2021
Colorization Transformer
Manoj Kumar
Dirk Weissenborn
Nal Kalchbrenner
ViT
218
140
0
08 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
193
421
0
01 Feb 2021
Learning Efficient Representations for Keyword Spotting with Triplet Loss
R. Vygon
N. Mikhaylovskiy
DML
SSL
60
63
0
12 Jan 2021