ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.00769
  4. Cited By
Keyword Transformer: A Self-Attention Model for Keyword Spotting

Keyword Transformer: A Self-Attention Model for Keyword Spotting

1 April 2021
Axel Berg
Mark O'Connor
M. T. Cruz
ArXivPDFHTML

Papers citing "Keyword Transformer: A Self-Attention Model for Keyword Spotting"

28 / 28 papers shown
Title
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
Myeonghoon Ryu
June-Woo Kim
Minseok Oh
Suji Lee
Han Park
36
0
0
20 Jan 2025
Effective Integration of KAN for Keyword Spotting
Effective Integration of KAN for Keyword Spotting
Anfeng Xu
Biqiao Zhang
Shuyu Kong
Yiteng Huang
Zhaojun Yang
Sangeeta Srivastava
Ming Sun
29
5
0
13 Sep 2024
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation
  for Embedding Undetectable Vulnerabilities on Speech Recognition
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition
Wenhan Yao
Jiangkun Yang
yongqiang He
Jia Liu
Weiping Wen
34
1
0
16 Jun 2024
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
Zhuojiang Cai
Yuhan Ma
Feng Lu
22
0
0
26 Jan 2024
CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
R. S. Srinivasa
Jaejin Cho
Chouchang Yang
Yashas Malur Saidutta
Ching Hua Lee
Yilin Shen
Hongxia Jin
VLM
21
8
0
26 Sep 2023
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined
  Keywords
PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords
Yong-Hyeok Lee
Namhyun Cho
24
18
0
31 Aug 2023
Towards Stealthy Backdoor Attacks against Speech Recognition via
  Elements of Sound
Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound
Hanbo Cai
Pengcheng Zhang
Hai Dong
Yan Xiao
Stefanos Koffas
Yiming Li
AAML
21
28
0
17 Jul 2023
PQA: Exploring the Potential of Product Quantization in DNN Hardware
  Acceleration
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration
Ahmed F. AbouElhamayed
Angela Cui
Javier Fernandez-Marques
Nicholas D. Lane
Mohamed S. Abdelfattah
MQ
16
4
0
25 May 2023
Differentially Private Adapters for Parameter Efficient Acoustic
  Modeling
Differentially Private Adapters for Parameter Efficient Acoustic Modeling
Chun-Wei Ho
Chao-Han Huck Yang
Sabato Marco Siniscalchi
16
1
0
19 May 2023
Contrastive Speech Mixup for Low-resource Keyword Spotting
Contrastive Speech Mixup for Low-resource Keyword Spotting
Dianwen Ng
Ruixi Zhang
J. Yip
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
E. Chng
B. Ma
30
10
0
02 May 2023
Small-footprint slimmable networks for keyword spotting
Small-footprint slimmable networks for keyword spotting
Zuhaib Akhtar
Mohammad Omar Khursheed
Dongsu Du
Yuzong Liu
21
2
0
21 Apr 2023
Unified Keyword Spotting and Audio Tagging on Mobile Devices with
  Transformers
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
27
4
0
03 Mar 2023
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to
  Real-Network Performance
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance
Haotong Qin
Xudong Ma
Yifu Ding
X. Li
Yang Zhang
Zejun Ma
Jiakai Wang
Jie Luo
Xianglong Liu
MQ
38
20
0
13 Nov 2022
Metric Learning for User-defined Keyword Spotting
Metric Learning for User-defined Keyword Spotting
Jaemin Jung
You-kyong. Kim
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Youngjoon Jang
Joon Son Chung
32
9
0
01 Nov 2022
Taxonomic Classification of IoT Smart Home Voice Control
Taxonomic Classification of IoT Smart Home Voice Control
M. Hewitt
H. Cunningham
11
1
0
24 Oct 2022
UniKW-AT: Unified Keyword Spotting and Audio Tagging
UniKW-AT: Unified Keyword Spotting and Audio Tagging
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
34
3
0
23 Sep 2022
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra
  Contrastive Regularization
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization
Dianwen Ng
J. Yip
Tanmay Surana
Zhao Yang
Chong Zhang
Yukun Ma
Chongjia Ni
Chng Eng Siong
B. Ma
27
6
0
14 Sep 2022
Fall Detection from Audios with Audio Transformers
Fall Detection from Audios with Audio Transformers
Prabhjot Kaur
Qifan Wang
Weisong Shi
8
16
0
23 Aug 2022
Boosting Tail Neural Network for Realtime Custom Keyword Spotting
Boosting Tail Neural Network for Realtime Custom Keyword Spotting
Sihao Xue
Qianyao Shen
Guoqing Li
19
0
0
24 May 2022
Points to Patches: Enabling the Use of Self-Attention for 3D Shape
  Recognition
Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition
Axel Berg
Magnus Oskarsson
Mark O'Connor
3DPC
ViT
19
26
0
08 Apr 2022
Delta Keyword Transformer: Bringing Transformers to the Edge through
  Dynamically Pruned Multi-Head Self-Attention
Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention
Zuzana Jelčicová
Marian Verhelst
26
5
0
20 Mar 2022
Learning Audio Representations with MLPs
Learning Audio Representations with MLPs
Mashrur M. Morshed
Ahmad Omar Ahsan
H. Mahmud
Md. Kamrul Hasan
19
4
0
16 Mar 2022
Spanish and English Phoneme Recognition by Training on Simulated
  Classroom Audio Recordings of Collaborative Learning Environments
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Mario Esparza
17
0
0
21 Feb 2022
Exploiting Hybrid Models of Tensor-Train Networks for Spoken Command
  Recognition
Exploiting Hybrid Models of Tensor-Train Networks for Spoken Command Recognition
Jun Qi
Javier Tejedor
17
4
0
11 Jan 2022
SSAST: Self-Supervised Audio Spectrogram Transformer
SSAST: Self-Supervised Audio Spectrogram Transformer
Yuan Gong
Cheng-I Jeff Lai
Yu-An Chung
James R. Glass
ViT
30
268
0
19 Oct 2021
Colorization Transformer
Colorization Transformer
Manoj Kumar
Dirk Weissenborn
Nal Kalchbrenner
ViT
218
140
0
08 Feb 2021
Video Transformer Network
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
193
421
0
01 Feb 2021
Learning Efficient Representations for Keyword Spotting with Triplet
  Loss
Learning Efficient Representations for Keyword Spotting with Triplet Loss
R. Vygon
N. Mikhaylovskiy
DML
SSL
60
63
0
12 Jan 2021
1