ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.02802
  4. Cited By
End-to-End Streaming Keyword Spotting
v1v2 (latest)

End-to-End Streaming Keyword Spotting

6 December 2018
R. Álvarez
Hyun Jin Park
    AI4TS
ArXiv (abs)PDFHTML

Papers citing "End-to-End Streaming Keyword Spotting"

37 / 37 papers shown
Multichannel Keyword Spotting for Noisy Conditions
Multichannel Keyword Spotting for Noisy Conditions
Dzmitry Saladukha
Ivan Koriabkin
Kanstantsin Artsiom
Aliaksei Rak
Nikita Ryzhikov
78
0
0
21 Jul 2025
LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
Pai Zhu
Quan Wang
Dhruuv Agarwal
Kurt Partridge
152
2
0
29 May 2025
GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot
  Keyword Spotting
GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword SpottingSpoken Language Technology Workshop (SLT), 2024
Pai Zhu
Jacob Bartel
Dhruuv Agarwal
Kurt Partridge
Hyun-jin Park
Quan Wang
268
4
0
22 Oct 2024
Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio Sensors
Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio SensorsIEEE Internet of Things Journal (IEEE IoT J.), 2024
Manuele Rusci
Francesco Paci
Marco Fariselli
Eric Flamand
Tinne Tuytelaars
311
6
0
22 Aug 2024
Adversarial training of Keyword Spotting to Minimize TTS Data
  Overfitting
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
Hyun Jin Park
Dhruuv Agarwal
Neng Chen
Rentao Sun
Kurt Partridge
...
Jacob Bartel
Kyle Kastner
Gary Wang
Andrew Rosenberg
Quan Wang
230
2
0
20 Aug 2024
Utilizing TTS Synthesized Data for Efficient Development of Keyword
  Spotting Model
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
Hyun Jin Park
Dhruuv Agarwal
Neng Chen
Rentao Sun
Kurt Partridge
...
Jacob Bartel
Kyle Kastner
Gary Wang
Andrew Rosenberg
Quan Wang
241
6
0
26 Jul 2024
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low
  Resource Environments
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments
Pai Zhu
Dhruuv Agarwal
Jacob Bartel
Kurt Partridge
Hyun Jin Park
Quan Wang
250
3
0
23 Jul 2024
RepCNN: Micro-sized, Mighty Models for Wakeword Detection
RepCNN: Micro-sized, Mighty Models for Wakeword Detection
Arnav Kundu
Prateeth Nayak
Priyanka Padmanabhan
Devang Naik
291
1
0
04 Jun 2024
High-precision Voice Search Query Correction via Retrievable Speech-text
  Embedings
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings
Christopher Li
Gary Wang
Kyle Kastner
Heng Su
Allen Chen
...
Zelin Wu
L. Velikovich
Pat Rondon
D. Caseiro
Petar S. Aleksic
201
2
0
08 Jan 2024
Personalizing Keyword Spotting with Speaker Information
Personalizing Keyword Spotting with Speaker InformationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Beltrán Labrador
Pai Zhu
Guanlong Zhao
Angelo Scorza Scarpati
Quan Wang
Alicia Lozano-Diez
Alex Park
Ignacio López Moreno
208
4
0
06 Nov 2023
PhantomSound: Black-Box, Query-Efficient Audio Adversarial Attack via
  Split-Second Phoneme Injection
PhantomSound: Black-Box, Query-Efficient Audio Adversarial Attack via Split-Second Phoneme InjectionInternational Symposium on Recent Advances in Intrusion Detection (RAID), 2023
Hanqing Guo
Guangjing Wang
Yuanda Wang
Bocheng Chen
Qiben Yan
Li Xiao
AAML
242
14
0
13 Sep 2023
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech
  Recognition Models
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Steven M. Hernandez
Ding Zhao
Shaojin Ding
A. Bruguier
Rohit Prabhavalkar
Tara N. Sainath
Yanzhang He
Ian McGraw
328
12
0
15 Mar 2023
Locale Encoding For Scalable Multilingual Keyword Spotting Models
Locale Encoding For Scalable Multilingual Keyword Spotting ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Pai Zhu
Hyun Jin Park
Alex Park
Angelo Scorza Scarpati
Ignacio López Moreno
227
7
0
25 Feb 2023
Handling the Alignment for Wake Word Detection: A Comparison Between
  Alignment-Based, Alignment-Free and Hybrid Approaches
Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid ApproachesInterspeech (Interspeech), 2023
Vinícius Ribeiro
Yiteng Huang
Yuan Shangguan
Zhaojun Yang
Liting Wan
Ming Sun
257
3
0
17 Feb 2023
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword
  Spotting
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword SpottingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Beltrán Labrador
Guanlong Zhao
Ignacio López Moreno
Angelo Scorza Scarpati
Liam H. Fowl
Quan Wang
161
4
0
11 Nov 2022
LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword
  Spotting
LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting
Haichuan Yang
Zhaojun Yang
Liting Wan
Biqiao Zhang
Yangyang Shi
...
R. Álvarez
Ming Sun
Xin Lei
Raghuraman Krishnamoorthi
Vikas Chandra
195
7
0
09 Nov 2022
HEiMDaL: Highly Efficient Method for Detection and Localization of
  wake-words
HEiMDaL: Highly Efficient Method for Detection and Localization of wake-wordsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Arnav Kundu
Mohammad Samragh Razlighi
Minsik Cho
Priyanka Padmanabhan
Devang Naik
168
5
0
26 Oct 2022
I see what you hear: a vision-inspired method to localize words
I see what you hear: a vision-inspired method to localize wordsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Mohammad Samragh
Arnav Kundu
Ting-Yao Hu
Minsik Cho
Vasu Sharma
A. Shrivastava
Oncel Tuzel
Devang Naik
ObjD
150
2
0
24 Oct 2022
WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming
  Voice Trigger
WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming Voice Trigger
Zixing Zhang
Thorin Farnsworth
Senling Lin
S. Karout
292
2
0
06 Oct 2022
An Anchor-Free Detector for Continuous Speech Keyword Spotting
An Anchor-Free Detector for Continuous Speech Keyword SpottingInterspeech (Interspeech), 2022
Zhiyuan Zhao
Chuanxin Tang
C. Yao
Chong Luo
200
3
0
09 Aug 2022
Production federated keyword spotting via distillation, filtering, and
  joint federated-centralized training
Production federated keyword spotting via distillation, filtering, and joint federated-centralized trainingInterspeech (Interspeech), 2022
Andrew Straiton Hard
Kurt Partridge
Neng Chen
S. Augenstein
Aishanee Shah
...
Sara Ng
Jessica Nguyen
Ignacio López Moreno
Rajiv Mathews
F. Beaufays
FedML
326
16
0
11 Apr 2022
Deep Spoken Keyword Spotting: An Overview
Deep Spoken Keyword Spotting: An OverviewIEEE Access (IEEE Access), 2021
Iván López-Espejo
Zheng-Hua Tan
John H. L. Hansen
Jesper Jensen
253
142
0
20 Nov 2021
End-to-End Open Vocabulary Keyword Search
End-to-End Open Vocabulary Keyword SearchInterspeech (Interspeech), 2021
Bolaji Yusuf
Alican Gök
Batuhan Gündogdu
Murat Saraclar
162
10
0
23 Aug 2021
Multi-task Learning with Cross Attention for Keyword Spotting
Multi-task Learning with Cross Attention for Keyword SpottingAutomatic Speech Recognition & Understanding (ASRU), 2021
T. Higuchi
Anmol Gupta
C. Dhir
237
5
0
15 Jul 2021
Noisy student-teacher training for robust keyword spotting
Noisy student-teacher training for robust keyword spottingInterspeech (Interspeech), 2021
Hyun-jin Park
Pai Zhu
Ignacio López Moreno
Niranjan A. Subrahmanya
NoLa
146
21
0
03 Jun 2021
Personalized Keyphrase Detection using Speaker and Environment
  Information
Personalized Keyphrase Detection using Speaker and Environment InformationInterspeech (Interspeech), 2021
R. Rikhye
Quan Wang
Qiao Liang
Yanzhang He
Ding Zhao
Yiteng Huang
Huang
A. Narayanan
Ian McGraw
260
14
0
28 Apr 2021
Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End
  Metric
Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End MetricIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
A. Shrivastava
Arnav Kundu
C. Dhir
Devang Naik
Oncel Tuzel
283
17
0
02 Nov 2020
AutoKWS: Keyword Spotting with Differentiable Architecture Search
AutoKWS: Keyword Spotting with Differentiable Architecture SearchIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Bo Zhang
WenFeng Li
Qingyuan Li
Weiji Zhuang
Xiangxiang Chu
Yujun Wang
226
26
0
08 Sep 2020
Stacked 1D convolutional networks for end-to-end small footprint voice
  trigger detection
Stacked 1D convolutional networks for end-to-end small footprint voice trigger detectionInterspeech (Interspeech), 2020
T. Higuchi
M. Ghasemzadeh
Kisun You
C. Dhir
140
19
0
08 Aug 2020
Power Consumption Variation over Activation Functions
Power Consumption Variation over Activation Functions
Leon Derczynski
164
8
0
12 Jun 2020
Training Keyword Spotting Models on Non-IID Data with Federated Learning
Training Keyword Spotting Models on Non-IID Data with Federated Learning
Andrew Straiton Hard
Kurt Partridge
Cameron Nguyen
Niranjan A. Subrahmanya
Aishanee Shah
Pai Zhu
Ignacio López Moreno
Rajiv Mathews
OODFedML
277
71
0
21 May 2020
Streaming keyword spotting on mobile devices
Streaming keyword spotting on mobile devices
Oleg Rybakov
Natasha Kononenko
Niranjan A. Subrahmanya
Mirkó Visontai
Stella Laurenzo
AI4TS
417
123
0
14 May 2020
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker
  Verification using CTC-based Soft VAD and Global Query Attention
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
Myunghun Jung
Youngmoon Jung
Jahyun Goo
Hoirin Kim
349
23
0
08 May 2020
Learning To Detect Keyword Parts And Whole By Smoothed Max Pooling
Learning To Detect Keyword Parts And Whole By Smoothed Max PoolingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Hyun-jin Park
Patrick Violette
Niranjan A. Subrahmanya
146
21
0
25 Jan 2020
Metric Learning with Background Noise Class for Few-shot Detection of
  Rare Sound Events
Metric Learning with Background Noise Class for Few-shot Detection of Rare Sound EventsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Kazuki Shimada
Yuichiro Koyama
A. Inoue
306
26
0
30 Oct 2019
Query-by-example on-device keyword spotting
Query-by-example on-device keyword spottingAutomatic Speech Recognition & Understanding (ASRU), 2019
Byeonggeun Kim
Mingu Lee
Jinkyu Lee
Yeonseok Kim
Kyuwoong Hwang
260
54
0
11 Oct 2019
Streaming End-to-end Speech Recognition For Mobile Devices
Streaming End-to-end Speech Recognition For Mobile DevicesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
350
671
0
15 Nov 2018
1
Page 1 of 1