Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.14438
Cited By
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
21 March 2024
Dominik Wagner
Alexander W. Churchill
Siddharth Sigtia
Panayiotis Georgiou
Matt Mirsamadi
Aarshee Mishra
Erik Marchi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Multimodal Approach to Device-Directed Speech Detection with Large Language Models"
5 / 5 papers shown
Title
Large Language Models for Dysfluency Detection in Stuttered Speech
Dominik Wagner
Sebastian P. Bayerl
Ilja Baumann
K. Riedhammer
Elmar Nöth
Tobias Bocklet
35
3
0
16 Jun 2024
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
S. Radhakrishnan
Chao-Han Huck Yang
S. Khan
Rohit Kumar
N. Kiani
D. Gómez-Cabrero
Jesper N. Tegnér
38
47
0
10 Oct 2023
Contrastive Speech Mixup for Low-resource Keyword Spotting
Dianwen Ng
Ruixi Zhang
J. Yip
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
E. Chng
B. Ma
17
10
0
02 May 2023
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR
Pranay Dighe
Prateeth Nayak
Oggi Rudovic
Erik Marchi
Xiaochuan Niu
Ahmed H. Tewfik
39
4
0
21 Oct 2022
Multi-task Learning for Speaker Verification and Voice Trigger Detection
Siddharth Sigtia
Erik Marchi
S. Kajarekar
Devang Naik
J. Bridle
22
28
0
26 Jan 2020
1