ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.14438
  4. Cited By
A Multimodal Approach to Device-Directed Speech Detection with Large
  Language Models

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

21 March 2024
Dominik Wagner
Alexander W. Churchill
Siddharth Sigtia
Panayiotis Georgiou
Matt Mirsamadi
Aarshee Mishra
Erik Marchi
ArXivPDFHTML

Papers citing "A Multimodal Approach to Device-Directed Speech Detection with Large Language Models"

5 / 5 papers shown
Title
Large Language Models for Dysfluency Detection in Stuttered Speech
Large Language Models for Dysfluency Detection in Stuttered Speech
Dominik Wagner
Sebastian P. Bayerl
Ilja Baumann
K. Riedhammer
Elmar Nöth
Tobias Bocklet
35
3
0
16 Jun 2024
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework
  for Speech Recognition
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
S. Radhakrishnan
Chao-Han Huck Yang
S. Khan
Rohit Kumar
N. Kiani
D. Gómez-Cabrero
Jesper N. Tegnér
38
47
0
10 Oct 2023
Contrastive Speech Mixup for Low-resource Keyword Spotting
Contrastive Speech Mixup for Low-resource Keyword Spotting
Dianwen Ng
Ruixi Zhang
J. Yip
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
E. Chng
B. Ma
17
10
0
02 May 2023
Audio-to-Intent Using Acoustic-Textual Subword Representations from
  End-to-End ASR
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR
Pranay Dighe
Prateeth Nayak
Oggi Rudovic
Erik Marchi
Xiaochuan Niu
Ahmed H. Tewfik
39
4
0
21 Oct 2022
Multi-task Learning for Speaker Verification and Voice Trigger Detection
Multi-task Learning for Speaker Verification and Voice Trigger Detection
Siddharth Sigtia
Erik Marchi
S. Kajarekar
Devang Naik
J. Bridle
22
28
0
26 Jan 2020
1