ResearchTrend.AI

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

6 December 2023
Dominik Wagner
Alexander W. Churchill
Siddharth Sigtia
Panayiotis Georgiou
Matt Mirsamadi
Aarshee Mishra
Erik Marchi
arXiv: 2312.03632

Papers citing "Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models"

7 / 7 papers shown
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models
Ognjen Rudovic
Pranay Dighe
Yi Su
Vineet Garg
Sameer Dharur
Xiaochuan Niu
Ahmed H. Abdelaziz
Saurabh N. Adya
Ahmed H. Tewfik
24
0
0
28 Oct 2024
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Shruti Palaskar
Oggi Rudovic
Sameer Dharur
Florian Pesce
G. Krishna
Aswin Sivaraman
Jack Berkowitz
Ahmed Hussen Abdelaziz
Saurabh N. Adya
Ahmed H. Tewfik
VLM
49
0
0
13 Jun 2024
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Dominik Wagner
Alexander W. Churchill
Siddharth Sigtia
Panayiotis Georgiou
Matt Mirsamadi
Aarshee Mishra
Erik Marchi
38
6
0
21 Mar 2024
Contrastive Speech Mixup for Low-resource Keyword Spotting
Dianwen Ng
Ruixi Zhang
J. Yip
Chong Zhang
Yukun Ma
Trung Hieu Nguyen
Chongjia Ni
E. Chng
B. Ma
22
10
0
02 May 2023
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR
Pranay Dighe
Prateeth Nayak
Oggi Rudovic
Erik Marchi
Xiaochuan Niu
Ahmed H. Tewfik
44
4
0
21 Oct 2022
Multi-task Learning for Speaker Verification and Voice Trigger Detection
Siddharth Sigtia
Erik Marchi
S. Kajarekar
Devang Naik
J. Bridle
30
29
0
26 Jan 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
223
4,424
0
23 Jan 2020