Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2312.03632
Cited By
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models
6 December 2023
Dominik Wagner
Alexander W. Churchill
Siddharth Sigtia
Panayiotis Georgiou
Matt Mirsamadi
Aarshee Mishra
Erik Marchi
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models"
3 / 3 papers shown
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models
Ognjen
Rudovic
Pranay Dighe
Yi Su
Vineet Garg
Sameer Dharur
Xiaochuan Niu
Ahmed H. Abdelaziz
Saurabh N. Adya
Ahmed H. Tewfik
147
0
0
28 Oct 2024
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Interspeech (Interspeech), 2024
Shruti Palaskar
Oggi Rudovic
Sameer Dharur
Florian Pesce
G. Krishna
Aswin Sivaraman
Jack Berkowitz
Ahmed Hussen Abdelaziz
Saurabh N. Adya
Ahmed H. Tewfik
VLM
176
3
0
13 Jun 2024
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Dominik Wagner
Alexander W. Churchill
Siddharth Sigtia
Panayiotis Georgiou
Matt Mirsamadi
Aarshee Mishra
Erik Marchi
264
8
0
21 Mar 2024
1