Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.14717
Cited By
Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion
26 January 2024
Jinhan Wang
Long Chen
Aparna Khare
A. Raju
Pranav Dheram
Di He
Minhua Wu
A. Stolcke
Venkatesh Ravichandran
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion"
6 / 6 papers shown
Title
ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Siddhant Arora
Yifan Peng
Jiatong Shi
Jinchuan Tian
William Chen
...
Yosuke Kashiwagi
E. Tsunoo
Shuichiro Shimizu
Vaibhav Srivastav
Shinji Watanabe
36
0
0
11 Mar 2025
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Siddhant Arora
Zhiyun Lu
Chung-Cheng Chiu
Ruoming Pang
Shinji Watanabe
43
2
0
03 Mar 2025
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation
Wenyi Yu
Siyin Wang
Xiaoyu Yang
Xianzhao Chen
Xiaohai Tian
J. Zhang
Guangzhi Sun
Lu Lu
Y. Wang
Chao Zhang
AuLLM
64
6
0
27 Nov 2024
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models
Ognjen
Rudovic
Pranay Dighe
Yi Su
Vineet Garg
Sameer Dharur
Xiaochuan Niu
Ahmed H. Abdelaziz
Saurabh N. Adya
Ahmed H. Tewfik
24
0
0
28 Oct 2024
Multilingual Dyadic Interaction Corpus NoXi+J: Toward Understanding Asian-European Non-verbal Cultural Characteristics and their Influences on Engagement
Marius Funk
Shogo Okada
Elisabeth André
21
0
0
09 Sep 2024
TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog
Erik Ekstedt
Gabriel Skantze
34
53
0
21 Oct 2020
1