ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.06487
  4. Cited By
Multilingual Turn-taking Prediction Using Voice Activity Projection
v1v2v3 (latest)

Multilingual Turn-taking Prediction Using Voice Activity Projection

International Conference on Language Resources and Evaluation (LREC), 2024
11 March 2024
K. Inoue
Bing’er Jiang
Erik Ekstedt
Tatsuya Kawahara
Gabriel Skantze
ArXiv (abs)PDFHTMLGithub

Papers citing "Multilingual Turn-taking Prediction Using Voice Activity Projection"

9 / 9 papers shown
Triadic Multi-party Voice Activity Projection for Turn-taking in Spoken Dialogue Systems
Triadic Multi-party Voice Activity Projection for Turn-taking in Spoken Dialogue Systems
Mikey Elmers
K. Inoue
Divesh Lala
Tatsuya Kawahara
186
0
0
10 Jul 2025
Streaming Endpointer for Spoken Dialogue using Neural Audio Codecs and Label-Delayed Training
Streaming Endpointer for Spoken Dialogue using Neural Audio Codecs and Label-Delayed Training
Sathvik Udupa
Shinji Watanabe
Petr Schwarz
Jan ''Honza'' Cernocký
291
2
0
08 Jun 2025
Voice Activity Projection Model with Multimodal Encoders
Voice Activity Projection Model with Multimodal Encoders
Takeshi Saga
Catherine Pelachaud
224
2
0
04 Jun 2025
"Dyadosyncrasy", Idiosyncrasy and Demographic Factors in Turn-Taking
"Dyadosyncrasy", Idiosyncrasy and Demographic Factors in Turn-Taking
Julio Cesar Cavalcanti
Gabriel Skantze
118
1
0
30 May 2025
Visual Cues Support Robust Turn-taking Prediction in Noise
Visual Cues Support Robust Turn-taking Prediction in NoiseInterspeech (Interspeech), 2025
Sam O'Connor Russell
Naomi Harte
319
1
0
28 May 2025
Visual Cues Enhance Predictive Turn-Taking for Two-Party Human Interaction
Visual Cues Enhance Predictive Turn-Taking for Two-Party Human InteractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Sam O'Connor Russell
Naomi Harte
286
4
0
27 May 2025
A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field Experiment
A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field Experiment
K. Inoue
Yuki Okafuji
Jun Baba
Yoshiki Ohira
Katsuya Hyodo
Tatsuya Kawahara
262
3
0
08 Mar 2025
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking DynamicsInternational Conference on Learning Representations (ICLR), 2025
Siddhant Arora
Zhiyun Lu
Chung-Cheng Chiu
Ruoming Pang
Shinji Watanabe
392
30
0
03 Mar 2025
Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity ProjectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
K. Inoue
Divesh Lala
Gabriel Skantze
Tatsuya Kawahara
305
12
0
21 Oct 2024
1
Page 1 of 1