2208.13321
Cited By
Turn-Taking Prediction for Natural Conversational Speech
Interspeech, 2022
29 August 2022
Shuo-yiin Chang
Yue Liu
Tara N. Sainath
Chaoyang Zhang
Trevor Strohman
Qiao Liang
Yanzhang He
Papers citing "Turn-Taking Prediction for Natural Conversational Speech"
15 / 15 papers shown
ASR-Synchronized Speaker-Role Diarization
Arindam Ghosh
Mark C. Fuhs
Bongjun Kim
Anurag Chowdhury
Monika Woszczyna
239
0
0
14 Jul 2025
Streaming Endpointer for Spoken Dialogue using Neural Audio Codecs and Label-Delayed Training
Sathvik Udupa
Shinji Watanabe
Petr Schwarz
Jan "Honza" Černocký
277
2
0
08 Jun 2025
CoHear: Conversation Enhancement via Multi-Earphone Collaboration
Lixing He
Yunqi Guo
Zhenyu Yan
Guoliang Xing
270
0
0
27 May 2025
Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yuxin Lin
Yinglin Zheng
Ming Zeng
Wangzheng Shi
348
1
0
19 May 2025
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant
Hyunjong Ok
Suho Yoo
Jaeho Lee
347
2
0
30 Mar 2025
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation
Wenyi Yu
Siyin Wang
Xiaoyu Yang
Xianzhao Chen
Xiaohai Tian
Jing Zhang
Guangzhi Sun
Lu Lu
Longji Xu
Chao Zhang
AuLLM
384
26
0
27 Nov 2024
Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jinhan Wang
Long Chen
Aparna Khare
A. Raju
Pranav Dheram
Di He
Minhua Wu
A. Stolcke
Venkatesh Ravichandran
236
19
0
26 Jan 2024
Two-pass Endpoint Detection for Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
A. Raju
Aparna Khare
Di He
Ilya Sklyar
Long Chen
...
Zhe Zhang
Colin Vaz
Venkatesh Ravichandran
Roland Maas
Ariya Rastrow
270
3
0
17 Jan 2024
STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Leon Liyang Zhang
Jiarui Lu
Joel Ruben Antony Moniz
Aditya Kulkarni
Dhivya Piraviperumal
Tien Dung Tran
Nicholas Tzou
Hong-ye Yu
LLMSV
208
0
0
25 Oct 2023
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Interspeech, 2023
Shaan Bijwadia
Shuo-yiin Chang
Weiran Wang
Zhong Meng
Hao Zhang
Tara N. Sainath
188
3
0
14 Aug 2023
Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
Interspeech, 2023
Wenjie Huang
Hao Zhang
Shankar Kumar
Shuo-yiin Chang
Tara N. Sainath
265
3
0
28 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
153
1
0
23 Mar 2023
Speaker Change Detection for Transformer Transducer ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jian Wu
Zhuo Chen
Min Hu
Xiong Xiao
Jinyu Li
206
5
0
16 Feb 2023
E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Wenjie Huang
Shuo-yiin Chang
Tara N. Sainath
Yanzhang He
David Rybach
R. David
Rohit Prabhavalkar
Cyril Allauzen
Cal Peyser
Trevor Strohman
281
6
0
28 Nov 2022
Conversation-oriented ASR with multi-look-ahead CBS architecture
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Huaibo Zhao
S. Fujie
Tetsuji Ogawa
Jin Sakuma
Yusuke Kida
Tetsunori Kobayashi
298
3
0
02 Nov 2022