ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.16351
  4. Cited By
Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection
v1v2 (latest)

Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection

22 May 2025
Chenxu Guo
Jiachen Lian
Xuanru Zhou
Jinming Zhang
Shuhe Li
Zongli Ye
Hwi Joo Park
Anaisha Das
Z. Ezzes
Jet M J Vonk
Brittany Morin
Rian Bogley
Lisa Wauters
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
ArXiv (abs)PDFHTML

Papers citing "Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection"

25 / 25 papers shown
Local MAP Sampling for Diffusion Models
Local MAP Sampling for Diffusion Models
Shaorong Zhang
Rob Brekelmans
Greg Ver Steeg
147
1
0
07 Oct 2025
Deploying UDM Series in Real-Life Stuttered Speech Applications: A Clinical Evaluation Framework
Deploying UDM Series in Real-Life Stuttered Speech Applications: A Clinical Evaluation Framework
Eric Zhang
Li Wei
Sarah Chen
Michael Wang
109
0
0
17 Sep 2025
A Comparative Study of Controllability, Explainability, and Performance in Dysfluency Detection Models
A Comparative Study of Controllability, Explainability, and Performance in Dysfluency Detection Models
Eric Zhang
Li Wei
Sarah Chen
Michael Wang
AAML
108
0
0
25 Aug 2025
EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems
EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems
Jingwen Liu
Kan Jen Cheng
Jiachen Lian
Akshay Anand
Rishi Jain
...
Robin Netzorg
Huang-Cheng Chou
Tingle Li
Guan-Ting Lin
Gopala Krishna Anumanchipalli
134
3
0
25 Aug 2025
Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling
Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling
Xuanru Zhou
Jiachen Lian
Cheol Jun Cho
Tejas S. Prabhune
Shuhe Li
...
Rian Bogley
Lisa Wauters
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
135
4
0
18 Jul 2025
Seamless Dysfluent Speech Text Alignment for Disordered Speech Analysis
Seamless Dysfluent Speech Text Alignment for Disordered Speech Analysis
Zongli Ye
Jiachen Lian
Xuanru Zhou
Jinming Zhang
Haodong Li
...
Rian Bogley
Lisa Wauters
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
125
9
0
05 Jun 2025
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
Xuanru Zhou
Jiachen Lian
Cheol Jun Cho
Jingwen Liu
Zongli Ye
...
Jet M J Vonk
Z. Ezzes
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
187
12
0
20 Sep 2024
Self-supervised Speech Models for Word-Level Stuttered Speech Detection
Self-supervised Speech Models for Word-Level Stuttered Speech DetectionSpoken Language Technology Workshop (SLT), 2024
Yi-Jen Shih
Zoi Gkalitsiou
A. Dimakis
David Harwath
240
6
0
16 Sep 2024
Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection
Stutter-Solver: End-to-end Multi-lingual Dysfluency DetectionSpoken Language Technology Workshop (SLT), 2024
Xuanru Zhou
Cheol Jun Cho
Ayati Sharma
Brittany Morin
D. Baquirin
...
Zachary Miller
B. Tee
M. G. Tempini
Jiachen Lian
Gopala Anumanchipalli
174
15
0
15 Sep 2024
SSDM: Scalable Speech Dysfluency Modeling
SSDM: Scalable Speech Dysfluency ModelingNeural Information Processing Systems (NeurIPS), 2024
Jiachen Lian
Xuanru Zhou
Z. Ezzes
Jet M J Vonk
Brittany Morin
D. Baquirin
Zachary Mille
M. G. Tempini
Gopala Anumanchipalli
AuLLM
290
19
0
29 Aug 2024
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency DetectionInterspeech (Interspeech), 2024
Xuanru Zhou
Anshul Kashyap
Steve Li
Ayati Sharma
Brittany Morin
...
Z. Ezzes
Zachary Miller
M. G. Tempini
Jiachen Lian
Gopala Krishna Anumanchipalli
247
20
0
27 Aug 2024
Large Language Models for Dysfluency Detection in Stuttered Speech
Large Language Models for Dysfluency Detection in Stuttered Speech
Dominik Wagner
Sebastian P. Bayerl
Ilja Baumann
Korbinian Riedhammer
Elmar Nöth
Tobias Bocklet
357
16
0
16 Jun 2024
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
Maxim Enis
Mark Hopkins
260
73
0
22 Apr 2024
Towards Hierarchical Spoken Language Dysfluency Modeling
Towards Hierarchical Spoken Language Dysfluency Modeling
Jiachen Lian
Gopala Anumanchipalli
296
23
0
18 Jan 2024
Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and
  Detection
Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection
Jiachen Lian
Carly Feng
Naasir Farooqi
Steve Li
Anshul Kashyap
Cheol Jun Cho
Peter Wu
Robin Netzorg
Tingle Li
Gopala Krishna Anumanchipalli
220
27
0
20 Dec 2023
Weakly-supervised forced alignment of disfluent speech using
  phoneme-level modeling
Weakly-supervised forced alignment of disfluent speech using phoneme-level modelingInterspeech (Interspeech), 2023
Theodoros Kouzelis
Georgios Paraskevopoulos
Athanasios Katsamanis
Vassilis Katsouros
226
9
0
30 May 2023
Articulatory Representation Learning Via Joint Factor Analysis and
  Neural Matrix Factorization
Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix FactorizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jiachen Lian
A. Black
Yijingxiu Lu
Louis Goldstein
Shinji Watanabe
Gopala K. Anumanchipalli
309
23
0
29 Oct 2022
Deep Neural Convolutive Matrix Factorization for Articulatory
  Representation Decomposition
Deep Neural Convolutive Matrix Factorization for Articulatory Representation DecompositionInterspeech (Interspeech), 2022
Jiachen Lian
A. Black
Louis Goldstein
Gopala Krishna Anumanchipalli
310
23
0
01 Apr 2022
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and
  Pass
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass
Olabanji Shonibare
Xiaosu Tong
Venkatesh Ravichandran
226
33
0
08 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
1.2K
2,674
0
26 Oct 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Simple and Effective Zero-shot Cross-lingual Phoneme RecognitionInterspeech (Interspeech), 2021
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
324
116
0
23 Sep 2021
Conditional Variational Autoencoder with Adversarial Learning for
  End-to-End Text-to-Speech
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-SpeechInternational Conference on Machine Learning (ICML), 2021
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
295
1,151
0
11 Jun 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
2.4K
7,387
0
20 Jun 2020
Universal Phone Recognition with a Multilingual Allophone System
Universal Phone Recognition with a Multilingual Allophone SystemIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Xinjian Li
Siddharth Dalmia
Juncheng Billy Li
Matthew Russell Lee
Patrick Littell
...
Antonios Anastasopoulos
David R. Mortensen
Graham Neubig
A. Black
Florian Metze
159
154
0
26 Feb 2020
Disfluency Detection using a Bidirectional LSTM
Disfluency Detection using a Bidirectional LSTM
Vicky Zayats
Mari Ostendorf
Hannaneh Hajishirzi
162
124
0
12 Apr 2016
1