Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection

22 May 2025

Gopala Anumanchipalli

Papers citing "Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection"

Local MAP Sampling for Diffusion Models

Shaorong Zhang

Rob Brekelmans

Greg Ver Steeg

147

07 Oct 2025

Deploying UDM Series in Real-Life Stuttered Speech Applications: A Clinical Evaluation Framework

109

17 Sep 2025

A Comparative Study of Controllability, Explainability, and Performance in Dysfluency Detection Models

108

25 Aug 2025

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

...

Gopala Krishna Anumanchipalli

134

25 Aug 2025

Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling

...

Gopala Anumanchipalli

135

18 Jul 2025

Seamless Dysfluent Speech Text Alignment for Disordered Speech Analysis

...

Gopala Anumanchipalli

125

05 Jun 2025

Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection

Xuanru Zhou

Cheol Jun Cho

...

Gopala Anumanchipalli

187

20 Sep 2024

Self-supervised Speech Models for Word-Level Stuttered Speech Detection

Spoken Language Technology Workshop (SLT), 2024

Yi-Jen Shih

Zoi Gkalitsiou

A. Dimakis

David Harwath

240

16 Sep 2024

Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection

Spoken Language Technology Workshop (SLT), 2024

Xuanru Zhou

Cheol Jun Cho

...

Gopala Anumanchipalli

174

15 Sep 2024

SSDM: Scalable Speech Dysfluency Modeling

Neural Information Processing Systems (NeurIPS), 2024

Xuanru Zhou

Gopala Anumanchipalli

AuLLM

290

29 Aug 2024

YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection

Interspeech, 2024

Xuanru Zhou

...

Gopala Krishna Anumanchipalli

247

27 Aug 2024

Large Language Models for Dysfluency Detection in Stuttered Speech

357

16 Jun 2024

From LLM to NMT: Advancing Low-Resource Machine Translation with Claude

Maxim Enis

Mark Hopkins

260

22 Apr 2024

Towards Hierarchical Spoken Language Dysfluency Modeling

Jiachen Lian

Gopala Anumanchipalli

296

18 Jan 2024

Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection

Cheol Jun Cho

Peter Wu

Robin Netzorg

Tingle Li

Gopala Krishna Anumanchipalli

220

20 Dec 2023

Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling

Interspeech, 2023

Theodoros Kouzelis

Georgios Paraskevopoulos

Athanasios Katsamanis

Vassilis Katsouros

226

30 May 2023

Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Gopala K. Anumanchipalli

309

29 Oct 2022

Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition

Interspeech, 2022

Jiachen Lian

A. Black

Louis Goldstein

Gopala Krishna Anumanchipalli

310

01 Apr 2022

Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass

Olabanji Shonibare

Xiaosu Tong

Venkatesh Ravichandran

226

08 Feb 2022

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

...

Jian Wu

1.2K

2,674

26 Oct 2021

Simple and Effective Zero-shot Cross-lingual Phoneme Recognition

Interspeech, 2021

324

116

23 Sep 2021

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

International Conference on Machine Learning (ICML), 2021

295

1,151

11 Jun 2021

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

2.4K

7,387

20 Jun 2020

Universal Phone Recognition with a Multilingual Allophone System

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

...

Antonios Anastasopoulos

David R. Mortensen

Graham Neubig

A. Black

Florian Metze

159

154

26 Feb 2020

Disfluency Detection using a Bidirectional LSTM

Vicky Zayats

Mari Ostendorf

Hannaneh Hajishirzi

162

124

12 Apr 2016