2211.06892
Cited By
OverFlow: Putting flows on top of neural transducers for better TTS
13 November 2022
Shivam Mehta
Ambika Kirkland
Harm Lameris
Jonas Beskow
Éva Székely
G. Henter
AI4TS
Papers citing
"OverFlow: Putting flows on top of neural transducers for better TTS"
11 / 11 papers shown
A Language Modeling Approach to Diacritic-Free Hebrew TTS
Amit Roth
A. Turetzky
Yossi Adi
27
2
0
16 Jul 2024
Source Tracing of Audio Deepfake Systems
Nicholas Klein
Tianxiang Chen
Hemlata Tak
Ricardo Casal
Elie Khoury
19
4
0
10 Jul 2024
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech
Shivam Mehta
Harm Lameris
Rajiv Punmiya
Jonas Beskow
Éva Székely
G. Henter
23
1
0
08 Jun 2024
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis
Shivam Mehta
Anna Deichler
Jim O'Regan
Birger Moëll
Jonas Beskow
G. Henter
Simon Alexanderson
34
4
0
30 Apr 2024
MLAAD: The Multi-Language Audio Anti-Spoofing Dataset
Nicolas M. Müller
Piotr Kawa
Wei Herng Choong
Edresson Casanova
Eren Golge
Thorsten Müller
P. Syga
Philip Sperl
Konstantin Böttinger
14
35
0
17 Jan 2024
Unified speech and gesture synthesis using flow matching
Shivam Mehta
Ruibo Tu
Simon Alexanderson
Jonas Beskow
Éva Székely
G. Henter
22
3
0
08 Oct 2023
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset
Ze Liu
17
0
0
08 Oct 2023
Matcha-TTS: A fast TTS architecture with conditional flow matching
Shivam Mehta
Ruibo Tu
Jonas Beskow
Éva Székely
G. Henter
14
69
0
06 Sep 2023
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Shivam Mehta
Siyang Wang
Simon Alexanderson
Jonas Beskow
Éva Székely
G. Henter
DiffM
24
14
0
15 Jun 2023
Robust Classification using Hidden Markov Models and Mixtures of Normalizing Flows
Anubhab Ghosh
Antoine Honoré
Dong Liu
G. Henter
S. Chatterjee
BDL
VLM
16
7
0
15 Feb 2021
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Yu Zhang
James Qin
Daniel S. Park
Wei Han
Chung-Cheng Chiu
Ruoming Pang
Quoc V. Le
Yonghui Wu
VLM
SSL
136
307
0
20 Oct 2020