ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.18932
  4. Cited By
Extending Multilingual Speech Synthesis to 100+ Languages without
  Transcribed Data

Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data

29 February 2024
Takaaki Saeki
Gary Wang
Nobuyuki Morioka
Isaac Elias
Kyle Kastner
Fadi Biadsy
Andrew Rosenberg
Bhuvana Ramabhadran
Heiga Zen
Francoise Beaufays
Hadar Shemtov
ArXivPDFHTML

Papers citing "Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data"

15 / 15 papers shown
Title
Everyday Speech in the Indian Subcontinent
Everyday Speech in the Indian Subcontinent
Utkarsh Pathak
51
1
0
24 Feb 2025
GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot
  Keyword Spotting
GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting
Pai Zhu
Jacob Bartel
Dhruuv Agarwal
Kurt Partridge
Hyun-jin Park
Quan Wang
13
0
0
22 Oct 2024
Zero-shot Cross-lingual Voice Transfer for TTS
Zero-shot Cross-lingual Voice Transfer for TTS
Fadi Biadsy
Youzheng Chen
Isaac Elias
Kyle Kastner
Gary Wang
Andrew Rosenberg
Bhuvana Ramabhadran
30
1
0
20 Sep 2024
Adversarial training of Keyword Spotting to Minimize TTS Data
  Overfitting
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
Hyun Jin Park
Dhruuv Agarwal
Neng Chen
Rentao Sun
Kurt Partridge
...
Jacob Bartel
Kyle Kastner
Gary Wang
Andrew Rosenberg
Quan Wang
25
0
0
20 Aug 2024
FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks
FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks
Min Ma
Yuma Koizumi
Shigeki Karita
Heiga Zen
Jason Riesa
Haruko Ishikawa
M. Bacchiani
VLM
27
4
0
12 Aug 2024
Utilizing TTS Synthesized Data for Efficient Development of Keyword
  Spotting Model
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
H. Park
Dhruuv Agarwal
Neng Chen
Rentao Sun
Kurt Partridge
...
Jacob Bartel
Kyle Kastner
Gary Wang
Andrew Rosenberg
Quan Wang
14
2
0
26 Jul 2024
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low
  Resource Environments
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments
Pai Zhu
Dhruuv Agarwal
Jacob Bartel
Kurt Partridge
H. Park
Quan Wang
31
1
0
23 Jul 2024
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis
  Conditioned on Self-supervised Discrete Speech Representations
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Cheng Gong
Xin Wang
Erica Cooper
Dan Wells
Longbiao Wang
Jianwu Dang
Korin Richmond
Junichi Yamagishi
19
20
0
22 Dec 2023
Scaling Speech Technology to 1,000+ Languages
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
...
Yossi Adi
Xiaohui Zhang
Wei-Ning Hsu
Alexis Conneau
Michael Auli
VLM
73
297
0
22 May 2023
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Yu Zhang
Wei Han
James Qin
Yongqiang Wang
Ankur Bapna
...
Pedro J. Moreno
Chung-Cheng Chiu
J. Schalkwyk
Franccoise Beaufays
Yonghui Wu
VLM
77
250
0
02 Mar 2023
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised
  Learning for Text-To-Speech
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech
Takaaki Saeki
Heiga Zen
Zhehuai Chen
Nobuyuki Morioka
Gary Wang
Yu Zhang
Ankur Bapna
Andrew Rosenberg
Bhuvana Ramabhadran
61
19
0
27 Oct 2022
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on
  Fixed-Point Iteration
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
Yuma Koizumi
Kohei Yatabe
Heiga Zen
M. Bacchiani
DiffM
42
29
0
03 Oct 2022
DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial
  Vector-Quantized Auto-Encoders
DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders
Yanqing Liu
Rui Xue
Lei He
Xu Tan
Sheng Zhao
14
24
0
11 Jul 2022
FLEURS: Few-shot Learning Evaluation of Universal Representations of
  Speech
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Alexis Conneau
Min Ma
Simran Khanuja
Yu Zhang
Vera Axelrod
Siddharth Dalmia
Jason Riesa
Clara E. Rivera
Ankur Bapna
VLM
78
282
0
25 May 2022
Listening while Speaking: Speech Chain by Deep Learning
Listening while Speaking: Speech Chain by Deep Learning
Andros Tjandra
S. Sakti
Satoshi Nakamura
AuLLM
115
165
0
16 Jul 2017
1