SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

27 February 2020

Papers citing "SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation"

Recent Advances in Direct Speech-to-text Translation

International Joint Conference on Artificial Intelligence (IJCAI), 2023

Jingbo Zhu

377

20 Jun 2023

Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

241

14 Mar 2023

Improved Long-Form Spoken Language Translation with Large Language Models

230

19 Dec 2022

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Ioannis Tsiamas

José A. R. Fonollosa

Marta R. Costa-jussá

351

19 Dec 2022

WACO: Word-Aligned Contrastive Learning for Speech Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Siqi Ouyang

Rong Ye

Lei Li

391

19 Dec 2022

M3ST: Mix at Three Levels for Speech Translation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

338

07 Dec 2022

Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation

Interspeech (Interspeech), 2022

280

18 May 2022

Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Zengrui Jin

Mengzhe Geng

Jiajun Deng

Tianzi Wang

Shujie Hu

Guinan Li

Xunying Liu

290

13 May 2022

Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Tsz Kin Lam

Shigehiko Schamoni

Stefan Riezler

247

16 Mar 2022

Learning When to Translate for Streaming Speech

Qianqian Dong

Yaoming Zhu

Mingxuan Wang

Lei Li

493

15 Sep 2021

Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring

International Conference on Information and Knowledge Management (CIKM), 2021

216

30 Aug 2021

Translatotron 2: High-quality direct speech-to-speech translation with voice preservation

International Conference on Machine Learning (ICML), 2021

Ye Jia

Michelle Tadmor Ramanovich

Tal Remez

Roi Pomerantz

526

103

19 Jul 2021

Large-Scale Self- and Semi-Supervised Learning for Speech Translation

Interspeech (Interspeech), 2021

326

14 Apr 2021

Tight Integrated End-to-End Training for Cascaded Speech Translation

Spoken Language Technology Workshop (SLT), 2020

229

24 Nov 2020

Self-Supervised Representations Improve End-to-End Speech Translation

289

22 Jun 2020

Unsupervised Morphological Paradigm Completion

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

262

03 May 2020