2002.12231
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
27 February 2020
Arya D. McCarthy
Liezl Puzon
J. Pino
Papers citing
"SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation"
16 / 16 papers shown
Recent Advances in Direct Speech-to-text Translation
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu
377
35
0
20 Jun 2023
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Biao Fu
Minpeng Liao
Kai Fan
Zhongqiang Huang
Boxing Chen
Yidong Chen
Xiaodong Shi
241
8
0
14 Mar 2023
Improved Long-Form Spoken Language Translation with Large Language Models
Arya D. McCarthy
Haotong Zhang
Shankar Kumar
Felix Stahlberg
Axel H. Ng
230
3
0
19 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
351
6
0
19 Dec 2022
WACO: Word-Aligned Contrastive Learning for Speech Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Siqi Ouyang
Rong Ye
Lei Li
391
36
0
19 Dec 2022
M3ST: Mix at Three Levels for Speech Translation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Xuxin Cheng
Qianqian Dong
Fengpeng Yue
Tom Ko
Mingxuan Wang
Yuexian Zou
338
40
0
07 Dec 2022
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Interspeech (Interspeech), 2022
Qianqian Dong
Fengpeng Yue
Tom Ko
Mingxuan Wang
Qibing Bai
Yu Zhang
280
19
0
18 May 2022
Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Zengrui Jin
Mengzhe Geng
Jiajun Deng
Tianzi Wang
Shujie Hu
Guinan Li
Xunying Liu
290
39
0
13 May 2022
Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
247
36
0
16 Mar 2022
Learning When to Translate for Streaming Speech
Qianqian Dong
Yaoming Zhu
Mingxuan Wang
Lei Li
493
36
0
15 Sep 2021
Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring
International Conference on Information and Knowledge Management (CIKM), 2021
Yaman Kumar Singla
Avyakt Gupta
Shaurya Bagga
Changyou Chen
Balaji Krishnamurthy
R. Shah
216
15
0
30 Aug 2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
International Conference on Machine Learning (ICML), 2021
Ye Jia
Michelle Tadmor Ramanovich
Tal Remez
Roi Pomerantz
526
103
0
19 Jul 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Interspeech (Interspeech), 2021
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
326
47
0
14 Apr 2021
Tight Integrated End-to-End Training for Cascaded Speech Translation
Spoken Language Technology Workshop (SLT), 2020
Parnia Bahar
Tobias Bieschke
Ralf Schlüter
Hermann Ney
229
30
0
24 Nov 2020
Self-Supervised Representations Improve End-to-End Speech Translation
Anne Wu
Changhan Wang
J. Pino
Jiatao Gu
SSL
289
43
0
22 Jun 2020
Unsupervised Morphological Paradigm Completion
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Huiming Jin
Liwei Cai
Yihui Peng
Chen Xia
Arya D. McCarthy
Katharina Kann
262
27
0
03 May 2020