v1v2 (latest)

Self-Training for End-to-End Speech Translation

Interspeech (Interspeech), 2020

3 June 2020

Papers citing "Self-Training for End-to-End Speech Translation"

41 / 41 papers shown

Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis

...

245

29 Sep 2025

DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

...

331

07 Apr 2025

When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation

423

01 Feb 2025

Speech Translation Refinement using Large Language Models

1.0K

28 Jan 2025

Representation Purification for End-to-End Speech TranslationInternational Conference on Computational Linguistics (COLING), 2024

219

05 Dec 2024

LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models

Kazi Ahmed Asif Fuad

Lizhong Chen

405

01 Nov 2024

CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving

Bhavani Shankar

Preethi Jyothi

Pushpak Bhattacharyya

356

16 Jun 2024

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

288

29 Sep 2023

Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic TranscriptsInternational Workshop on Spoken Language Translation (IWSLT), 2023

Rebekka Hubert

Artem Sokolov

Stefan Riezler

354

17 Jul 2023

Recent Advances in Direct Speech-to-text TranslationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023

Jingbo Zhu

384

20 Jun 2023

CMOT: Cross-modal Mixup via Optimal Transport for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Yan Zhou

Qingkai Fang

Yang Feng

497

24 May 2023

Improving speech translation by fusing speech and textConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

273

23 May 2023

Duplex Diffusion Models Improve Speech-to-Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xianchao Wu

DiffM

272

22 May 2023

DUB: Discrete Unit Back-translation for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

271

19 May 2023

Improving Speech Translation by Cross-Modal Multi-Grained Contrastive LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

263

20 Apr 2023

Pre-training for Speech Translation: CTC Meets Optimal TransportInternational Conference on Machine Learning (ICML), 2023

459

27 Jan 2023

Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

281

20 Dec 2022

WACO: Word-Aligned Contrastive Learning for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Siqi Ouyang

Rong Ye

Lei Li

391

19 Dec 2022

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

403

15 Dec 2022

Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text DataAAAI Conference on Artificial Intelligence (AAAI), 2022

Jingbo Zhu

222

04 Dec 2022

Efficient Speech Translation with Pre-trained Models

Zhaolin Li

Jan Niehues

188

09 Nov 2022

Cross-modal Contrastive Learning for Speech TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Rong Ye

Mingxuan Wang

Lei Li

SSL

280

105

05 May 2022

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data AugmentationInterspeech (Interspeech), 2022

Yossi Adi

337

06 Apr 2022

STEMM: Self-learning with Speech-text Manifold Mixup for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Lei Li

357

110

20 Mar 2022

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task

Xian Li

333

12 Jul 2021

Kosp2e: Korean Speech to English Translation Corpus

140

06 Jul 2021

The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021International Workshop on Spoken Language Translation (IWSLT), 2021

Yuchen Hu

336

01 Jul 2021

The Volctrans Neural Speech Translation System for IWSLT 2021International Workshop on Spoken Language Translation (IWSLT), 2021

Lei Li

355

16 May 2021

Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation EncodersAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Jingbo Zhu

318

12 May 2021

Improving Cross-Lingual Reading Comprehension with Self-Training

233

08 May 2021

Learning Shared Semantic Space for Speech-to-Text TranslationFindings (Findings), 2021

Chi Han

Mingxuan Wang

Heng Ji

Lei Li

578

07 May 2021

End-to-end Speech Translation via Cross-modal Progressive TrainingInterspeech (Interspeech), 2021

Rong Ye

Mingxuan Wang

Lei Li

288

21 Apr 2021

Back-Training excels Self-Training at Unsupervised Domain Adaptation of Question Generation and Passage RetrievalConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Siva Reddy

308

18 Apr 2021

Large-Scale Self- and Semi-Supervised Learning for Speech TranslationInterspeech (Interspeech), 2021

326

14 Apr 2021

Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech TranslationInternational Conference on Machine Learning (ICML), 2021

361

10 Feb 2021

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and InterpretationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

784

675

02 Jan 2021

Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

417

25 Oct 2020

SlimIPL: Language-Model-Free Iterative Pseudo-Labeling

617

22 Oct 2020

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks

319

21 Oct 2020

fairseq S2T: Fast Speech-to-Text Modeling with fairseq

477

324

11 Oct 2020

Consecutive Decoding for Speech-to-text TranslationAAAI Conference on Artificial Intelligence (AAAI), 2020

Lei Li

487

21 Sep 2020