ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.02490
  4. Cited By
Self-Training for End-to-End Speech Translation
v1v2 (latest)

Self-Training for End-to-End Speech Translation

Interspeech (Interspeech), 2020
3 June 2020
J. Pino
Qiantong Xu
Xutai Ma
M. Dousti
Yun Tang
ArXiv (abs)PDFHTML

Papers citing "Self-Training for End-to-End Speech Translation"

41 / 41 papers shown
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Tianrui Wang
Haoyu Wang
Meng Ge
Cheng Gong
Chunyu Qiang
...
Xiaobao Wang
Eng Siong Chng
Xie Chen
Longbiao Wang
Jianwu Dang
245
2
0
29 Sep 2025
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation
DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Xinglin Lyu
Wei Tang
Yongqian Li
X. Zhao
Ming Zhu
...
Yaojie Lu
Min Zhang
Daimeng Wei
Hao Yang
Min Zhang
331
0
0
07 Apr 2025
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation
Anna Min
Chenxu Hu
Yi Ren
Hang Zhao
423
5
0
01 Feb 2025
Speech Translation Refinement using Large Language Models
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
1.0K
1
0
28 Jan 2025
Representation Purification for End-to-End Speech Translation
Representation Purification for End-to-End Speech TranslationInternational Conference on Computational Linguistics (COLING), 2024
Chengwei Zhang
Yue Zhou
Rui Zhao
Yidong Chen
Xiaodong Shi
219
5
0
05 Dec 2024
LLM-Ref: Enhancing Reference Handling in Technical Writing with Large
  Language Models
LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models
Kazi Ahmed Asif Fuad
Lizhong Chen
405
2
0
01 Nov 2024
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text
  Interleaving
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving
Bhavani Shankar
Preethi Jyothi
Pushpak Bhattacharyya
356
5
0
16 Jun 2024
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Andrew Rouditchenko
R. Collobert
Tatiana Likhomanenko
VLM
288
6
0
29 Sep 2023
Improving End-to-End Speech Translation by Imitation-Based Knowledge
  Distillation with Synthetic Transcripts
Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic TranscriptsInternational Workshop on Spoken Language Translation (IWSLT), 2023
Rebekka Hubert
Artem Sokolov
Stefan Riezler
354
1
0
17 Jul 2023
Recent Advances in Direct Speech-to-text Translation
Recent Advances in Direct Speech-to-text TranslationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu
384
35
0
20 Jun 2023
CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation
CMOT: Cross-modal Mixup via Optimal Transport for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yan Zhou
Qingkai Fang
Yang Feng
OT
497
40
0
24 May 2023
Improving speech translation by fusing speech and text
Improving speech translation by fusing speech and textConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wenbiao Yin
Zhicheng Liu
Chengqi Zhao
Tao Wang
Jian-Fei Tong
Rong Ye
273
4
0
23 May 2023
Duplex Diffusion Models Improve Speech-to-Speech Translation
Duplex Diffusion Models Improve Speech-to-Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Xianchao Wu
DiffM
272
6
0
22 May 2023
DUB: Discrete Unit Back-translation for Speech Translation
DUB: Discrete Unit Back-translation for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Dong Zhang
Rong Ye
Tom Ko
Mingxuan Wang
Yaqian Zhou
271
34
0
19 May 2023
Improving Speech Translation by Cross-Modal Multi-Grained Contrastive
  Learning
Improving Speech Translation by Cross-Modal Multi-Grained Contrastive LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hao Zhang
Nianwen Si
Yaqi Chen
Wenlin Zhang
Xukui Yang
Dan Qu
Weiqiang Zhang
263
19
0
20 Apr 2023
Pre-training for Speech Translation: CTC Meets Optimal Transport
Pre-training for Speech Translation: CTC Meets Optimal TransportInternational Conference on Machine Learning (ICML), 2023
Hang Le
Hongyu Gong
Changhan Wang
J. Pino
Benjamin Lecouteux
D. Schwab
OT
459
32
0
27 Jan 2023
Joint Speech Transcription and Translation: Pseudo-Labeling with
  Out-of-Distribution Data
Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Mozhdeh Gheini
Tatiana Likhomanenko
Matthias Sperber
Hendra Setiawan
281
7
0
20 Dec 2022
WACO: Word-Aligned Contrastive Learning for Speech Translation
WACO: Word-Aligned Contrastive Learning for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Siqi Ouyang
Rong Ye
Lei Li
391
36
0
19 Dec 2022
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Hirofumi Inaguma
Sravya Popuri
Ilia Kulikov
Peng-Jen Chen
Changhan Wang
Yu-An Chung
Yun Tang
Ann Lee
Shinji Watanabe
J. Pino
403
82
0
15 Dec 2022
Improving End-to-end Speech Translation by Leveraging Auxiliary Speech
  and Text Data
Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text DataAAAI Conference on Artificial Intelligence (AAAI), 2022
Yuhao Zhang
Chen Xu
Bojie Hu
Chunliang Zhang
Tong Xiao
Jingbo Zhu
222
17
0
04 Dec 2022
Efficient Speech Translation with Pre-trained Models
Efficient Speech Translation with Pre-trained Models
Zhaolin Li
Jan Niehues
188
2
0
09 Nov 2022
Cross-modal Contrastive Learning for Speech Translation
Cross-modal Contrastive Learning for Speech TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Rong Ye
Mingxuan Wang
Lei Li
SSL
280
105
0
05 May 2022
Enhanced Direct Speech-to-Speech Translation Using Self-supervised
  Pre-training and Data Augmentation
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data AugmentationInterspeech (Interspeech), 2022
Sravya Popuri
Peng-Jen Chen
Changhan Wang
J. Pino
Yossi Adi
Jiatao Gu
Wei-Ning Hsu
Ann Lee
337
68
0
06 Apr 2022
STEMM: Self-learning with Speech-text Manifold Mixup for Speech
  Translation
STEMM: Self-learning with Speech-text Manifold Mixup for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Qingkai Fang
Rong Ye
Lei Li
Yang Feng
Mingxuan Wang
357
110
0
20 Mar 2022
Improving Speech Translation by Understanding and Learning from the
  Auxiliary Text Translation Task
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Yun Tang
J. Pino
Xian Li
Changhan Wang
Dmitriy Genzel
333
95
0
12 Jul 2021
Kosp2e: Korean Speech to English Translation Corpus
Kosp2e: Korean Speech to English Translation Corpus
Won Ik Cho
Seokhwan Kim
Hyun Chang Cho
N. Kim
140
15
0
06 Jul 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at
  IWSLT 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021International Workshop on Spoken Language Translation (IWSLT), 2021
Dan Liu
Mengge Du
Xiaoxi Li
Yuchen Hu
Lirong Dai
336
23
0
01 Jul 2021
The Volctrans Neural Speech Translation System for IWSLT 2021
The Volctrans Neural Speech Translation System for IWSLT 2021International Workshop on Spoken Language Translation (IWSLT), 2021
Chengqi Zhao
Zhicheng Liu
Jian-Fei Tong
Tao Wang
Mingxuan Wang
Rong Ye
Qianqian Dong
Jun Cao
Lei Li
355
9
0
16 May 2021
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained
  Models into Speech Translation Encoders
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation EncodersAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Chen Xu
Bojie Hu
Yanyang Li
Yuhao Zhang
Shen Huang
Qi Ju
Tong Xiao
Jingbo Zhu
318
85
0
12 May 2021
Improving Cross-Lingual Reading Comprehension with Self-Training
Improving Cross-Lingual Reading Comprehension with Self-Training
Wei-Ping Huang
Chien-yu Huang
Hung-yi Lee
LRM
233
1
0
08 May 2021
Learning Shared Semantic Space for Speech-to-Text Translation
Learning Shared Semantic Space for Speech-to-Text TranslationFindings (Findings), 2021
Chi Han
Mingxuan Wang
Heng Ji
Lei Li
578
86
0
07 May 2021
End-to-end Speech Translation via Cross-modal Progressive Training
End-to-end Speech Translation via Cross-modal Progressive TrainingInterspeech (Interspeech), 2021
Rong Ye
Mingxuan Wang
Lei Li
288
79
0
21 Apr 2021
Back-Training excels Self-Training at Unsupervised Domain Adaptation of
  Question Generation and Passage Retrieval
Back-Training excels Self-Training at Unsupervised Domain Adaptation of Question Generation and Passage RetrievalConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Devang Kulshreshtha
Robert Belfer
Iulian Serban
Siva Reddy
OOD
308
17
0
18 Apr 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Large-Scale Self- and Semi-Supervised Learning for Speech TranslationInterspeech (Interspeech), 2021
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
326
47
0
14 Apr 2021
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining
  and Speech Translation
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech TranslationInternational Conference on Machine Learning (ICML), 2021
Renjie Zheng
Junkun Chen
Mingbo Ma
Liang Huang
361
74
0
10 Feb 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation
  Learning, Semi-Supervised Learning and Interpretation
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and InterpretationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
784
675
0
02 Jan 2021
Orthros: Non-autoregressive End-to-end Speech Translation with
  Dual-decoder
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Hirofumi Inaguma
Yosuke Higuchi
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
417
26
0
25 Oct 2020
SlimIPL: Language-Model-Free Iterative Pseudo-Labeling
SlimIPL: Language-Model-Free Iterative Pseudo-Labeling
Tatiana Likhomanenko
Qiantong Xu
Jacob Kahn
Gabriel Synnaeve
R. Collobert
VLM
617
71
0
22 Oct 2020
A General Multi-Task Learning Framework to Leverage Text Data for Speech
  to Text Tasks
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Yun Tang
J. Pino
Changhan Wang
Xutai Ma
Dmitriy Genzel
319
81
0
21 Oct 2020
fairseq S2T: Fast Speech-to-Text Modeling with fairseq
fairseq S2T: Fast Speech-to-Text Modeling with fairseq
Changhan Wang
Yun Tang
Xutai Ma
Anne Wu
Sravya Popuri
Dmytro Okhonko
J. Pino
VLMLRM
477
324
0
11 Oct 2020
Consecutive Decoding for Speech-to-text Translation
Consecutive Decoding for Speech-to-text TranslationAAAI Conference on Artificial Intelligence (AAAI), 2020
Qianqian Dong
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
SLR
487
45
0
21 Sep 2020
1
Page 1 of 1