ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.00254
  4. Cited By
Multilingual End-to-End Speech Translation

Multilingual End-to-End Speech Translation

1 October 2019
H. Inaguma
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
    LRM
ArXivPDFHTML

Papers citing "Multilingual End-to-End Speech Translation"

50 / 57 papers shown
Title
AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
Wuwei Huang
Dexin Wang
Deyi Xiong
72
4
0
18 Mar 2025
Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation
Wuwei Huang
Renren Jin
Wen Zhang
Jian Luan
Bin Wang
Deyi Xiong
66
1
0
14 Mar 2025
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
Hyunji Lee
Danni Liu
Supriti Sinhamahapatra
Jan Niehues
106
0
0
21 Feb 2025
Investigating Decoder-only Large Language Models for Speech-to-text
  Translation
Investigating Decoder-only Large Language Models for Speech-to-text Translation
Chao-Wei Huang
Hui Lu
Hongyu Gong
H. Inaguma
Ilia Kulikov
Ruslan Mavlyutov
Sravya Popuri
AuLLM
LRM
55
6
0
03 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark J. F. Gales
Kate Knill
22
1
0
01 Jul 2024
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Yanis Labrak
Adel Moumen
Richard Dufour
Mickael Rouvier
ELM
LM&MA
MedIm
29
0
0
09 Jun 2024
SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
Marco Gaido
Sara Papi
Matteo Negri
Mauro Cettolo
L. Bentivogli
37
1
0
17 May 2024
Efficient Training for Multilingual Visual Speech Recognition:
  Pre-training with Discretized Visual Speech Representation
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation
Minsu Kim
Jeong Hun Yeo
Se Jin Park
J. Choi
Y. Ro
27
5
0
18 Jan 2024
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation
  with Unified Audio-Visual Speech Representation
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
J. Choi
Se Jin Park
Minsu Kim
Y. Ro
27
12
0
05 Dec 2023
End-to-End Speech-to-Text Translation: A Survey
End-to-End Speech-to-Text Translation: A Survey
Nivedita Sethiya
Chandresh Kumar Maurya
24
7
0
02 Dec 2023
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech
  Translation
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Juan Pablo Zuluaga
Zhaocheng Huang
Xing Niu
Rohit Paturi
S. Srinivasan
Prashant Mathur
Brian Thompson
Marcello Federico
BDL
29
2
0
01 Nov 2023
Towards a Deep Understanding of Multilingual End-to-End Speech
  Translation
Towards a Deep Understanding of Multilingual End-to-End Speech Translation
Haoran Sun
Xiaohu Zhao
Yikun Lei
Shaolin Zhu
Deyi Xiong
37
8
0
31 Oct 2023
How To Build Competitive Multi-gender Speech Translation Models For
  Controlling Speaker Gender Translation
How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation
Marco Gaido
Dennis Fucci
Matteo Negri
L. Bentivogli
37
2
0
23 Oct 2023
Recent Advances in Direct Speech-to-text Translation
Recent Advances in Direct Speech-to-text Translation
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu
19
18
0
20 Jun 2023
SLTUNET: A Simple Unified Model for Sign Language Translation
SLTUNET: A Simple Unified Model for Sign Language Translation
Biao Zhang
Mathias Müller
Rico Sennrich
SLR
43
33
0
02 May 2023
Efficient CTC Regularization via Coarse Labels for End-to-End Speech
  Translation
Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation
Biao Zhang
Barry Haddow
Rico Sennrich
17
3
0
21 Feb 2023
Pre-training for Speech Translation: CTC Meets Optimal Transport
Pre-training for Speech Translation: CTC Meets Optimal Transport
Hang Le
Hongyu Gong
Changhan Wang
J. Pino
Benjamin Lecouteux
D. Schwab
OT
13
20
0
27 Jan 2023
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and
  Translation Using Neural Transducers
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Peidong Wang
Eric Sun
Jian Xue
Yu-Huan Wu
Long Zhou
Yashesh Gaur
Shujie Liu
Jinyu Li
26
8
0
05 Nov 2022
A Weakly-Supervised Streaming Multilingual Speech Model with Truly
  Zero-Shot Capability
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Jian Xue
Peidong Wang
Jinyu Li
Eric Sun
29
10
0
04 Nov 2022
Does Joint Training Really Help Cascaded Speech Translation?
Does Joint Training Really Help Cascaded Speech Translation?
Viet Anh Khoa Tran
David Thulke
Yingbo Gao
Christian Herold
Hermann Ney
22
3
0
24 Oct 2022
Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation
Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation
Chen Wang
Yuchen Liu
Boxing Chen
Jiajun Zhang
Wei Luo
Zhongqiang Huang
Chengqing Zong
33
10
0
18 Oct 2022
Direct Speech Translation for Automatic Subtitling
Direct Speech Translation for Automatic Subtitling
Sara Papi
Marco Gaido
Alina Karakanta
Mauro Cettolo
Matteo Negri
Marco Turchi
54
11
0
27 Sep 2022
Dodging the Data Bottleneck: Automatic Subtitling with Automatically
  Segmented ST Corpora
Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora
Sara Papi
Alina Karakanta
Matteo Negri
Marco Turchi
28
8
0
21 Sep 2022
Revisiting End-to-End Speech-to-Text Translation From Scratch
Revisiting End-to-End Speech-to-Text Translation From Scratch
Biao Zhang
Barry Haddow
Rico Sennrich
16
36
0
09 Jun 2022
Blockwise Streaming Transformer for Spoken Language Understanding and
  Simultaneous Speech Translation
Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation
Keqi Deng
Shinji Watanabe
Jiatong Shi
Siddhant Arora
17
15
0
19 Apr 2022
Multilingual Simultaneous Speech Translation
Multilingual Simultaneous Speech Translation
Shashank Subramanya
J. Niehues
14
3
0
28 Mar 2022
Making AI 'Smart': Bridging AI and Cognitive Science
Making AI 'Smart': Bridging AI and Cognitive Science
Madhav Agarwal
Siddhant Bansal
31
0
0
31 Dec 2021
Recent Advances in End-to-End Automatic Speech Recognition
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
29
363
0
02 Nov 2021
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual
  Shared Task
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Yun Tang
Hongyu Gong
Xian Li
Changhan Wang
J. Pino
Holger Schwenk
Naman Goyal
36
10
0
14 Jul 2021
ESPnet-ST IWSLT 2021 Offline Speech Translation System
ESPnet-ST IWSLT 2021 Offline Speech Translation System
H. Inaguma
Shun Kiyono
Nelson Enrique Yalta Soplin
Pengcheng Guo
Jun Suzuki
Kevin Duh
Shinji Watanabe
3DV
35
2
0
01 Jul 2021
Pay Better Attention to Attention: Head Selection in Multilingual and
  Multi-Domain Sequence Modeling
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling
Hongyu Gong
Yun Tang
J. Pino
Xian Li
30
11
0
21 Jun 2021
Learning Shared Semantic Space for Speech-to-Text Translation
Learning Shared Semantic Space for Speech-to-Text Translation
Chi Han
Mingxuan Wang
Heng Ji
Lei Li
18
76
0
07 May 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
31
44
0
14 Apr 2021
Source and Target Bidirectional Knowledge Distillation for End-to-end
  Speech Translation
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
H. Inaguma
Tatsuya Kawahara
Shinji Watanabe
29
42
0
13 Apr 2021
The Multilingual TEDx Corpus for Speech Recognition and Translation
The Multilingual TEDx Corpus for Speech Recognition and Translation
Elizabeth Salesky
Matthew Wiesner
Jacob Bremerman
R. Cattoni
Matteo Negri
Marco Turchi
Douglas W. Oard
Matt Post
9
119
0
02 Feb 2021
The 2020 ESPnet update: new features, broadened applications,
  performance improvements, and future plans
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
39
38
0
23 Dec 2020
Breeding Gender-aware Direct Speech Translation Systems
Breeding Gender-aware Direct Speech Translation Systems
Marco Gaido
Beatrice Savoldi
L. Bentivogli
Matteo Negri
Marco Turchi
40
20
0
09 Dec 2020
Tight Integrated End-to-End Training for Cascaded Speech Translation
Tight Integrated End-to-End Training for Cascaded Speech Translation
Parnia Bahar
Tobias Bieschke
Ralf Schluter
Hermann Ney
37
26
0
24 Nov 2020
Enabling Zero-shot Multilingual Spoken Language Translation with
  Language-Specific Encoders and Decoders
Enabling Zero-shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders
Carlos Escolano
Marta R. Costa-jussá
José A. R. Fonollosa
Carlos Segura
25
18
0
02 Nov 2020
Dual-decoder Transformer for Joint Automatic Speech Recognition and
  Multilingual Speech Translation
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Hang Le
J. Pino
Changhan Wang
Jiatao Gu
D. Schwab
Laurent Besacier
39
82
0
02 Nov 2020
Orthros: Non-autoregressive End-to-end Speech Translation with
  Dual-decoder
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
H. Inaguma
Yosuke Higuchi
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
19
22
0
25 Oct 2020
A Technical Report: BUT Speech Translation Systems
A Technical Report: BUT Speech Translation Systems
Hari Krishna Vydana
L. Burget
J. Černocký
24
0
0
22 Oct 2020
Towards End-to-End In-Image Neural Machine Translation
Towards End-to-End In-Image Neural Machine Translation
Elman Mansimov
Mitchell Stern
M. Chen
Orhan Firat
Jakob Uszkoreit
Puneet Jain
27
25
0
20 Oct 2020
Consecutive Decoding for Speech-to-text Translation
Consecutive Decoding for Speech-to-text Translation
Qianqian Dong
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
SLR
29
40
0
21 Sep 2020
"Listen, Understand and Translate": Triple Supervision Decouples
  End-to-end Speech-to-text Translation
"Listen, Understand and Translate": Triple Supervision Decouples End-to-end Speech-to-text Translation
Qianqian Dong
Rong Ye
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
33
3
0
21 Sep 2020
CoVoST 2 and Massively Multilingual Speech-to-Text Translation
CoVoST 2 and Massively Multilingual Speech-to-Text Translation
Changhan Wang
Anne Wu
J. Pino
SLR
19
71
0
20 Jul 2020
Self-Supervised Representations Improve End-to-End Speech Translation
Self-Supervised Representations Improve End-to-End Speech Translation
Anne Wu
Changhan Wang
J. Pino
Jiatao Gu
SSL
25
40
0
22 Jun 2020
Self-Training for End-to-End Speech Translation
Self-Training for End-to-End Speech Translation
J. Pino
Qiantong Xu
Xutai Ma
M. Dousti
Yun Tang
33
59
0
03 Jun 2020
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in
  Multitask End-to-End Speech Translation
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation
Shun-Po Chuang
Tzu-Wei Sung
Alexander H. Liu
Hung-yi Lee
18
19
0
21 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
ESPnet-ST: All-in-One Speech Translation Toolkit
H. Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
23
161
0
21 Apr 2020
12
Next