Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.06358
Cited By
Speech Translation and the End-to-End Promise: Taking Stock of Where We Are
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
14 April 2020
Matthias Sperber
Matthias Paulik
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Speech Translation and the End-to-End Promise: Taking Stock of Where We Are"
50 / 73 papers shown
MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages
Yexing Du
Kaiyuan Liu
Youcheng Pan
B. Yang
Keqi Deng
Xie Chen
Yang Xiang
Ming Liu
Bin Qin
Y. Wang
LRM
171
1
0
01 Dec 2025
V-SAT: Video Subtitle Annotation Tool
Arpita Kundu
Joyita Chakraborty
Anindita Desarkar
Aritra Sen
Srushti Anil Patil
Vishwanathan Raman
114
0
0
28 Oct 2025
Listening or Reading? Evaluating Speech Awareness in Chain-of-Thought Speech-to-Text Translation
Jacobo Romero-Díaz
Gerard I. Gállego
Oriol Pareras
Federico Costa
Javier Hernando
Cristina España-Bonet
LRM
137
0
0
03 Oct 2025
Vision-Grounded Machine Interpreting: Improving the Translation Process through Visual Cues
Claudio Fantinuoli
202
0
0
28 Sep 2025
Toward Machine Interpreting: Lessons from Human Interpreting Studies
Matthias Sperber
Maureen de Seyssel
Jiajun Bao
Matthias Paulik
AI4CE
193
2
0
11 Aug 2025
PHRASED: Phrase Dictionary Biasing for Speech Translation
Peidong Wang
Jian Xue
Rui Zhao
Junkun Chen
Aswin Shanmugam Subramanian
Jinyu Li
229
1
0
10 Jun 2025
Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages
Andrei Popescu-Belis
Alexis Allemann
Teo Ferrari
Gopal Krishnamani
228
0
0
02 Jun 2025
Different Speech Translation Models Encode and Translate Speaker Gender Differently
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Dennis Fucci
Marco Gaido
Matteo Negri
L. Bentivogli
Marcely Zanon Boito
Giuseppe Attanasio
317
1
0
02 Jun 2025
Spatial Speech Translation: Translating Across Space With Binaural Hearables
International Conference on Human Factors in Computing Systems (CHI), 2025
Tuochao Chen
Qirui Wang
Runlin He
Shyam Gollakota
241
4
0
25 Apr 2025
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Xinglin Lyu
Wei Tang
Yongqian Li
X. Zhao
Ming Zhu
...
Yaojie Lu
Min Zhang
Daimeng Wei
Hao Yang
Min Zhang
323
0
0
07 Apr 2025
Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Wuwei Huang
Renren Jin
Wen Zhang
Jian Luan
Sijin Yu
Deyi Xiong
351
1
0
14 Mar 2025
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
1.0K
1
0
28 Jan 2025
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Tsz Kin Lam
Marco Gaido
Sara Papi
L. Bentivogli
Barry Haddow
487
4
0
04 Jan 2025
Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages
Sparsh Jain
Ashwin Sankar
Devilal Choudhary
Dhairya Suman
Nikhil Narasimhan
Mohammed Safi Ur Rahman Khan
Anoop Kunchukuttan
Mitesh M. Khapra
Mary Dabre
571
2
0
07 Nov 2024
Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody?
Conference on Machine Translation (WMT), 2024
Ioannis Tsiamas
Matthias Sperber
Andrew Finch
Sarthak Garg
195
8
0
31 Oct 2024
CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation
Spoken Language Technology Workshop (SLT), 2024
Rui Zhao
Jinyu Li
Ruchao Fan
Matt Post
211
2
0
07 Oct 2024
Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Siqi Li
Danni Liu
Jan Niehues
309
4
0
13 Sep 2024
Lightweight Audio Segmentation for Long-form Speech Translation
Interspeech (Interspeech), 2024
Jaesong Lee
Soyoon Kim
Hanbyul Kim
Joon Son Chung
221
2
0
15 Jun 2024
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation
Peidong Wang
Jian Xue
Jinyu Li
Junkun Chen
Aswin Shanmugam Subramanian
265
0
0
12 Jun 2024
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation
Chenyang Le
Yao Qian
Dongmei Wang
Long Zhou
Shujie Liu
...
Midia Yousefi
Yanmin Qian
Jinyu Li
Sheng Zhao
Michael Zeng
385
15
0
28 May 2024
SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Marco Gaido
Sara Papi
Matteo Negri
Mauro Cettolo
L. Bentivogli
262
2
0
17 May 2024
Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
Marco Gaido
Sara Papi
Matteo Negri
L. Bentivogli
518
28
0
19 Feb 2024
Pushing the Limits of Zero-shot End-to-End Speech Translation
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
362
15
0
16 Feb 2024
A Case Study on Filtering for End-to-End Speech Translation
Md Mahfuz Ibn Alam
Antonios Anastasopoulos
230
1
0
02 Feb 2024
Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases
Giulio Zhou
Tsz Kin Lam
Alexandra Birch
Barry Haddow
196
9
0
01 Feb 2024
Towards a Deep Understanding of Multilingual End-to-End Speech Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haoran Sun
Xiaohu Zhao
Yikun Lei
Shaolin Zhu
Deyi Xiong
238
8
0
31 Oct 2023
Long-form Simultaneous Speech Translation: Thesis Proposal
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Peter Polák
3DV
282
3
0
17 Oct 2023
Long-Form End-to-End Speech Translation via Latent Alignment Segmentation
Spoken Language Technology Workshop (SLT), 2023
Peter Polák
Ondrej Bojar
296
6
0
20 Sep 2023
DiariST: Streaming Speech Translation with Speaker Diarization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muqiao Yang
Naoyuki Kanda
Xiaofei Wang
Junkun Chen
Peidong Wang
Jian Xue
Jinyu Li
Takuya Yoshioka
292
7
0
14 Sep 2023
On decoder-only architecture for speech-to-text and large language model integration
Automatic Speech Recognition & Understanding (ASRU), 2023
Jian Wu
Yashesh Gaur
Zhuo Chen
Long Zhou
Yilun Zhu
...
Jinyu Li
Shujie Liu
Bo Ren
Linquan Liu
Yu-Huan Wu
AuLLM
649
204
0
08 Jul 2023
Recent Advances in Direct Speech-to-text Translation
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Chen Xu
Rong Ye
Qianqian Dong
Chengqi Zhao
Tom Ko
Mingxuan Wang
Tong Xiao
Jingbo Zhu
377
34
0
20 Jun 2023
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23
International Workshop on Spoken Language Translation (IWSLT), 2023
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
OT
269
3
0
02 Jun 2023
Robustness of Multi-Source MT to Transcription Errors
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Dominik Machávcek
Peter Polák
Ondrej Bojar
Mary Dabre
261
4
0
26 May 2023
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Neural Information Processing Systems (NeurIPS), 2023
Chenyang Le
Yao Qian
Long Zhou
Shujie Liu
Yanmin Qian
Michael Zeng
Xuedong Huang
338
20
0
24 May 2023
Understanding and Bridging the Modality Gap for Speech Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Qingkai Fang
Yang Feng
361
31
0
15 May 2023
Selective Data Augmentation for Robust Speech Translation
R. Acharya
Ashish Panda
Sunil Kumar Kopparapu
148
0
0
22 Mar 2023
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
351
6
0
19 Dec 2022
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Automatic Speech Recognition & Understanding (ASRU), 2022
Jian Xue
Peidong Wang
Jinyu Li
Eric Sun
271
12
0
04 Nov 2022
Efficient Speech Translation with Dynamic Latent Perceivers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
273
4
0
28 Oct 2022
Does Joint Training Really Help Cascaded Speech Translation?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Viet Anh Khoa Tran
David Thulke
Yingbo Gao
Christian Herold
Hermann Ney
331
6
0
24 Oct 2022
Towards Relation Extraction From Speech
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tongtong Wu
Guitao Wang
Jinming Zhao
Zhaoran Liu
Guilin Qi
Yuan-Fang Li
Gholamreza Haffari
251
13
0
17 Oct 2022
Generating Synthetic Speech from SpokenVocab for Speech Translation
Findings (Findings), 2022
Jinming Zhao
Gholamreza Haffar
Ehsan Shareghi
224
8
0
15 Oct 2022
Direct Speech Translation for Automatic Subtitling
Transactions of the Association for Computational Linguistics (TACL), 2022
Sara Papi
Marco Gaido
Alina Karakanta
Mauro Cettolo
Matteo Negri
Marco Turchi
276
19
0
27 Sep 2022
A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception
Keenan I. Jones
Enes ALTUNCU
V. N. Franqueira
Yi-Chia Wang
Shujun Li
DeLMO
263
4
0
11 Aug 2022
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation
Interspeech (Interspeech), 2022
L. T. Nguyen
Nguyen Luong Tran
Long Doan
Manh Luong
Dat Quoc Nguyen
203
5
0
08 Aug 2022
M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation
Interspeech (Interspeech), 2022
Jinming Zhao
Haomiao Yang
Ehsan Shareghi
Gholamreza Haffari
263
22
0
03 Jul 2022
Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Gerard Sant
Gerard I. Gállego
Belen Alastruey
Marta R. Costa-jussá
219
4
0
14 May 2022
Joint Generation of Captions and Subtitles with Dual Decoding
International Workshop on Spoken Language Translation (IWSLT), 2022
Jitao Xu
François Buet
Josep Crego
Elise Bertin-Lemée
François Yvon
184
9
0
13 May 2022
LibriS2S: A German-English Speech-to-Speech Translation Corpus
International Conference on Language Resources and Evaluation (LREC), 2022
Pedro Jeuris
Jan Niehues
AuLLM
197
7
0
22 Apr 2022
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers
Interspeech (Interspeech), 2022
Jian Xue
Peidong Wang
Jinyu Li
Matt Post
Yashesh Gaur
AI4TS
331
36
0
11 Apr 2022
1
2
Next
Page 1 of 2