Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.08581
Cited By
Sequence-to-Sequence Models Can Directly Translate Foreign Speech
24 March 2017
Ron J. Weiss
J. Chorowski
Navdeep Jaitly
Yonghui Wu
Zhehuai Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequence-to-Sequence Models Can Directly Translate Foreign Speech"
50 / 204 papers shown
Title
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation
Shun-Po Chuang
Yung-Sung Chuang
Chih-Chiang Chang
Hung-yi Lee
34
25
0
11 May 2021
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021
Gerard I. Gállego
Ioannis Tsiamas
Carlos Escolano
José A. R. Fonollosa
Marta R. Costa-jussá
31
30
0
10 May 2021
Learning Shared Semantic Space for Speech-to-Text Translation
Chi Han
Mingxuan Wang
Heng Ji
Lei Li
18
76
0
07 May 2021
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Siddharth Dalmia
Brian Yan
Vikas Raunak
Florian Metze
Shinji Watanabe
47
30
0
02 May 2021
AlloST: Low-resource Speech Translation without Source Transcription
Yao-Fei Cheng
Hung-Shin Lee
Hsin-Min Wang
27
8
0
01 May 2021
Segmenting Subtitles for Correcting ASR Segmentation Errors
David Wan
Chris Kedzie
Faisal Ladhak
Elsbeth Turcan
P. Galuscáková
Elena Zotkina
Zhengping Jiang
P. Bell
Kathleen McKeown
10
4
0
16 Apr 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
33
44
0
14 Apr 2021
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Hirofumi Inaguma
Tatsuya Kawahara
Shinji Watanabe
31
42
0
13 Apr 2021
BSTC: A Large-Scale Chinese-English Speech Translation Dataset
Ruiqing Zhang
Xiyang Wang
Chuanqiang Zhang
Zhongjun He
Hua Wu
Zhi Li
Haifeng Wang
Ying-Cong Chen
Qinfei Li
25
39
0
08 Apr 2021
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation
Renjie Zheng
Junkun Chen
Mingbo Ma
Liang Huang
36
69
0
10 Feb 2021
The Multilingual TEDx Corpus for Speech Recognition and Translation
Elizabeth Salesky
Sanjeev Khudanpur
Jacob Bremerman
R. Cattoni
Matteo Negri
Marco Turchi
Douglas W. Oard
Matt Post
22
119
0
02 Feb 2021
CTC-based Compression for Direct Speech Translation
Marco Gaido
Mauro Cettolo
Matteo Negri
Marco Turchi
22
57
0
02 Feb 2021
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
30
32
0
18 Dec 2020
Towards localisation of keywords in speech using weak supervision
Kayode Olaleye
Benjamin van Niekerk
Herman Kamper
24
5
0
14 Dec 2020
On Knowledge Distillation for Direct Speech Translation
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
32
14
0
09 Dec 2020
Breeding Gender-aware Direct Speech Translation Systems
Marco Gaido
Beatrice Savoldi
L. Bentivogli
Matteo Negri
Marco Turchi
48
20
0
09 Dec 2020
Tight Integrated End-to-End Training for Cascaded Speech Translation
Parnia Bahar
Tobias Bieschke
Ralf Schluter
Hermann Ney
47
26
0
24 Nov 2020
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Hang Le
J. Pino
Changhan Wang
Jiatao Gu
D. Schwab
Laurent Besacier
39
82
0
02 Nov 2020
Bridging the Modality Gap for Speech-to-Text Translation
Yuchen Liu
Junnan Zhu
Jiajun Zhang
Chengqing Zong
18
65
0
28 Oct 2020
Evaluating Gender Bias in Speech Translation
Marta R. Costa-jussá
Christine Basta
Gerard I. Gállego
32
21
0
27 Oct 2020
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Hirofumi Inaguma
Yosuke Higuchi
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
24
22
0
25 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
21
6
0
24 Oct 2020
A Technical Report: BUT Speech Translation Systems
Hari Krishna Vydana
L. Burget
J. Černocký
24
0
0
22 Oct 2020
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Junkun Chen
Mingbo Ma
Renjie Zheng
Liang Huang
19
21
0
22 Oct 2020
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Yun Tang
J. Pino
Changhan Wang
Xutai Ma
Dmitriy Genzel
26
73
0
21 Oct 2020
Cascaded Models With Cyclic Feedback For Direct Speech Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
38
12
0
21 Oct 2020
Adaptive Feature Selection for End-to-End Speech Translation
Biao Zhang
Ivan Titov
Barry Haddow
Rico Sennrich
13
40
0
16 Oct 2020
Textual Supervision for Visually Grounded Spoken Language Understanding
Bertrand Higy
Desmond Eliott
Grzegorz Chrupała
15
10
0
06 Oct 2020
Consecutive Decoding for Speech-to-text Translation
Qianqian Dong
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
SLR
39
40
0
21 Sep 2020
"Listen, Understand and Translate": Triple Supervision Decouples End-to-end Speech-to-text Translation
Qianqian Dong
Rong Ye
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
43
3
0
21 Sep 2020
Video captioning with stacked attention and semantic hard pull
Md. Mushfiqur Rahman
Thasinul Abedin
Khondokar S. S. Prottoy
Ayana Moshruba
Fazlul Hasan Siddiqui
27
2
0
15 Sep 2020
On Target Segmentation for Direct Speech Translation
Mattia Antonino Di Gangi
Marco Gaido
Matteo Negri
Marco Turchi
37
14
0
10 Sep 2020
Convolutional Speech Recognition with Pitch and Voice Quality Features
Guillermo Cámbara
Jordi Luque
Mireia Farrús
11
8
0
02 Sep 2020
Contextualized Translation of Automatically Segmented Speech
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Mauro Cettolo
Marco Turchi
25
18
0
05 Aug 2020
Consistent Transcription and Translation of Speech
Matthias Sperber
Hendra Setiawan
Christian Gollan
Udhyakumar Nallasamy
Matthias Paulik
31
18
0
24 Jul 2020
Self-Supervised Representations Improve End-to-End Speech Translation
Anne Wu
Changhan Wang
J. Pino
Jiatao Gu
SSL
25
40
0
22 Jun 2020
UWSpeech: Speech to Speech Translation for Unwritten Languages
Chen Zhang
Xu Tan
Yi Ren
Tao Qin
Ke-jun Zhang
Tie-Yan Liu
17
53
0
14 Jun 2020
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Changhan Wang
J. Pino
Jiatao Gu
17
30
0
09 Jun 2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
19
53
0
04 Jun 2020
Self-Training for End-to-End Speech Translation
J. Pino
Qiantong Xu
Xutai Ma
M. Dousti
Yun Tang
33
59
0
03 Jun 2020
Phone Features Improve Speech Translation
Elizabeth Salesky
A. Black
30
27
0
27 May 2020
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation
Shun-Po Chuang
Tzu-Wei Sung
Alexander H. Liu
Hung-yi Lee
18
19
0
21 May 2020
Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Baiji Liu
Songjun Cao
Sining Sun
Weibin Zhang
Long Ma
23
9
0
01 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
42
161
0
21 Apr 2020
Curriculum Pre-training for End-to-End Speech Translation
Chengyi Wang
Yu Wu
Shujie Liu
Ming Zhou
Zhenglu Yang
21
108
0
21 Apr 2020
Neural Machine Translation: Challenges, Progress and Future
Jiajun Zhang
Chengqing Zong
26
52
0
13 Apr 2020
Unmet Needs and Opportunities for Mobile Translation AI
Susanne Putze
Michael Bonfert
Pitt Michelmann
Sebastian Höffner
Dirk Wenig
Rainer Malaka
Jan David Smeddinck
16
80
0
27 Feb 2020
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
Arya D. McCarthy
Liezl Puzon
J. Pino
33
24
0
27 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
27
138
0
18 Feb 2020
A Data Efficient End-To-End Spoken Language Understanding Architecture
Marco Dinarelli
Nikita Kapoor
Bassam Jabaian
Laurent Besacier
3DV
17
20
0
14 Feb 2020
Previous
1
2
3
4
5
Next