Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.08581
Cited By
Sequence-to-Sequence Models Can Directly Translate Foreign Speech
24 March 2017
Ron J. Weiss
J. Chorowski
Navdeep Jaitly
Yonghui Wu
Zhehuai Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequence-to-Sequence Models Can Directly Translate Foreign Speech"
50 / 204 papers shown
Title
CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus
Changhan Wang
J. Pino
Anne Wu
Jiatao Gu
SLR
36
82
0
04 Feb 2020
From Speech-to-Speech Translation to Automatic Dubbing
Marcello Federico
Robert Enyedi
Roberto Barra-Chicote
Ritwik Giri
Umut Isik
A. Krishnaswamy
Hassan Sawaf
29
41
0
19 Jan 2020
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding
Yuchen Liu
Jiajun Zhang
Hao Xiong
Long Zhou
Zhongjun He
Hua Wu
Haifeng Wang
Chengqing Zong
26
70
0
16 Dec 2019
Multimodal Machine Translation through Visuals and Speech
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
49
73
0
28 Nov 2019
On Using SpecAugment for End-to-End Speech Translation
Parnia Bahar
Albert Zeyer
Ralf Schluter
Hermann Ney
17
53
0
20 Nov 2019
A Comparative Study on End-to-end Speech to Text Translation
Parnia Bahar
Tobias Bieschke
Hermann Ney
20
78
0
20 Nov 2019
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning
Sathish Indurthi
HyoJung Han
Nikhil Kumar Lakumarapu
Beomseok Lee
Insoo Chung
Sangha Kim
Chanwoo Kim
22
26
0
11 Nov 2019
Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates
Javier Iranzo-Sánchez
J. Silvestre-Cerdà
Javier Jorge
Nahuel Roselló
Adria Giménez
A. Sanchís
Jorge Civera Saiz
Alfons Juan-Císcar
19
180
0
08 Nov 2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task
H. Nguyen
N. Tomashenko
Marcely Zanon Boito
Antoine Caubrière
Fethi Bougares
Mickael Rouvier
Laurent Besacier
Yannick Esteve
20
8
0
30 Oct 2019
Analyzing ASR pretraining for low-resource speech-to-text translation
Mihaela C. Stoian
Sameer Bansal
Sharon Goldwater
11
63
0
23 Oct 2019
Instance-Based Model Adaptation For Direct Speech Translation
Mattia Antonino Di Gangi
V. Nguyen
Matteo Negri
Marco Turchi
19
11
0
23 Oct 2019
LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition
Benjamin Beilharz
Xin Sun
Sariya Karimova
Stefan Riezler
8
28
0
17 Oct 2019
One-To-Many Multilingual End-to-end Speech Translation
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
33
50
0
08 Oct 2019
Speech-to-speech Translation between Untranscribed Unknown Languages
Andros Tjandra
S. Sakti
Satoshi Nakamura
14
49
0
02 Oct 2019
Multilingual End-to-End Speech Translation
Hirofumi Inaguma
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
LRM
28
86
0
01 Oct 2019
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training
Qiao Cheng
Meiyuan Fang
Yaqian Han
Jin Huang
Yitao Duan
25
16
0
25 Sep 2019
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation
Chengyi Wang
Yu-Huan Wu
Shujie Liu
Zhenglu Yang
M. Zhou
18
83
0
17 Sep 2019
Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade
J. Pino
Liezl Puzon
Jiatao Gu
Xutai Ma
Arya D. McCarthy
D. Gopinath
23
3
0
14 Sep 2019
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
37
716
0
13 Sep 2019
Cross-lingual topic prediction for speech using translations
Sameer Bansal
Herman Kamper
Adam Lopez
Sharon Goldwater
6
1
0
29 Aug 2019
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting
Hao Xiong
Ruiqing Zhang
Chuanqiang Zhang
Zhongjun He
Hua Wu
Haifeng Wang
41
25
0
30 Jul 2019
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
Marcely Zanon Boito
William N. Havard
Mahault Garnerin
Éric Le Ferrand
Laurent Besacier
32
47
0
30 Jul 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
18
54
0
18 Jun 2019
Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Elizabeth Salesky
Matthias Sperber
A. Black
6
35
0
04 Jun 2019
Fluent Translations from Disfluent Speech in End-to-End Speech Translation
Elizabeth Salesky
Matthias Sperber
A. Waibel
19
33
0
03 Jun 2019
End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu
Hao Xiong
Zhongjun He
Jiajun Zhang
Hua Wu
Haifeng Wang
Chengqing Zong
32
151
0
17 Apr 2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation
Matthias Sperber
Graham Neubig
Jan Niehues
A. Waibel
14
101
0
15 Apr 2019
Direct speech-to-speech translation with a sequence-to-sequence model
Ye Jia
Ron J. Weiss
Fadi Biadsy
Wolfgang Macherey
Melvin Johnson
Zhehuai Chen
Yonghui Wu
21
223
0
12 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
27
113
0
08 Apr 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhehuai Chen
Mengzhao Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
27
209
0
21 Feb 2019
KINN: Incorporating Expert Knowledge in Neural Networks
M. A. Chattha
Shoaib Ahmed Siddiqui
M. I. Malik
L. V. Elst
Andreas Dengel
Sheraz Ahmed
6
6
0
15 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
12
37
0
05 Feb 2019
The USTC-NEL Speech Translation system at IWSLT 2018
Dan Liu
Junhua Liu
Wu Guo
Shifu Xiong
Zhiqiang Ma
Rui Song
Chongliang Wu
Quan Liu
21
18
0
06 Dec 2018
Improving Clinical Predictions through Unsupervised Time Series Representation Learning
Xinrui Lyu
Matthias Huser
Stephanie L. Hyland
George Zerveas
Gunnar Rätsch
SSL
OOD
AI4TS
17
43
0
02 Dec 2018
Towards Fluent Translations from Disfluent Speech
Mingming Wang
Lin-Ming Gong
Jan Niehues
Lian-He Shao
12
24
0
07 Nov 2018
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung-Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
31
159
0
05 Nov 2018
Towards Unsupervised Speech-to-Text Translation
Yu-An Chung
W. Weng
S. Tong
James R. Glass
42
42
0
04 Nov 2018
How2: A Large-scale Dataset for Multimodal Language Understanding
Ramon Sanabria
Ozan Caglayan
Shruti Palaskar
Desmond Elliott
Loïc Barrault
Lucia Specia
Florian Metze
VGen
MLLM
26
287
0
01 Nov 2018
Fine-tuning on Clean Data for End-to-End Speech Translation: FBK @ IWSLT 2018
Mattia Antonino Di Gangi
Roberto Dessì
R. Cattoni
Matteo Negri
Marco Turchi
26
10
0
16 Oct 2018
Indicatements that character language models learn English morpho-syntactic units and regularities
Yova Kementchedjhieva
Adam Lopez
16
10
0
31 Aug 2018
A small Griko-Italian speech translation corpus
Marcely Zanon Boito
Antonios Anastasopoulos
M. Lekakou
Aline Villavicencio
Laurent Besacier
26
11
0
27 Jul 2018
Unsupervised Word Segmentation from Speech with Attention
Pierre Godard
Marcely Zanon Boito
Lucas Ondel
Alexandre Berard
François Yvon
Aline Villavicencio
Laurent Besacier
24
27
0
18 Jun 2018
Visually grounded cross-lingual keyword spotting in speech
Herman Kamper
Michael Roth
19
34
0
13 Jun 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath
Galen Chuang
James R. Glass
20
58
0
09 Apr 2018
Low-Resource Speech-to-Text Translation
Sameer Bansal
Herman Kamper
Karen Livescu
Adam Lopez
Sharon Goldwater
21
56
0
24 Mar 2018
Leveraging translations for speech transcription in low-resource settings
Antonios Anastasopoulos
David Chiang
14
26
0
23 Mar 2018
Gender Aware Spoken Language Translation Applied to English-Arabic
M. Elaraby
Ahmed Y. Tawfik
Mahmoud Khaled
Hany Hassan
Aly Osama
27
40
0
26 Feb 2018
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop
O. Scharenborg
Laurent Besacier
A. Black
M. Hasegawa-Johnson
Florian Metze
...
Elin Larsen
Danny Merkx
Rachid Riad
Liming Wang
Emmanuel Dupoux
28
33
0
14 Feb 2018
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
A. Kocabiyikoglu
Laurent Besacier
Olivier Kraif
14
104
0
09 Feb 2018
A
4
N
T
A^{4}NT
A
4
NT
: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation
Rakshith Shetty
Bernt Schiele
Mario Fritz
44
95
0
06 Nov 2017
Previous
1
2
3
4
5
Next