ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.08581
  4. Cited By
Sequence-to-Sequence Models Can Directly Translate Foreign Speech

Sequence-to-Sequence Models Can Directly Translate Foreign Speech

24 March 2017
Ron J. Weiss
J. Chorowski
Navdeep Jaitly
Yonghui Wu
Zhehuai Chen
ArXivPDFHTML

Papers citing "Sequence-to-Sequence Models Can Directly Translate Foreign Speech"

50 / 204 papers shown
Title
CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus
CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus
Changhan Wang
J. Pino
Anne Wu
Jiatao Gu
SLR
36
82
0
04 Feb 2020
From Speech-to-Speech Translation to Automatic Dubbing
From Speech-to-Speech Translation to Automatic Dubbing
Marcello Federico
Robert Enyedi
Roberto Barra-Chicote
Ritwik Giri
Umut Isik
A. Krishnaswamy
Hassan Sawaf
29
41
0
19 Jan 2020
Synchronous Speech Recognition and Speech-to-Text Translation with
  Interactive Decoding
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding
Yuchen Liu
Jiajun Zhang
Hao Xiong
Long Zhou
Zhongjun He
Hua Wu
Haifeng Wang
Chengqing Zong
26
70
0
16 Dec 2019
Multimodal Machine Translation through Visuals and Speech
Multimodal Machine Translation through Visuals and Speech
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
49
73
0
28 Nov 2019
On Using SpecAugment for End-to-End Speech Translation
On Using SpecAugment for End-to-End Speech Translation
Parnia Bahar
Albert Zeyer
Ralf Schluter
Hermann Ney
17
53
0
20 Nov 2019
A Comparative Study on End-to-end Speech to Text Translation
A Comparative Study on End-to-end Speech to Text Translation
Parnia Bahar
Tobias Bieschke
Hermann Ney
20
78
0
20 Nov 2019
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic
  Meta-Learning
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning
Sathish Indurthi
HyoJung Han
Nikhil Kumar Lakumarapu
Beomseok Lee
Insoo Chung
Sangha Kim
Chanwoo Kim
22
26
0
11 Nov 2019
Europarl-ST: A Multilingual Corpus For Speech Translation Of
  Parliamentary Debates
Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates
Javier Iranzo-Sánchez
J. Silvestre-Cerdà
Javier Jorge
Nahuel Roselló
Adria Giménez
A. Sanchís
Jorge Civera Saiz
Alfons Juan-Císcar
19
180
0
08 Nov 2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT
  2019 Shared Task
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task
H. Nguyen
N. Tomashenko
Marcely Zanon Boito
Antoine Caubrière
Fethi Bougares
Mickael Rouvier
Laurent Besacier
Yannick Esteve
20
8
0
30 Oct 2019
Analyzing ASR pretraining for low-resource speech-to-text translation
Analyzing ASR pretraining for low-resource speech-to-text translation
Mihaela C. Stoian
Sameer Bansal
Sharon Goldwater
11
63
0
23 Oct 2019
Instance-Based Model Adaptation For Direct Speech Translation
Instance-Based Model Adaptation For Direct Speech Translation
Mattia Antonino Di Gangi
V. Nguyen
Matteo Negri
Marco Turchi
19
11
0
23 Oct 2019
LibriVoxDeEn: A Corpus for German-to-English Speech Translation and
  German Speech Recognition
LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition
Benjamin Beilharz
Xin Sun
Sariya Karimova
Stefan Riezler
8
28
0
17 Oct 2019
One-To-Many Multilingual End-to-end Speech Translation
One-To-Many Multilingual End-to-end Speech Translation
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
33
50
0
08 Oct 2019
Speech-to-speech Translation between Untranscribed Unknown Languages
Speech-to-speech Translation between Untranscribed Unknown Languages
Andros Tjandra
S. Sakti
Satoshi Nakamura
14
49
0
02 Oct 2019
Multilingual End-to-End Speech Translation
Multilingual End-to-End Speech Translation
Hirofumi Inaguma
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
LRM
28
86
0
01 Oct 2019
Breaking the Data Barrier: Towards Robust Speech Translation via
  Adversarial Stability Training
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training
Qiao Cheng
Meiyuan Fang
Yaqian Han
Jin Huang
Yitao Duan
25
16
0
25 Sep 2019
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End
  Speech Translation
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation
Chengyi Wang
Yu-Huan Wu
Shujie Liu
Zhenglu Yang
M. Zhou
18
83
0
17 Sep 2019
Harnessing Indirect Training Data for End-to-End Automatic Speech
  Translation: Tricks of the Trade
Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade
J. Pino
Liezl Puzon
Jiatao Gu
Xutai Ma
Arya D. McCarthy
D. Gopinath
23
3
0
14 Sep 2019
A Comparative Study on Transformer vs RNN in Speech Applications
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
37
716
0
13 Sep 2019
Cross-lingual topic prediction for speech using translations
Cross-lingual topic prediction for speech using translations
Sameer Bansal
Herman Kamper
Adam Lopez
Sharon Goldwater
6
1
0
29 Aug 2019
DuTongChuan: Context-aware Translation Model for Simultaneous
  Interpreting
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting
Hao Xiong
Ruiqing Zhang
Chuanqiang Zhang
Zhongjun He
Hua Wu
Haifeng Wang
41
25
0
30 Jul 2019
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken
  Utterances Extracted from the Bible
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
Marcely Zanon Boito
William N. Havard
Mahault Garnerin
Éric Le Ferrand
Laurent Besacier
32
47
0
30 Jul 2019
Curriculum-based transfer learning for an effective end-to-end spoken
  language understanding and domain portability
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
18
54
0
18 Jun 2019
Exploring Phoneme-Level Speech Representations for End-to-End Speech
  Translation
Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Elizabeth Salesky
Matthias Sperber
A. Black
6
35
0
04 Jun 2019
Fluent Translations from Disfluent Speech in End-to-End Speech
  Translation
Fluent Translations from Disfluent Speech in End-to-End Speech Translation
Elizabeth Salesky
Matthias Sperber
A. Waibel
19
33
0
03 Jun 2019
End-to-End Speech Translation with Knowledge Distillation
End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu
Hao Xiong
Zhongjun He
Jiajun Zhang
Hua Wu
Haifeng Wang
Chengqing Zong
32
151
0
17 Apr 2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech
  Translation
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation
Matthias Sperber
Graham Neubig
Jan Niehues
A. Waibel
14
101
0
15 Apr 2019
Direct speech-to-speech translation with a sequence-to-sequence model
Direct speech-to-speech translation with a sequence-to-sequence model
Ye Jia
Ron J. Weiss
Fadi Biadsy
Wolfgang Macherey
Melvin Johnson
Zhehuai Chen
Yonghui Wu
21
223
0
12 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its
  Applications to Hearing-Impaired Speech and Speech Separation
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
27
113
0
08 Apr 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence
  Modeling
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhehuai Chen
Mengzhao Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
27
209
0
21 Feb 2019
KINN: Incorporating Expert Knowledge in Neural Networks
KINN: Incorporating Expert Knowledge in Neural Networks
M. A. Chattha
Shoaib Ahmed Siddiqui
M. I. Malik
L. V. Elst
Andreas Dengel
Sheraz Ahmed
6
6
0
15 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech
  Recognition
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
12
37
0
05 Feb 2019
The USTC-NEL Speech Translation system at IWSLT 2018
The USTC-NEL Speech Translation system at IWSLT 2018
Dan Liu
Junhua Liu
Wu Guo
Shifu Xiong
Zhiqiang Ma
Rui Song
Chongliang Wu
Quan Liu
21
18
0
06 Dec 2018
Improving Clinical Predictions through Unsupervised Time Series
  Representation Learning
Improving Clinical Predictions through Unsupervised Time Series Representation Learning
Xinrui Lyu
Matthias Huser
Stephanie L. Hyland
George Zerveas
Gunnar Rätsch
SSL
OOD
AI4TS
17
43
0
02 Dec 2018
Towards Fluent Translations from Disfluent Speech
Towards Fluent Translations from Disfluent Speech
Mingming Wang
Lin-Ming Gong
Jan Niehues
Lian-He Shao
12
24
0
07 Nov 2018
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text
  Translation
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung-Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
31
159
0
05 Nov 2018
Towards Unsupervised Speech-to-Text Translation
Towards Unsupervised Speech-to-Text Translation
Yu-An Chung
W. Weng
S. Tong
James R. Glass
42
42
0
04 Nov 2018
How2: A Large-scale Dataset for Multimodal Language Understanding
How2: A Large-scale Dataset for Multimodal Language Understanding
Ramon Sanabria
Ozan Caglayan
Shruti Palaskar
Desmond Elliott
Loïc Barrault
Lucia Specia
Florian Metze
VGen
MLLM
26
287
0
01 Nov 2018
Fine-tuning on Clean Data for End-to-End Speech Translation: FBK @ IWSLT
  2018
Fine-tuning on Clean Data for End-to-End Speech Translation: FBK @ IWSLT 2018
Mattia Antonino Di Gangi
Roberto Dessì
R. Cattoni
Matteo Negri
Marco Turchi
26
10
0
16 Oct 2018
Indicatements that character language models learn English
  morpho-syntactic units and regularities
Indicatements that character language models learn English morpho-syntactic units and regularities
Yova Kementchedjhieva
Adam Lopez
16
10
0
31 Aug 2018
A small Griko-Italian speech translation corpus
A small Griko-Italian speech translation corpus
Marcely Zanon Boito
Antonios Anastasopoulos
M. Lekakou
Aline Villavicencio
Laurent Besacier
26
11
0
27 Jul 2018
Unsupervised Word Segmentation from Speech with Attention
Unsupervised Word Segmentation from Speech with Attention
Pierre Godard
Marcely Zanon Boito
Lucas Ondel
Alexandre Berard
François Yvon
Aline Villavicencio
Laurent Besacier
24
27
0
18 Jun 2018
Visually grounded cross-lingual keyword spotting in speech
Visually grounded cross-lingual keyword spotting in speech
Herman Kamper
Michael Roth
19
34
0
13 Jun 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of
  Untranscribed Speech
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath
Galen Chuang
James R. Glass
20
58
0
09 Apr 2018
Low-Resource Speech-to-Text Translation
Low-Resource Speech-to-Text Translation
Sameer Bansal
Herman Kamper
Karen Livescu
Adam Lopez
Sharon Goldwater
21
56
0
24 Mar 2018
Leveraging translations for speech transcription in low-resource
  settings
Leveraging translations for speech transcription in low-resource settings
Antonios Anastasopoulos
David Chiang
14
26
0
23 Mar 2018
Gender Aware Spoken Language Translation Applied to English-Arabic
Gender Aware Spoken Language Translation Applied to English-Arabic
M. Elaraby
Ahmed Y. Tawfik
Mahmoud Khaled
Hany Hassan
Aly Osama
27
40
0
26 Feb 2018
Linguistic unit discovery from multi-modal inputs in unwritten
  languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop
O. Scharenborg
Laurent Besacier
A. Black
M. Hasegawa-Johnson
Florian Metze
...
Elin Larsen
Danny Merkx
Rachid Riad
Liming Wang
Emmanuel Dupoux
28
33
0
14 Feb 2018
Augmenting Librispeech with French Translations: A Multimodal Corpus for
  Direct Speech Translation Evaluation
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
A. Kocabiyikoglu
Laurent Besacier
Olivier Kraif
14
104
0
09 Feb 2018
$A^{4}NT$: Author Attribute Anonymity by Adversarial Training of Neural
  Machine Translation
A4NTA^{4}NTA4NT: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation
Rakshith Shetty
Bernt Schiele
Mario Fritz
44
95
0
06 Nov 2017
Previous
12345
Next