ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.08581
  4. Cited By
Sequence-to-Sequence Models Can Directly Translate Foreign Speech

Sequence-to-Sequence Models Can Directly Translate Foreign Speech

24 March 2017
Ron J. Weiss
J. Chorowski
Navdeep Jaitly
Yonghui Wu
Zhehuai Chen
ArXivPDFHTML

Papers citing "Sequence-to-Sequence Models Can Directly Translate Foreign Speech"

50 / 204 papers shown
Title
Investigating the Reordering Capability in CTC-based Non-Autoregressive
  End-to-End Speech Translation
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation
Shun-Po Chuang
Yung-Sung Chuang
Chih-Chiang Chang
Hung-yi Lee
34
25
0
11 May 2021
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC
  at IWSLT 2021
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021
Gerard I. Gállego
Ioannis Tsiamas
Carlos Escolano
José A. R. Fonollosa
Marta R. Costa-jussá
31
30
0
10 May 2021
Learning Shared Semantic Space for Speech-to-Text Translation
Learning Shared Semantic Space for Speech-to-Text Translation
Chi Han
Mingxuan Wang
Heng Ji
Lei Li
18
76
0
07 May 2021
Searchable Hidden Intermediates for End-to-End Models of Decomposable
  Sequence Tasks
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Siddharth Dalmia
Brian Yan
Vikas Raunak
Florian Metze
Shinji Watanabe
47
30
0
02 May 2021
AlloST: Low-resource Speech Translation without Source Transcription
AlloST: Low-resource Speech Translation without Source Transcription
Yao-Fei Cheng
Hung-Shin Lee
Hsin-Min Wang
27
8
0
01 May 2021
Segmenting Subtitles for Correcting ASR Segmentation Errors
Segmenting Subtitles for Correcting ASR Segmentation Errors
David Wan
Chris Kedzie
Faisal Ladhak
Elsbeth Turcan
P. Galuscáková
Elena Zotkina
Zhengping Jiang
P. Bell
Kathleen McKeown
10
4
0
16 Apr 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
33
44
0
14 Apr 2021
Source and Target Bidirectional Knowledge Distillation for End-to-end
  Speech Translation
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Hirofumi Inaguma
Tatsuya Kawahara
Shinji Watanabe
31
42
0
13 Apr 2021
BSTC: A Large-Scale Chinese-English Speech Translation Dataset
BSTC: A Large-Scale Chinese-English Speech Translation Dataset
Ruiqing Zhang
Xiyang Wang
Chuanqiang Zhang
Zhongjun He
Hua Wu
Zhi Li
Haifeng Wang
Ying-Cong Chen
Qinfei Li
25
39
0
08 Apr 2021
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining
  and Speech Translation
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation
Renjie Zheng
Junkun Chen
Mingbo Ma
Liang Huang
36
69
0
10 Feb 2021
The Multilingual TEDx Corpus for Speech Recognition and Translation
The Multilingual TEDx Corpus for Speech Recognition and Translation
Elizabeth Salesky
Sanjeev Khudanpur
Jacob Bremerman
R. Cattoni
Matteo Negri
Marco Turchi
Douglas W. Oard
Matt Post
22
119
0
02 Feb 2021
CTC-based Compression for Direct Speech Translation
CTC-based Compression for Direct Speech Translation
Marco Gaido
Mauro Cettolo
Matteo Negri
Marco Turchi
22
57
0
02 Feb 2021
NeurST: Neural Speech Translation Toolkit
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
30
32
0
18 Dec 2020
Towards localisation of keywords in speech using weak supervision
Towards localisation of keywords in speech using weak supervision
Kayode Olaleye
Benjamin van Niekerk
Herman Kamper
24
5
0
14 Dec 2020
On Knowledge Distillation for Direct Speech Translation
On Knowledge Distillation for Direct Speech Translation
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
32
14
0
09 Dec 2020
Breeding Gender-aware Direct Speech Translation Systems
Breeding Gender-aware Direct Speech Translation Systems
Marco Gaido
Beatrice Savoldi
L. Bentivogli
Matteo Negri
Marco Turchi
48
20
0
09 Dec 2020
Tight Integrated End-to-End Training for Cascaded Speech Translation
Tight Integrated End-to-End Training for Cascaded Speech Translation
Parnia Bahar
Tobias Bieschke
Ralf Schluter
Hermann Ney
47
26
0
24 Nov 2020
Dual-decoder Transformer for Joint Automatic Speech Recognition and
  Multilingual Speech Translation
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Hang Le
J. Pino
Changhan Wang
Jiatao Gu
D. Schwab
Laurent Besacier
39
82
0
02 Nov 2020
Bridging the Modality Gap for Speech-to-Text Translation
Bridging the Modality Gap for Speech-to-Text Translation
Yuchen Liu
Junnan Zhu
Jiajun Zhang
Chengqing Zong
18
65
0
28 Oct 2020
Evaluating Gender Bias in Speech Translation
Evaluating Gender Bias in Speech Translation
Marta R. Costa-jussá
Christine Basta
Gerard I. Gállego
32
21
0
27 Oct 2020
Orthros: Non-autoregressive End-to-end Speech Translation with
  Dual-decoder
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Hirofumi Inaguma
Yosuke Higuchi
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
24
22
0
25 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained
  Models
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
21
6
0
24 Oct 2020
A Technical Report: BUT Speech Translation Systems
A Technical Report: BUT Speech Translation Systems
Hari Krishna Vydana
L. Burget
J. Černocký
24
0
0
22 Oct 2020
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Junkun Chen
Mingbo Ma
Renjie Zheng
Liang Huang
19
21
0
22 Oct 2020
A General Multi-Task Learning Framework to Leverage Text Data for Speech
  to Text Tasks
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Yun Tang
J. Pino
Changhan Wang
Xutai Ma
Dmitriy Genzel
26
73
0
21 Oct 2020
Cascaded Models With Cyclic Feedback For Direct Speech Translation
Cascaded Models With Cyclic Feedback For Direct Speech Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
38
12
0
21 Oct 2020
Adaptive Feature Selection for End-to-End Speech Translation
Adaptive Feature Selection for End-to-End Speech Translation
Biao Zhang
Ivan Titov
Barry Haddow
Rico Sennrich
13
40
0
16 Oct 2020
Textual Supervision for Visually Grounded Spoken Language Understanding
Textual Supervision for Visually Grounded Spoken Language Understanding
Bertrand Higy
Desmond Eliott
Grzegorz Chrupała
15
10
0
06 Oct 2020
Consecutive Decoding for Speech-to-text Translation
Consecutive Decoding for Speech-to-text Translation
Qianqian Dong
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
SLR
39
40
0
21 Sep 2020
"Listen, Understand and Translate": Triple Supervision Decouples
  End-to-end Speech-to-text Translation
"Listen, Understand and Translate": Triple Supervision Decouples End-to-end Speech-to-text Translation
Qianqian Dong
Rong Ye
Mingxuan Wang
Hao Zhou
Shuang Xu
Bo Xu
Lei Li
43
3
0
21 Sep 2020
Video captioning with stacked attention and semantic hard pull
Video captioning with stacked attention and semantic hard pull
Md. Mushfiqur Rahman
Thasinul Abedin
Khondokar S. S. Prottoy
Ayana Moshruba
Fazlul Hasan Siddiqui
27
2
0
15 Sep 2020
On Target Segmentation for Direct Speech Translation
On Target Segmentation for Direct Speech Translation
Mattia Antonino Di Gangi
Marco Gaido
Matteo Negri
Marco Turchi
37
14
0
10 Sep 2020
Convolutional Speech Recognition with Pitch and Voice Quality Features
Convolutional Speech Recognition with Pitch and Voice Quality Features
Guillermo Cámbara
Jordi Luque
Mireia Farrús
11
8
0
02 Sep 2020
Contextualized Translation of Automatically Segmented Speech
Contextualized Translation of Automatically Segmented Speech
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Mauro Cettolo
Marco Turchi
25
18
0
05 Aug 2020
Consistent Transcription and Translation of Speech
Consistent Transcription and Translation of Speech
Matthias Sperber
Hendra Setiawan
Christian Gollan
Udhyakumar Nallasamy
Matthias Paulik
31
18
0
24 Jul 2020
Self-Supervised Representations Improve End-to-End Speech Translation
Self-Supervised Representations Improve End-to-End Speech Translation
Anne Wu
Changhan Wang
J. Pino
Jiatao Gu
SSL
25
40
0
22 Jun 2020
UWSpeech: Speech to Speech Translation for Unwritten Languages
UWSpeech: Speech to Speech Translation for Unwritten Languages
Chen Zhang
Xu Tan
Yi Ren
Tao Qin
Ke-jun Zhang
Tie-Yan Liu
17
53
0
14 Jun 2020
Improving Cross-Lingual Transfer Learning for End-to-End Speech
  Recognition with Speech Translation
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Changhan Wang
J. Pino
Jiatao Gu
17
30
0
09 Jun 2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
19
53
0
04 Jun 2020
Self-Training for End-to-End Speech Translation
Self-Training for End-to-End Speech Translation
J. Pino
Qiantong Xu
Xutai Ma
M. Dousti
Yun Tang
33
59
0
03 Jun 2020
Phone Features Improve Speech Translation
Phone Features Improve Speech Translation
Elizabeth Salesky
A. Black
30
27
0
27 May 2020
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in
  Multitask End-to-End Speech Translation
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation
Shun-Po Chuang
Tzu-Wei Sung
Alexander H. Liu
Hung-yi Lee
18
19
0
21 May 2020
Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Baiji Liu
Songjun Cao
Sining Sun
Weibin Zhang
Long Ma
23
9
0
01 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
42
161
0
21 Apr 2020
Curriculum Pre-training for End-to-End Speech Translation
Curriculum Pre-training for End-to-End Speech Translation
Chengyi Wang
Yu Wu
Shujie Liu
Ming Zhou
Zhenglu Yang
21
108
0
21 Apr 2020
Neural Machine Translation: Challenges, Progress and Future
Neural Machine Translation: Challenges, Progress and Future
Jiajun Zhang
Chengqing Zong
26
52
0
13 Apr 2020
Unmet Needs and Opportunities for Mobile Translation AI
Unmet Needs and Opportunities for Mobile Translation AI
Susanne Putze
Michael Bonfert
Pitt Michelmann
Sebastian Höffner
Dirk Wenig
Rainer Malaka
Jan David Smeddinck
16
80
0
27 Feb 2020
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech
  Translation
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
Arya D. McCarthy
Liezl Puzon
J. Pino
33
24
0
27 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
27
138
0
18 Feb 2020
A Data Efficient End-To-End Spoken Language Understanding Architecture
A Data Efficient End-To-End Spoken Language Understanding Architecture
Marco Dinarelli
Nikita Kapoor
Bassam Jabaian
Laurent Besacier
3DV
17
20
0
14 Feb 2020
Previous
12345
Next