1902.08295
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
21 February 2019
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhiwen Chen
Mengzhao Chen
Ye Jia
Anjuli Kannan
Tara N. Sainath
Yuan Cao
Chung-Cheng Chiu
Yanzhang He
J. Chorowski
Smit Hinsu
Stella Laurenzo
James Qin
Orhan Firat
Wolfgang Macherey
Suyog Gupta
Ankur Bapna
Shuyuan Zhang
Ruoming Pang
Ron J. Weiss
Rohit Prabhavalkar
Qiao Liang
Benoit Jacob
Bowen Liang
HyoukJoong Lee
Ciprian Chelba
Sébastien Jean
Yue Liu
Melvin Johnson
Rohan Anil
Rajat Tibrewal
Xiaobing Liu
Akiko Eriguchi
Navdeep Jaitly
Naveen Ari
Colin Cherry
Parisa Haghani
Otavio Good
Youlong Cheng
R. Álvarez
Isaac Caswell
Wei-Ning Hsu
Zongheng Yang
Kuan Wang
Ekaterina Gonina
Katrin Tomanek
Ben Vanik
Zelin Wu
Llion Jones
M. Schuster
Yanping Huang
Dehao Chen
Kazuki Irie
George F. Foster
J. Richardson
Klaus Macherey
A. Bruguier
Heiga Zen
Colin Raffel
Shankar Kumar
Kanishka Rao
David Rybach
M. Murray
Vijayaditya Peddinti
M. Krikun
M. Bacchiani
T. Jablin
R. Suderman
Ian Williams
Benjamin Lee
Deepti Bhatia
Justin Carlson
Semih Yavuz
Yu Zhang
Ian McGraw
M. Galkin
Qi Ge
Golan Pundak
Chad Whipkey
Todd Wang
Uri Alon
Dmitry Lepikhin
Ye Tian
S. Sabour
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
Papers citing
"Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling"
50 / 162 papers shown
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Rohit Prabhavalkar
Yanzhang He
David Rybach
S. Campbell
A. Narayanan
Trevor Strohman
Tara N. Sainath
236
37
0
12 Dec 2020
A Better and Faster End-to-End Model for Streaming ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yue Liu
Anmol Gulati
Jiahui Yu
Tara N. Sainath
Chung-Cheng Chiu
...
Wei Han
Qiao Liang
Yu Zhang
Trevor Strohman
Yonghui Wu
AuLLM
340
131
0
21 Nov 2020
Cascaded encoders for unifying streaming and non-streaming ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
A. Narayanan
Tara N. Sainath
Ruoming Pang
Jiahui Yu
Chung-Cheng Chiu
Rohit Prabhavalkar
Ehsan Variani
Trevor Strohman
AuLLM
253
88
0
27 Oct 2020
Data Troubles in Sentence Level Confidence Estimation for Machine Translation
Ciprian Chelba
Junpei Zhou
Yuezhang Li
Hideto Kazawa
J. Klingner
Mengmeng Niu
139
4
0
26 Oct 2020
Rapid Domain Adaptation for Machine Translation with Monolingual Data
Mahdis Mahdieh
Mengzhao Chen
Yuan Cao
Orhan Firat
189
7
0
23 Oct 2020
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Qiujia Li
David Qiu
Yu Zhang
Yue Liu
Yanzhang He
P. Woodland
Liangliang Cao
Trevor Strohman
148
58
0
22 Oct 2020
Class-Conditional Defense GAN Against End-to-End Speech Attacks
Mohammad Esmaeilpour
P. Cardinal
Alessandro Lameiras Koerich
AAML
150
14
0
22 Oct 2020
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Jiahui Yu
Chung-Cheng Chiu
Yue Liu
Shuo-yiin Chang
Tara N. Sainath
...
A. Narayanan
Wei Han
Anmol Gulati
Yonghui Wu
Ruoming Pang
209
98
0
21 Oct 2020
Human-Paraphrased References Improve Neural Machine Translation
Conference on Machine Translation (WMT), 2020
Markus Freitag
George F. Foster
David Grangier
Colin Cherry
145
15
0
20 Oct 2020
Towards Resistant Audio Adversarial Examples
Tom Dörr
Karla Markert
Nicolas Müller
Konstantin Böttinger
AAML
113
7
0
14 Oct 2020
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling
Jiahui Yu
Wei Han
Anmol Gulati
Chung-Cheng Chiu
Yue Liu
Tara N. Sainath
Yonghui Wu
Ruoming Pang
340
19
0
12 Oct 2020
fairseq S2T: Fast Speech-to-Text Modeling with fairseq
Changhan Wang
Yun Tang
Xutai Ma
Anne Wu
Sravya Popuri
Dmytro Okhonko
J. Pino
VLM
LRM
325
318
0
11 Oct 2020
Towards a Scalable and Distributed Infrastructure for Deep Learning Applications
Bita Hasheminezhad
S. Shirzad
Nanmiao Wu
Patrick Diehl
Hannes Schulz
Hartmut Kaiser
GNN
AI4CE
393
4
0
06 Oct 2020
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Interspeech, 2020
Wei Li
James Qin
Chung-Cheng Chiu
Ruoming Pang
Yanzhang He
184
15
0
30 Aug 2020
Textual Echo Cancellation
Shaojin Ding
Ye Jia
Ke Hu
Quan Wang
299
8
0
13 Aug 2020
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
M. Krikun
Noam M. Shazeer
Zhiwen Chen
MoE
393
1,635
0
30 Jun 2020
CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency
Interspeech, 2020
Keyu An
Hongyu Xiang
Zhijian Ou
197
24
0
27 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
753
3,791
0
16 May 2020
Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Aditya Siddhant
Ankur Bapna
Yuan Cao
Orhan Firat
Mengzhao Chen
Sneha Kudugunta
N. Arivazhagan
Yonghui Wu
240
88
0
11 May 2020
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Chung-Cheng Chiu
A. Narayanan
Wei Han
Rohit Prabhavalkar
Yu Zhang
...
Ruoming Pang
Tara N. Sainath
Patrick Nguyen
Liangliang Cao
Yonghui Wu
376
44
0
07 May 2020
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Wei Han
Zhengdong Zhang
Yu Zhang
Jiahui Yu
Chung-Cheng Chiu
James Qin
Anmol Gulati
Ruoming Pang
Yonghui Wu
328
293
0
07 May 2020
Streaming Object Detection for 3-D Point Clouds
European Conference on Computer Vision (ECCV), 2020
Wei Han
Zhengdong Zhang
Benjamin Caine
Brandon Yang
Christoph Sprunk
O. Alsharif
Jiquan Ngiam
Vijay Vasudevan
Jonathon Shlens
Zhiwen Chen
3DPC
198
33
0
04 May 2020
Practical Perspectives on Quality Estimation for Machine Translation
Junpei Zhou
Ciprian Chelba
Yuezhang Li
100
2
0
02 May 2020
Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Baiji Liu
Songjun Cao
Sining Sun
Weibin Zhang
Long Ma
156
9
0
01 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
237
172
0
21 Apr 2020
Language-agnostic Multilingual Modeling
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
A. Datta
Bhuvana Ramabhadran
Jesse Emond
Anjuli Kannan
Brian Roark
137
35
0
20 Apr 2020
Re-translation versus Streaming for Simultaneous Translation
International Workshop on Spoken Language Translation (IWSLT), 2020
N. Arivazhagan
Colin Cherry
Wolfgang Macherey
George F. Foster
166
66
0
07 Apr 2020
Machine Translation Pre-training for Data-to-Text Generation -- A Case Study in Czech
International Conference on Natural Language Generation (INLG), 2020
Mihir Kale
Scott Roy
141
14
0
05 Apr 2020
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Tara N. Sainath
Yanzhang He
Yue Liu
A. Narayanan
Ruoming Pang
...
Trevor Strohman
Mirkó Visontai
Yonghui Wu
Yu Zhang
Ding Zhao
200
225
0
28 Mar 2020
Julia Language in Machine Learning: Algorithms, Applications, and Open Issues
Computer Science Review (CSR), 2020
Kaifeng Gao
Gang Mei
F. Piccialli
S. Cuomo
Jingzhi Tu
Zenan Huo
222
67
0
23 Mar 2020
Deliberation Model Based Two-Pass End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ke Hu
Tara N. Sainath
Ruoming Pang
Rohit Prabhavalkar
225
94
0
17 Mar 2020
Disentangling Adaptive Gradient Methods from Learning Rates
Naman Agarwal
Rohan Anil
Elad Hazan
Tomer Koren
Cyril Zhang
256
41
0
26 Feb 2020
Scalable Second Order Optimization for Deep Learning
Rohan Anil
Vineet Gupta
Tomer Koren
Kevin Regan
Y. Singer
ODL
170
31
0
20 Feb 2020
Controlling Computation versus Quality for Neural Sequence Models
Ankur Bapna
N. Arivazhagan
Orhan Firat
219
34
0
17 Feb 2020
Identifying Audio Adversarial Examples via Anomalous Pattern Detection
Victor Akinwande
C. Cintas
Skyler Speakman
Srihari Sridharan
AAML
185
18
0
13 Feb 2020
Faster Transformer Decoding: N-gram Masked Self-Attention
Ciprian Chelba
Mengzhao Chen
Ankur Bapna
Noam M. Shazeer
154
19
0
14 Jan 2020
Multimodal Machine Translation through Visuals and Speech
Machine Translation (MT), 2019
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
201
88
0
28 Nov 2019
Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Zhiyun Lu
Liangliang Cao
Yu Zhang
Chung-Cheng Chiu
James Fan
123
84
0
21 Nov 2019
Translationese as a Language in "Multilingual" NMT
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Parker Riley
Isaac Caswell
Markus Freitag
David Grangier
195
50
0
10 Nov 2019
A comparison of end-to-end models for long-form speech recognition
Automatic Speech Recognition & Understanding (ASRU), 2019
Chung-Cheng Chiu
Wei Han
Yu Zhang
Ruoming Pang
S. Kishchenko
...
Anjuli Kannan
Rohit Prabhavalkar
Zhiwen Chen
Tara N. Sainath
Yonghui Wu
AuLLM
207
90
0
06 Nov 2019
Fill in the Blanks: Imputing Missing Sentences for Larger-Context Neural Machine Translation
Sébastien Jean
Ankur Bapna
Orhan Firat
174
7
0
30 Oct 2019
Recognizing long-form speech using streaming end-to-end models
Automatic Speech Recognition & Understanding (ASRU), 2019
A. Narayanan
Rohit Prabhavalkar
Chung-Cheng Chiu
David Rybach
Tara N. Sainath
Trevor Strohman
173
135
0
24 Oct 2019
Optimizing Speech Recognition For The Edge
Yuan Shangguan
Jian Li
Qiao Liang
R. Álvarez
Ian McGraw
193
64
0
26 Sep 2019
Speech Recognition with Augmented Synthesized Speech
Automatic Speech Recognition & Understanding (ASRU), 2019
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Ye Jia
Pedro J. Moreno
Yonghui Wu
Zelin Wu
151
143
0
25 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Automatic Speech Recognition & Understanding (ASRU), 2019
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
178
75
0
18 Sep 2019
Simple, Scalable Adaptation for Neural Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Ankur Bapna
N. Arivazhagan
Orhan Firat
AI4CE
332
441
0
18 Sep 2019
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model
Interspeech, 2019
Anjuli Kannan
A. Datta
Tara N. Sainath
Eugene Weinstein
Bhuvana Ramabhadran
Yonghui Wu
Ankur Bapna
Zhiwen Chen
Seungjin Lee
AuLLM
147
187
0
11 Sep 2019
Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Aditya Siddhant
Melvin Johnson
Henry Tsai
N. Arivazhagan
Jason Riesa
Ankur Bapna
Orhan Firat
Karthik Raman
164
73
0
01 Sep 2019
Two-Pass End-to-End Speech Recognition
Interspeech, 2019
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
166
157
0
29 Aug 2019
DELTA: A DEep learning based Language Technology plAtform
Kun Han
Junwen Chen
Hui Zhang
Haiyang Xu
Yiping Peng
...
Cheng Gong
Yunbo Wang
Wei Zou
Hui Song
Xiangang Li
VLM
98
10
0
02 Aug 2019