Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.07503
Cited By
Attention-Based Models for Speech Recognition
24 June 2015
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention-Based Models for Speech Recognition"
50 / 313 papers shown
Title
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li
Rui Zhao
Hu Hu
Y. Gong
8
170
0
26 Sep 2019
Attention Forcing for Sequence-to-sequence Model Training
Qingyun Dou
Yiting Lu
Joshua Efiong
Mark J. F. Gales
19
6
0
26 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
19
37
0
22 Sep 2019
Acoustic scene analysis with multi-head attention networks
Weimin Wang
Weiran Wang
Ming Sun
Chao Wang
14
3
0
16 Sep 2019
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
19
8
0
30 Aug 2019
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
18
147
0
29 Aug 2019
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
Bin Ding
Chengjiang Long
Ling Zhang
Chunxia Xiao
GAN
3DH
22
151
0
04 Aug 2019
Deep Learning for Time Series Forecasting: The Electric Load Case
Alberto Gasparin
S. Lukovic
C. Alippi
AI4TS
24
220
0
22 Jul 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
11
38
0
13 Jul 2019
Learning Blended, Precise Semantic Program Embeddings
Ke Wang
Z. Su
NAI
22
25
0
03 Jul 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
22
796
0
25 Jun 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
22
99
0
25 Jun 2019
Query-based Interactive Recommendation by Meta-Path and Adapted Attention-GRU
Yu Zhu
Yu Gong
Qingwen Liu
Yingcai Ma
Wenwu Ou
Junxiong Zhu
Beidou Wang
Ziyu Guan
Deng Cai
LRM
14
15
0
24 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
11
27
0
17 Jun 2019
2D Attentional Irregular Scene Text Recognizer
Pengyuan Lyu
Zhicheng Yang
Xinhang Leng
Xiaojun Wu
Ruiyu Li
Xiaoyong Shen
3DV
28
50
0
13 Jun 2019
Gradual Machine Learning for Aspect-level Sentiment Analysis
Yanyan Wang
Qun Chen
Jiquan Shen
Boyi Hou
Ahmed Murtadha
Zhanhuai Li
22
1
0
06 Jun 2019
Sequential Neural Networks as Automata
William Merrill
10
74
0
04 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
19
125
0
27 May 2019
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
14
7
0
21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
E. Tsunoo
Yosuke Kashiwagi
S. Asakawa
Toshiyuki Kumakura
11
4
0
17 May 2019
Sparse Sequence-to-Sequence Models
Ben Peters
Vlad Niculae
André F. T. Martins
TPM
19
209
0
14 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
41
101
0
13 May 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
584
0
30 Apr 2019
Aggregation Cross-Entropy for Sequence Recognition
Zecheng Xie
Yaoxiong Huang
Yuanzhi Zhu
Lianwen Jin
Yuliang Liu
Lele Xie
17
92
0
17 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
16
46
0
17 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
16
112
0
08 Apr 2019
Relation-Aware Global Attention for Person Re-identification
Zhizheng Zhang
Cuiling Lan
Wenjun Zeng
Xin Jin
Zhibo Chen
3DPC
14
477
0
05 Apr 2019
Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees
J. Chorowski
A. Lancucki
Bartosz Kostka
Michal Zapotoczny
14
5
0
14 Jan 2019
Speaker Adaptation for End-to-End CTC Models
Ke Li
Jinyu Li
Yong Zhao
Kshitiz Kumar
Y. Gong
18
24
0
04 Jan 2019
Automatic Grammar Augmentation for Robust Voice Command Recognition
Yang Yang
Anusha Lalitha
Jinwon Lee
Chris Lott
21
3
0
14 Nov 2018
Stream attention-based multi-array end-to-end speech recognition
Xiaofei Wang
Ruizhi Li
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
22
21
0
12 Nov 2018
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
K. Shinoda
22
25
0
12 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Matthew Wiesner
Jan ''Honza'' Cernocký
16
18
0
07 Nov 2018
Multi-Head Attention with Disagreement Regularization
Jian Li
Zhaopeng Tu
Baosong Yang
Michael R. Lyu
Tong Zhang
27
145
0
24 Oct 2018
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
17
687
0
06 Sep 2018
Pedestrian Trajectory Prediction with Structured Memory Hierarchies
Tharindu Fernando
Simon Denman
S. Sridharan
Clinton Fookes
19
18
0
22 Jul 2018
Modeling Taxi Drivers' Behaviour for the Next Destination Prediction
Alberto Rossi
Gianni Barlacchi
Monica Bianchini
Bruno Lepri
8
31
0
21 Jul 2018
This Looks Like That: Deep Learning for Interpretable Image Recognition
Chaofan Chen
Oscar Li
Chaofan Tao
A. Barnett
Jonathan Su
Cynthia Rudin
29
1,156
0
27 Jun 2018
Multi-variable LSTM neural network for autoregressive exogenous model
Tian Guo
Tao R. Lin
BDL
AI4TS
30
19
0
17 Jun 2018
Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Chao Yang
Taehwan Kim
Ruizhe Wang
Hao Peng
C.-C. Jay Kuo
26
51
0
16 Jun 2018
Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU
Harini Suresh
Jen J. Gong
John Guttag
23
85
0
07 Jun 2018
Table-to-Text: Describing Table Region with Natural Language
Junwei Bao
Duyu Tang
Nan Duan
Zhao Yan
Yuanhua Lv
M. Zhou
T. Zhao
LMTD
21
98
0
29 May 2018
Abstractive Text Classification Using Sequence-to-convolution Neural Networks
Taehoon Kim
Jihoon Yang
VLM
25
11
0
20 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
22
19
0
14 May 2018
A Deep Learning Approach with an Attention Mechanism for Automatic Sleep Stage Classification
Martin Längkvist
Amy Loutfi
8
11
0
14 May 2018
Detection of Paroxysmal Atrial Fibrillation using Attention-based Bidirectional Recurrent Neural Networks
S. Shashikumar
Amit J. Shah
Gari D. Clifford
S. Nemati
14
78
0
07 May 2018
Interact and Decide: Medley of Sub-Attention Networks for Effective Group Recommendation
Lucas Vinh Tran
T. Pham
Yi Tay
Yiding Liu
Gao Cong
Xiaoli Li
19
93
0
12 Apr 2018
Graph2Seq: Graph to Sequence Learning with Attention-based Neural Networks
Kun Xu
Lingfei Wu
Zhiguo Wang
Yansong Feng
Michael Witbrock
V. Sheinin
GNN
25
171
0
03 Apr 2018
Conditional End-to-End Audio Transforms
Albert Haque
Michelle Guo
Prateek Verma
19
41
0
30 Mar 2018
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Matthew Wiesner
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
8
1,477
0
30 Mar 2018
Previous
1
2
3
4
5
6
7
Next