ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.05524
  4. Cited By
Learning Hard Alignments with Variational Inference
v1v2 (latest)

Learning Hard Alignments with Variational Inference

16 May 2017
Dieterich Lawson
Chung-Cheng Chiu
George Tucker
Colin Raffel
Kevin Swersky
Navdeep Jaitly
    DRL
ArXiv (abs)PDFHTML

Papers citing "Learning Hard Alignments with Variational Inference"

34 / 34 papers shown
R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework
  for Low-Latency Simultaneous Speech Translation
R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation
Jiaxin Guo
Zhanglin Wu
Zongyao Li
Hengchao Shang
Daimeng Wei
Xiaoyu Chen
Zhiqiang Rao
Shaojun Li
Hao Yang
191
1
0
11 Jan 2024
NAS-X: Neural Adaptive Smoothing via Twisting
NAS-X: Neural Adaptive Smoothing via TwistingNeural Information Processing Systems (NeurIPS), 2023
Dieterich Lawson
Michael Y. Li
Scott W. Linderman
231
2
0
28 Aug 2023
Neural Processes with Stochastic Attention: Paying more attention to the
  context dataset
Neural Processes with Stochastic Attention: Paying more attention to the context datasetInternational Conference on Learning Representations (ICLR), 2022
Mingyu Kim
Kyeongryeol Go
Se-Young Yun
149
20
0
11 Apr 2022
A study of latent monotonic attention variants
A study of latent monotonic attention variants
Albert Zeyer
Ralf Schluter
Hermann Ney
154
5
0
30 Mar 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based ApplicationsArtificial Intelligence Review (AIR), 2021
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Xiaoshi Zhong
OffRL
338
88
0
01 Jan 2021
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End
  Simultaneous Speech Translation
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation
Xutai Ma
J. Pino
Philipp Koehn
223
104
0
03 Nov 2020
Bayesian Attention Modules
Bayesian Attention ModulesNeural Information Processing Systems (NeurIPS), 2020
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
290
69
0
20 Oct 2020
Enhancing Monotonic Multihead Attention for Streaming ASR
Enhancing Monotonic Multihead Attention for Streaming ASR
Hirofumi Inaguma
Masato Mimura
Tatsuya Kawahara
288
35
0
19 May 2020
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and SurveyJournal of Artificial Intelligence Research (JAIR), 2019
Felix Stahlberg
3DVAI4TSMedIm
377
381
0
04 Dec 2019
Monotonic Multihead Attention
Monotonic Multihead AttentionInternational Conference on Learning Representations (ICLR), 2019
Xutai Ma
J. Pino
James Cross
Liezl Puzon
Jiatao Gu
174
147
0
26 Sep 2019
Select and Attend: Towards Controllable Content Selection in Text
  Generation
Select and Attend: Towards Controllable Content Selection in Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Xiaoyu Shen
Jun Suzuki
Kentaro Inui
Hui Su
Dietrich Klakow
Satoshi Sekine
154
29
0
10 Sep 2019
SeGMA: Semi-Supervised Gaussian Mixture Auto-Encoder
SeGMA: Semi-Supervised Gaussian Mixture Auto-EncoderIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Marek Śmieja
Maciej Wołczyk
Jacek Tabor
Bernhard C. Geiger
161
24
0
21 Jun 2019
Near-Optimal Glimpse Sequences for Improved Hard Attention Neural
  Network Training
Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network TrainingIEEE International Joint Conference on Neural Network (IJCNN), 2019
William Harvey
Michael Teng
Frank Wood
145
4
0
13 Jun 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence
  Modeling
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhiwen Chen
Mengzhao Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
204
214
0
21 Feb 2019
An Online Attention-based Model for Speech Recognition
An Online Attention-based Model for Speech Recognition
Ruchao Fan
Pan Zhou
Wei Chen
Jia Jia
Gang Liu
153
48
0
13 Nov 2018
Latent Alignment and Variational Attention
Latent Alignment and Variational Attention
Yuntian Deng
Yoon Kim
Justin T. Chiu
Demi Guo
Alexander M. Rush
BDL
185
115
0
10 Jul 2018
Gaussian mixture models with Wasserstein distance
Gaussian mixture models with Wasserstein distance
Benoit Gaujac
Ilya Feige
David Barber
117
9
0
12 Jun 2018
Monotonic Chunkwise Attention
Monotonic Chunkwise Attention
Chung-Cheng Chiu
Colin Raffel
233
267
0
14 Dec 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
270
271
0
03 Apr 2017
Learning to Translate in Real-time with Neural Machine Translation
Learning to Translate in Real-time with Neural Machine Translation
Jiatao Gu
Graham Neubig
Dong Wang
Victor O.K. Li
264
230
0
03 Oct 2016
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
470
2,515
0
18 Sep 2016
Hierarchical Multiscale Recurrent Neural Networks
Hierarchical Multiscale Recurrent Neural Networks
Junyoung Chung
Sungjin Ahn
Yoshua Bengio
BDL
414
548
0
06 Sep 2016
Learning Online Alignments with Continuous Rewards Policy Gradient
Learning Online Alignments with Continuous Rewards Policy Gradient
Yuping Luo
Chung-Cheng Chiu
Navdeep Jaitly
Ilya Sutskever
OffRL
173
47
0
03 Aug 2016
Sequential Neural Models with Stochastic Layers
Sequential Neural Models with Stochastic Layers
Marco Fraccaro
Søren Kaae Sønderby
Ulrich Paquet
Ole Winther
BDL
267
423
0
24 May 2016
Variational inference for Monte Carlo objectives
Variational inference for Monte Carlo objectives
A. Mnih
Danilo Jimenez Rezende
DRLBDL
334
297
0
22 Feb 2016
Learning Wake-Sleep Recurrent Attention Models
Learning Wake-Sleep Recurrent Attention Models
Jimmy Ba
Roger C. Grosse
Ruslan Salakhutdinov
B. Frey
BDL
186
65
0
22 Sep 2015
Importance Weighted Autoencoders
Importance Weighted AutoencodersInternational Conference on Learning Representations (ICLR), 2015
Yuri Burda
Roger C. Grosse
Ruslan Salakhutdinov
BDL
770
1,305
0
01 Sep 2015
Listen, Attend and Spell
Listen, Attend and SpellIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
447
2,374
0
05 Aug 2015
Attention-Based Models for Speech Recognition
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Dong Wang
Yoshua Bengio
358
2,715
0
24 Jun 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Dong Wang
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
869
10,588
0
10 Feb 2015
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and TranslateInternational Conference on Learning Representations (ICLR), 2014
Dzmitry Bahdanau
Dong Wang
Yoshua Bengio
AIMat
1.6K
28,724
0
01 Sep 2014
Recurrent Models of Visual Attention
Recurrent Models of Visual AttentionNeural Information Processing Systems (NeurIPS), 2014
Volodymyr Mnih
N. Heess
Alex Graves
Koray Kavukcuoglu
VLM
414
3,885
0
24 Jun 2014
Neural Variational Inference and Learning in Belief Networks
Neural Variational Inference and Learning in Belief NetworksInternational Conference on Machine Learning (ICML), 2014
A. Mnih
Karol Gregor
BDL
675
744
0
31 Jan 2014
Learning Generative Models with Visual Attention
Learning Generative Models with Visual AttentionNeural Information Processing Systems (NeurIPS), 2013
Yichuan Tang
Nitish Srivastava
Ruslan Salakhutdinov
DiffM
370
86
0
20 Dec 2013
1