Sequence-to-Sequence Learning as Beam-Search Optimization


9 June 2016
Sam Wiseman
Alexander M. Rush

Papers citing "Sequence-to-Sequence Learning as Beam-Search Optimization"

50 / 276 papers shown
Title
Optimal Completion Distillation for Sequence Learning
S. Sabour
William Chan
Mohammad Norouzi
11
45
0
02 Oct 2018
Cell-aware Stacked LSTMs for Modeling Sentences
Jihun Choi
Taeuk Kim
Sang-goo Lee
AI4TS
19
4
0
07 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric P. Xing
VLM
12
56
0
04 Sep 2018
Imitation Learning for Neural Morphological String Transduction
Peter Makarov
Simon Clematide
AI4CE
17
33
0
31 Aug 2018
Reasoning about Actions and State Changes by Injecting Commonsense Knowledge
Niket Tandon
Bhavana Dalvi
Joel Grus
Wen-tau Yih
Antoine Bosselut
Peter Clark
16
87
0
29 Aug 2018
Correcting Length Bias in Neural Machine Translation
Kenton W. Murray
David Chiang
AIMat
15
150
0
29 Aug 2018
Breaking the Beam Search Curse: A Study of (Re-)Scoring Methods and Stopping Criteria for Neural Machine Translation
Yilin Yang
Liang Huang
Mingbo Ma
26
92
0
28 Aug 2018
Why Do Neural Response Generation Models Prefer Universal Replies?
Bowen Wu
Nan Jiang
Zhifeng Gao
Mengyuan Li
Zongsheng Wang
Suke Li
Qihang Feng
Wenge Rong
Baoxun Wang
24
8
0
28 Aug 2018
Natural Language Generation with Neural Variational Models
Hareesh Bahuleyan
DRL
14
6
0
27 Aug 2018
Large Margin Neural Language Model
Jiaji Huang
Yi Li
Ming-Yu Liu
Liang Huang
13
25
0
27 Aug 2018
Paraphrases as Foreign Languages in Multilingual Neural Machine Translation
Zhong Zhou
Matthias Sperber
A. Waibel
LRM
17
19
0
25 Aug 2018
Approximate Distribution Matching for Sequence-to-Sequence Learning
Wenhu Chen
Guanlin Li
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
OOD
BDL
11
0
0
24 Aug 2018
Improving Abstraction in Text Summarization
Wojciech Kryściński
Romain Paulus
Caiming Xiong
R. Socher
16
147
0
23 Aug 2018
The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers
Dongxiang Zhang
Lei Wang
Nuo Xu
B. Dai
Heng Tao Shen
ReLM
AIMat
45
126
0
22 Aug 2018
XL-NBT: A Cross-lingual Neural Belief Tracking Framework
Wenhu Chen
Jianshu Chen
Yu-Chuan Su
Xin Eric Wang
Dong Yu
Xifeng Yan
William Yang Wang
13
33
0
19 Aug 2018
Regularizing Neural Machine Translation by Target-bidirectional Agreement
Zhirui Zhang
Shuo Ren
Shujie Liu
Mu Li
M. Zhou
Tong Xu
37
116
0
13 Aug 2018
Improving Sequential Determinantal Point Processes for Supervised Video Summarization
Aidean Sharghi
Ali Borji
Chengtao Li
Tianbao Yang
Boqing Gong
AI4TS
17
47
0
28 Jul 2018
Abstractive and Extractive Text Summarization using Document Context Vector and Recurrent Neural Networks
Chandra Khatri
Gyanit Singh
Nish Parikh
27
60
0
20 Jul 2018
Latent Alignment and Variational Attention
Yuntian Deng
Yoon Kim
Justin T. Chiu
Demi Guo
Alexander M. Rush
BDL
18
110
0
10 Jul 2018
Robust Text-to-SQL Generation with Execution-Guided Decoding
Chenglong Wang
Kedar Tatwawadi
Marc Brockschmidt
Po-Sen Huang
Yi Mao
Oleksandr Polozov
Rishabh Singh
26
100
0
09 Jul 2018
On Adversarial Examples for Character-Level Neural Machine Translation
J. Ebrahimi
Daniel Lowd
Dejing Dou
AAML
14
216
0
23 Jun 2018
Partially-Supervised Image Captioning
Peter Anderson
Stephen Gould
Mark Johnson
19
32
0
15 Jun 2018
Generating Sentences Using a Dynamic Canvas
Harshil Shah
Bowen Zheng
David Barber
10
7
0
13 Jun 2018
SGM: Sequence Generation Model for Multi-label Classification
Pengcheng Yang
Xu Sun
Wei Li
Shuming Ma
Wei Yu Wu
Houfeng Wang
22
376
0
13 Jun 2018
Deep State Space Models for Unconditional Word Generation
Florian Schmidt
Thomas Hofmann
9
14
0
12 Jun 2018
Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing
Daniel Fried
Dan Klein
31
27
0
08 Jun 2018
Towards Binary-Valued Gates for Robust LSTM Training
Zhuohan Li
Di He
Fei Tian
Wei-neng Chen
Tao Qin
Liwei Wang
Tie-Yan Liu
MQ
10
47
0
08 Jun 2018
Distilling Knowledge for Search-based Structured Prediction
Yijia Liu
Wanxiang Che
Huaipeng Zhao
Bing Qin
Ting Liu
19
22
0
29 May 2018
Toward Extractive Summarization of Online Forum Discussions via Hierarchical Attention Networks
Sansiri Tarnpradab
Fei Liu
K. Hua
16
33
0
25 May 2018
Neural Argument Generation Augmented with Externally Retrieved Evidence
Xinyu Hua
Lu Wang
LRM
14
61
0
25 May 2018
QuaterNet: A Quaternion-based Recurrent Model for Human Motion
Dario Pavllo
David Grangier
Michael Auli
3DH
17
259
0
16 May 2018
Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis
Rudy Bunel
Matthew J. Hausknecht
Jacob Devlin
Rishabh Singh
Pushmeet Kohli
NAI
8
206
0
11 May 2018
A comparable study of modeling units for end-to-end Mandarin speech recognition
Wei Zou
Dongwei Jiang
Shuaijiang Zhao
Xiangang Li
8
32
0
10 May 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
Zihang Dai
Qizhe Xie
Eduard H. Hovy
16
6
0
29 Apr 2018
Neural Particle Smoothing for Sampling from Conditional Sequence Models
Chu-cheng Lin
Jason Eisner
BDL
14
12
0
28 Apr 2018
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
VLM
19
302
0
23 Apr 2018
A Stable and Effective Learning Strategy for Trainable Greedy Decoding
Yun Chen
V. Li
Kyunghyun Cho
Samuel R. Bowman
20
28
0
21 Apr 2018
Learning Approximate Inference Networks for Structured Prediction
Lifu Tu
Kevin Gimpel
BDL
11
53
0
09 Mar 2018
Differentiable lower bound for expected BLEU score
Vlad Zhukov
Eugene Golikov
M. Kretov
14
15
0
13 Dec 2017
Structured Set Matching Networks for One-Shot Part Labeling
Jonghyun Choi
Jayant Krishnamurthy
Aniruddha Kembhavi
Ali Farhadi
24
23
0
05 Dec 2017
Using stochastic computation graphs formalism for optimization of sequence-to-sequence model
Eugene Golikov
Vlad Zhukov
M. Kretov
21
0
0
21 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
Marc'Aurelio Ranzato
AIMat
56
185
0
14 Nov 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
27
178
0
11 Sep 2017
Globally Normalized Reader
Jonathan Raiman
John Miller
25
36
0
08 Sep 2017
A Continuous Relaxation of Beam Search for End-to-end Training of Neural Sequence Models
Kartik Goyal
Graham Neubig
Chris Dyer
Taylor Berg-Kirkpatrick
3DV
39
40
0
01 Aug 2017
Single-Queue Decoding for Neural Machine Translation
Raphael Shu
Hideki Nakayama
23
0
0
06 Jul 2017
Efficient Attention using a Fixed-Size Memory Representation
D. Britz
M. Guan
Minh-Thang Luong
3DV
16
32
0
01 Jul 2017
Generative Bridging Network in Neural Sequence Prediction
Wenhu Chen
Guanlin Li
Shuo Ren
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
12
10
0
28 Jun 2017
Encoder-Decoder Shift-Reduce Syntactic Parsing
Jiangming Liu
Yue Zhang
18
15
0
24 Jun 2017
Neural Machine Translation with Gumbel-Greedy Decoding
Jiatao Gu
Daniel Jiwoong Im
V. Li
17
35
0
22 Jun 2017