Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.01576
Cited By
Quasi-Recurrent Neural Networks
5 November 2016
James Bradbury
Stephen Merity
Caiming Xiong
R. Socher
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quasi-Recurrent Neural Networks"
50 / 206 papers shown
Title
Recurrent Attention Unit
G. Zhong
Guohua Yue
Xiao Ling
23
14
0
30 Oct 2018
Simplifying Neural Machine Translation with Addition-Subtraction Twin-Gated Recurrent Networks
Biao Zhang
Deyi Xiong
Jinsong Su
Qian Lin
Huiji Zhang
11
12
0
30 Oct 2018
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
15
145
0
15 Oct 2018
Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning Framework
Shun Kiyono
Jun Suzuki
Kentaro Inui
31
8
0
13 Oct 2018
Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists
Giancarlo D. Salton
John D. Kelleher
KELM
RALM
16
6
0
10 Oct 2018
End-to-End Text Classification via Image-based Embedding using Character-level Networks
Shunsuke Kitada
Ryunosuke Kotani
Hitoshi Iyatomi
17
4
0
08 Oct 2018
Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids
Yunzhu Li
Jiajun Wu
Russ Tedrake
J. Tenenbaum
Antonio Torralba
PINN
AI4CE
27
387
0
03 Oct 2018
Propagation Networks for Model-Based Control Under Partial Observation
Yunzhu Li
Jiajun Wu
Jun-Yan Zhu
J. Tenenbaum
Antonio Torralba
Russ Tedrake
AI4CE
8
137
0
28 Sep 2018
Adaptive Pruning of Neural Language Models for Mobile Devices
Raphael Tang
Jimmy J. Lin
16
6
0
27 Sep 2018
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
18
94
0
29 Aug 2018
Pyramidal Recurrent Unit for Language Modeling
Sachin Mehta
Rik Koncel-Kedziorski
Mohammad Rastegari
Hannaneh Hajishirzi
19
10
0
27 Aug 2018
Comparing CNN and LSTM character-level embeddings in BiLSTM-CRF models for chemical and disease named entity recognition
Zenan Zhai
Dat Quoc Nguyen
Karin Verspoor
9
30
0
25 Aug 2018
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
18
116
0
21 Jul 2018
IGLOO: Slicing the Features Space to Represent Sequences
Vsevolod Sourkov
VLM
18
5
0
09 Jul 2018
Sliced Recurrent Neural Networks
Zeping Yu
Gongshen Liu
16
41
0
06 Jul 2018
Multi-task WaveNet: A Multi-task Generative Model for Statistical Parametric Speech Synthesis without Fundamental Frequency Conditions
Yu Gu
Yongguo Kang
8
17
0
22 Jun 2018
Semi-tied Units for Efficient Gating in LSTM and Highway Networks
Chao Zhang
P. Woodland
22
3
0
18 Jun 2018
Navigating with Graph Representations for Fast and Scalable Decoding of Neural Language Models
Minjia Zhang
Xiaodong Liu
Wenhan Wang
Jianfeng Gao
Yuxiong He
23
30
0
11 Jun 2018
Learning to Search in Long Documents Using Document Structure
Mor Geva
Jonathan Berant
RALM
20
15
0
09 Jun 2018
Convolutional neural networks for chemical-disease relation extraction are improved with character-based word embeddings
Dat Quoc Nguyen
Karin Verspoor
NAI
MedIm
6
46
0
27 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
19
44
0
22 May 2018
Deep Neural Machine Translation with Weakly-Recurrent Units
Mattia Antonino Di Gangi
Marcello Federico
AIMat
12
19
0
10 May 2018
Long Short-Term Memory as a Dynamically Computed Element-wise Weighted Sum
Omer Levy
Kenton Lee
Nicholas FitzGerald
Luke Zettlemoyer
RALM
11
34
0
09 May 2018
Convolutional Sequence to Sequence Model for Human Dynamics
Chen Li
Zhen Zhang
Wee Sun Lee
Gim Hee Lee
3DH
13
321
0
02 May 2018
Anticipating Traffic Accidents with Adaptive Loss and Large-scale Incident DB
Tomoyuki Suzuki
Hirokatsu Kataoka
Y. Aoki
Y. Satoh
11
108
0
08 Apr 2018
Single Stream Parallelization of Recurrent Neural Networks for Low Power and Fast Inference
Wonyong Sung
Jinhwan Park
11
5
0
30 Mar 2018
Multi-range Reasoning for Machine Comprehension
Yi Tay
Anh Tuan Luu
S. Hui
11
34
0
24 Mar 2018
An Analysis of Neural Language Modeling at Multiple Scales
Stephen Merity
N. Keskar
R. Socher
19
170
0
22 Mar 2018
Generalised Structural CNNs (SCNNs) for time series data with arbitrary graph topology
Thomas Teh
C. Auepanwiriyakul
J. Harston
A. Faisal
GNN
27
2
0
14 Mar 2018
Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN
Shuai Li
W. Li
Chris Cook
Ce Zhu
Yanbo Gao
8
719
0
13 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
25
4,708
0
04 Mar 2018
The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks
Nicholas Carlini
Chang-rui Liu
Ulfar Erlingsson
Jernej Kos
D. Song
45
1,111
0
22 Feb 2018
Character-level Recurrent Neural Networks in Practice: Comparing Training and Sampling Schemes
Cedric De Boom
Thomas Demeester
Bart Dhoedt
8
8
0
02 Jan 2018
A Flexible Approach to Automated RNN Architecture Generation
Martin Schrimpf
Stephen Merity
James Bradbury
R. Socher
19
15
0
20 Dec 2017
Learning to Attend via Word-Aspect Associative Fusion for Aspect-based Sentiment Analysis
Yi Tay
Anh Tuan Luu
S. Hui
10
188
0
14 Dec 2017
Cross Temporal Recurrent Networks for Ranking Question Answer Pairs
Yi Tay
Anh Tuan Luu
S. Hui
OOD
16
42
0
21 Nov 2017
Weighted Transformer Network for Machine Translation
Karim Ahmed
N. Keskar
R. Socher
25
133
0
06 Nov 2017
Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning
Zhen He
Shaobing Gao
Liang Xiao
Daxue Liu
Hangen He
David Barber
AIMat
35
64
0
05 Nov 2017
Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German
Pierre-Edouard Honnet
Andrei Popescu-Belis
C. Musat
Michael Baeriswyl
28
40
0
30 Oct 2017
Malware Detection by Eating a Whole EXE
Edward Raff
Jon Barker
Jared Sylvester
Robert Brandon
Bryan Catanzaro
Charles K. Nicholas
21
535
0
25 Oct 2017
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
Hideyuki Tachibana
Katsuya Uenoyama
Shunsuke Aihara
16
265
0
24 Oct 2017
Low-Rank RNN Adaptation for Context-Aware Language Modeling
Aaron Jaech
Mari Ostendorf
17
25
0
06 Oct 2017
Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling
Chaitanya Ahuja
Louis-Philippe Morency
22
4
0
06 Oct 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
27
140
0
15 Sep 2017
Parallelizing Linear Recurrent Neural Nets Over Sequence Length
Eric Martin
Chris Cundy
22
93
0
12 Sep 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
42
271
0
08 Sep 2017
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos
Brendan Jou
Xavier Giró-i-Nieto
Jordi Torres
Shih-Fu Chang
14
217
0
22 Aug 2017
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
45
1,090
0
07 Aug 2017
Revisiting Activation Regularization for Language RNNs
Stephen Merity
Bryan McCann
R. Socher
27
44
0
03 Aug 2017
Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks
Fréderic Godin
Jonas Degrave
J. Dambre
W. D. Neve
MU
11
46
0
25 Jul 2017
Previous
1
2
3
4
5
Next