Recurrent Neural Network Regularization

8 September 2014
Wojciech Zaremba
Ilya Sutskever
Oriol Vinyals
    ODL
arXiv: 1409.2329

Papers citing "Recurrent Neural Network Regularization"

50 / 274 papers shown
Title
Super-resolution-based Change Detection Network with Stacked Attention Module for Images with Different Resolutions
Mengxi Liu
Q. Shi
Andrea Marinoni
Da He
Xiaoping Liu
Liangpei Zhang
45
121
0
27 Feb 2021
AutoDropout: Learning Dropout Patterns to Regularize Deep Networks
Hieu H. Pham
Quoc V. Le
76
56
0
05 Jan 2021
Residual Matrix Product State for Machine Learning
Ye Meng
Jing Zhang
Peng Zhang
Chao Gao
Shi-Ju Ran
26
12
0
22 Dec 2020
Robot Gaining Accurate Pouring Skills through Self-Supervised Learning and Generalization
Yongqiang Huang
Juan Wilches
Yu Sun
SSL
17
22
0
19 Nov 2020
Biomedical Named Entity Recognition at Scale
Veysel Kocaman
D. Talby
22
67
0
12 Nov 2020
Bayesian Methods for Semi-supervised Text Annotation
Kristian Miok
Gregor Pirš
Marko Robnik-Šikonja
BDL
34
5
0
28 Oct 2020
Avoiding Occupancy Detection from Smart Meter using Adversarial Machine Learning
Ibrahim Yilmaz
Ambareen Siraj
AAML
15
21
0
23 Oct 2020
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
30
17
0
12 Oct 2020
Discrete-time signatures and randomness in reservoir computing
Christa Cuchiero
Lukas Gonon
Lyudmila Grigoryeva
Juan-Pablo Ortega
Josef Teichmann
27
45
0
17 Sep 2020
DCSFN: Deep Cross-scale Fusion Network for Single Image Rain Removal
Cong Wang
Xiaoying Xing
Zhixun Su
Junyang Chen
17
114
0
03 Aug 2020
Dimension reduction in recurrent networks by canonicalization
Lyudmila Grigoryeva
Juan-Pablo Ortega
24
19
0
23 Jul 2020
Recognizing Chinese Judicial Named Entity using BiLSTM-CRF
Pin Tang
Pinli Yang
Yuang Shi
Yi Zhou
Feng Lin
Yan-Chao Wang
AILaw
27
12
0
31 May 2020
rTop-k: A Statistical Estimation Approach to Distributed SGD
L. P. Barnes
Huseyin A. Inan
Berivan Isik
Ayfer Özgür
32
65
0
21 May 2020
Adaptive Partial Scanning Transmission Electron Microscopy with Reinforcement Learning
Jeffrey M. Ede
24
12
0
06 Apr 2020
Using Reinforcement Learning in the Algorithmic Trading Problem
E. Ponomarev
Ivan Oseledets
A. Cichocki
30
24
0
26 Feb 2020
Teaching Machines to Converse
Jiwei Li
29
4
0
31 Jan 2020
Delving Deeper into the Decoder for Video Captioning
Haoran Chen
Jianmin Li
Xiaolin Hu
43
34
0
16 Jan 2020
Graduate Employment Prediction with Bias
Teng Guo
Feng Xia
Shi Zhen
Xiaomei Bai
Dongyu Zhang
Zitao Liu
Jiliang Tang
40
26
0
27 Dec 2019
Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications
Hyunseok Seo
M. B. Khuzani
V. Vasudevan
Charles Huang
Hongyi Ren
Ruoxiu Xiao
Xiao Jia
Lei Xing
VLM
24
218
0
06 Nov 2019
Efficient Decoupled Neural Architecture Search by Structure and Operation Sampling
Heung-Chang Lee
Do-Guk Kim
Bohyung Han
38
6
0
23 Oct 2019
Image Super-Resolution via Attention based Back Projection Networks
Zhi-Song Liu
Li-Wen Wang
Chu-Tak Li
W. Siu
Yui-Lam Chan
SupR
40
67
0
10 Oct 2019
Multilingual End-to-End Speech Translation
Hirofumi Inaguma
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
LRM
28
86
0
01 Oct 2019
Explaining and Interpreting LSTMs
L. Arras
Jose A. Arjona-Medina
Michael Widrich
G. Montavon
Michael Gillhofer
K. Müller
Sepp Hochreiter
Wojciech Samek
FAtt
AI4TS
21
79
0
25 Sep 2019
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR
F. Weninger
Jesús Andrés-Ferrer
Xinwei Li
P. Zhan
AI4TS
29
26
0
08 Jul 2019
Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video
Jian Liu
Naveed Akhtar
Ajmal Mian
3DH
18
10
0
01 Jun 2019
Stochastic Gradient Methods with Block Diagonal Matrix Adaptation
Jihun Yun
A. Lozano
Eunho Yang
ODL
9
5
0
26 May 2019
Survey of Dropout Methods for Deep Neural Networks
Alex Labach
Hojjat Salehinejad
S. Valaee
27
149
0
25 Apr 2019
Effective Estimation of Deep Generative Language Models
Tom Pelsmaeker
Wilker Aziz
BDL
24
27
0
17 Apr 2019
Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization
Yangyang Shi
M. Hwang
X. Lei
Haoyu Sheng
31
25
0
08 Apr 2019
A Statistical Investigation of Long Memory in Language and Music
Alexander Greaves-Tunnell
Zaïd Harchaoui
AI4TS
27
24
0
08 Apr 2019
Effective and Efficient Dropout for Deep Convolutional Neural Networks
Shaofeng Cai
Jinyang Gao
Gang Chen
Beng Chin Ooi
Wei Wang
Meihui Zhang
BDL
18
53
0
06 Apr 2019
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo-Lopes
David R Ha
Douglas Eck
Jonathon Shlens
GAN
OCL
30
113
0
04 Apr 2019
Model Slicing for Supporting Complex Analytics with Elastic Inference Cost and Resource Constraints
Shaofeng Cai
Gang Chen
Beng Chin Ooi
Jinyang Gao
25
19
0
03 Apr 2019
Data-driven Prognostics with Predictive Uncertainty Estimation using Ensemble of Deep Ordinal Regression Models
T. Vishnu
Diksha Garg
Pankaj Malhotra
L. Vig
Gautam M. Shroff
UQCV
32
15
0
23 Mar 2019
Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions
M. Mackay
Paul Vicol
Jonathan Lorraine
David Duvenaud
Roger C. Grosse
27
164
0
07 Mar 2019
Context Vectors are Reflections of Word Vectors in Half the Dimensions
Z. Assylbekov
Rustem Takhanov
16
10
0
26 Feb 2019
Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities
O. Ganea
Sylvain Gelly
Gary Bécigneul
Aliaksei Severyn
26
18
0
21 Feb 2019
Ising-Dropout: A Regularization Method for Training and Compression of Deep Neural Networks
Hojjat Salehinejad
S. Valaee
26
30
0
07 Feb 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
36
10
0
30 Jan 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
50
305
0
28 Jan 2019
Semantic Relation Classification via Bidirectional LSTM Networks with Entity-aware Attention using Latent Entity Typing
Joohong Lee
Sang-gyu Seo
Y. Choi
25
116
0
23 Jan 2019
Slim LSTM networks: LSTM_6 and LSTM_C6
Atra Akandeh
F. Salem
16
13
0
18 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
38
3,674
0
09 Jan 2019
Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems
Victor Makarenkov
Lior Rokach
Bracha Shapira
18
35
0
08 Jan 2019
Team EP at TAC 2018: Automating data extraction in systematic reviews of environmental agents
Artur Jacek Nowak
P. Kunstman
9
2
0
07 Jan 2019
FPGA-based Accelerators of Deep Learning Networks for Learning and Classification: A Review
Ahmad Shawahna
S. M. Sait
A. El-Maleh
28
372
0
01 Jan 2019
Learning Private Neural Language Modeling with Attentive Aggregation
Shaoxiong Ji
Shirui Pan
Guodong Long
Xue Li
Jing Jiang
Zi Huang
FedML
MoMe
16
136
0
17 Dec 2018
Parameter Re-Initialization through Cyclical Batch Size Schedules
Norman Mu
Z. Yao
A. Gholami
Kurt Keutzer
Michael W. Mahoney
ODL
30
8
0
04 Dec 2018
Quantifying Uncertainties in Natural Language Processing Tasks
Yijun Xiao
William Yang Wang
UQCV
BDL
32
142
0
18 Nov 2018
Spatio-temporal Stacked LSTM for Temperature Prediction in Weather Forecasting
Zahra Karevan
Johan A. K. Suykens
14
39
0
15 Nov 2018