Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1609.07959
Cited By
v1
v2
v3 (latest)
Multiplicative LSTM for sequence modelling
26 September 2016
Ben Krause
Liang Lu
Iain Murray
Steve Renals
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multiplicative LSTM for sequence modelling"
50 / 77 papers shown
Title
Unified Implementations of Recurrent Neural Networks in Multiple Deep Learning Frameworks
Francesco Martinuzzi
AI4TS
120
0
0
24 Oct 2025
An Explainable Neural Radiomic Sequence Model with Spatiotemporal Continuity for Quantifying 4DCT-based Pulmonary Ventilation
Rihui Zhang
Haiming Zhu
Jingtong Zhao
Lei Zhang
F. Yin
Chunhao Wang
Zhenyu Yang
226
0
0
31 Mar 2025
Machine Learning-Based Estimation Of Wave Direction For Unmanned Surface Vehicles
Manele Ait Habouche
Mickaël Kerboeuf
Goulven Guillou
Jean-Philippe Babau
158
0
0
17 Dec 2024
Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers
bioRxiv (bioRxiv), 2024
Yingheng Wang
Zichen Wang
Gil Sadeh
Luca Zancato
Alessandro Achille
George Karypis
Huzefa Rangwala
303
1
0
29 Oct 2024
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Wei Wu
Chao Wang
L. Chen
Mingze Yin
Yiheng Zhu
Kun Fu
Jieping Ye
Hui Xiong
Zheng Wang
321
3
0
04 Oct 2024
A Tensor Decomposition Perspective on Second-order RNNs
International Conference on Machine Learning (ICML), 2024
M. Lizaire
Michael Rizvi-Martel
Marawan Gamal Abdel Hameed
Guillaume Rabusseau
238
2
0
07 Jun 2024
Exploiting Hierarchical Interactions for Protein Surface Learning
IEEE journal of biomedical and health informatics (IEEE JBHI), 2024
Yiqun Lin
Liang Pan
Yi Li
Ziwei Liu
Xiaomeng Li
128
2
0
17 Jan 2024
Contractive error feedback for gradient compression
Bingcong Li
Shuai Zheng
Parameswaran Raman
Anshumali Shrivastava
G. Giannakis
135
0
0
13 Dec 2023
Deep Learning-based Sentiment Classification: A Comparative Survey
IEEE Access (IEEE Access), 2023
Mohammed Kayed
R. Redondo
Alhassan Mabrouk
156
45
0
12 Dec 2023
Memory-efficient Stochastic methods for Memory-based Transformers
Vishwajit Kumar Vishnu
C. Sekhar
78
0
0
14 Nov 2023
A Survey of AI Music Generation Tools and Models
Yueyue Zhu
Jared Baca
Banafsheh Rekabdar
Reza Rawassizadeh
MGen
223
20
0
24 Aug 2023
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
159
0
0
28 May 2023
Extending Memory for Language Modelling
A. Nugaliyadde
KELM
CLL
VLM
112
1
0
19 May 2023
Protein Language Models and Structure Prediction: Connection and Progression
Bozhen Hu
Jun Xia
Jiangbin Zheng
Cheng Tan
Yufei Huang
Yongjie Xu
Stan Z. Li
170
45
0
30 Nov 2022
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective
SIGKDD Explorations (SIGKDD Explor.), 2022
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
253
51
0
19 Oct 2022
An Embarrassingly Simple Approach for Intellectual Property Rights Protection on Recurrent Neural Networks
Zhi Qin Tan
H. P. Wong
Chee Seng Chan
162
2
0
03 Oct 2022
Gates Are Not What You Need in RNNs
Ronalds Zakovskis
Andis Draguns
Eliza Gaile
Emīls Ozoliņš
Kārlis Freivalds
170
1
0
01 Aug 2021
Poly-NL: Linear Complexity Non-local Layers with Polynomials
F. Babiloni
Ioannis Marras
Filippos Kokkinos
Jiankang Deng
Grigorios G. Chrysos
Stefanos Zafeiriou
103
7
0
06 Jul 2021
Top-KAST: Top-K Always Sparse Training
Neural Information Processing Systems (NeurIPS), 2021
Siddhant M. Jayakumar
Razvan Pascanu
Jack W. Rae
Simon Osindero
Erich Elsen
284
105
0
07 Jun 2021
Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units
AAAI Conference on Artificial Intelligence (AAAI), 2021
A. Mali
Alexander Ororbia
Daniel Kifer
C. Lee Giles
81
16
0
07 Apr 2021
Self-Supervised Test-Time Learning for Reading Comprehension
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Pratyay Banerjee
Tejas Gokhale
Chitta Baral
SSL
146
31
0
20 Mar 2021
Modeling Multivariate Cyber Risks: Deep Learning Dating Extreme Value Theory
Journal of Applied Statistics (J. Appl. Stat.), 2021
Mingyue Zhang Wu
Jinzhu Luo
Xing Fang
Maochao Xu
Peng Zhao
89
12
0
15 Mar 2021
Learning to Generate Music With Sentiment
International Society for Music Information Retrieval Conference (ISMIR), 2021
Lucas N. Ferreira
E. Whitehead
144
96
0
09 Mar 2021
Long Short Term Memory Networks for Bandwidth Forecasting in Mobile Broadband Networks under Mobility
Konstantinos Kousias
A. Pappas
Özgü Alay
A. Argyriou
Michael Riegler
79
1
0
20 Nov 2020
Music Classification in MIDI Format based on LSTM Mdel
Yiting Xia
Yiwei Jiang
Tao Ye
MGen
VLM
73
1
0
15 Oct 2020
Melody Classification based on Performance Event Vector and BRNN
Jinyue Guo
Aozhi Liu
Jing Xiao
59
1
0
15 Oct 2020
Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling
Tsendsuren Munkhdalai
CLL
OffRL
130
5
0
03 Sep 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
273
29
0
31 Jul 2020
Deep Learning in Protein Structural Modeling and Design
Wenhao Gao
S. Mahajan
Jeremias Sulam
Jeffrey J. Gray
161
178
0
16 Jul 2020
Recognizing Long Grammatical Sequences Using Recurrent Networks Augmented With An External Differentiable Stack
International Conference on Graphics and Interaction (GI), 2020
A. Mali
Alexander Ororbia
Daniel Kifer
C. Lee Giles
155
14
0
04 Apr 2020
Encoding word order in complex embeddings
International Conference on Learning Representations (ICLR), 2019
Benyou Wang
Donghao Zhao
Christina Lioma
Qiuchi Li
Peng Zhang
J. Simonsen
207
124
0
27 Dec 2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
Guangxiang Zhao
Junyang Lin
Zhiyuan Zhang
Xuancheng Ren
Qi Su
Xu Sun
138
133
0
25 Dec 2019
Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information
IEEE Access (IEEE Access), 2019
Seonwoo Min
Seunghyun Park
Siwon Kim
Hyun-Soo Choi
Byunghan Lee
Sungroh Yoon
SSL
277
63
0
25 Nov 2019
Multi-Zone Unit for Recurrent Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2019
Fandong Meng
Jinchao Zhang
Yang Liu
Jie Zhou
AI4CE
128
2
0
17 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
International Conference on Learning Representations (ICLR), 2019
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
255
755
0
13 Nov 2019
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
Zihao Ye
Qipeng Guo
Quan Gan
Xipeng Qiu
Zheng Zhang
186
84
0
11 Nov 2019
Stabilizing Transformers for Reinforcement Learning
International Conference on Machine Learning (ICML), 2019
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
250
421
0
13 Oct 2019
A Neural Virtual Anchor Synthesizer based on Seq2Seq and GAN Models
Ning Wang
Zhaoxiang Liu
Zezhou Chen
Huan Hu
Kai Wang
CVBM
139
9
0
20 Aug 2019
Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection
International Conference on Advanced Information Networking and Applications (AINA), 2019
David Ifeoluwa Adelani
H. Mai
Fuming Fang
H. Nguyen
Junichi Yamagishi
Isao Echizen
DeLMO
253
134
0
22 Jul 2019
A Scalable Framework for Multilevel Streaming Data Analytics using Deep Learning
Annual International Computer Software and Applications Conference (COMPSAC), 2019
Shihao Ge
Haruna Isah
F. Zulkernine
Shahzad Khan
211
14
0
15 Jul 2019
Augmenting Self-attention with Persistent Memory
Sainbayar Sukhbaatar
Edouard Grave
Guillaume Lample
Edouard Grave
Armand Joulin
RALM
KELM
153
149
0
02 Jul 2019
Inter and Intra Document Attention for Depression Risk Assessment
Diego Maupomé
Marc Queudot
Marie-Jean Meurs
44
7
0
30 Jun 2019
Multiplicative Models for Recurrent Language Modeling
Conference on Intelligent Text Processing and Computational Linguistics (CICLing), 2019
Diego Maupomé
Marie-Jean Meurs
KELM
78
1
0
30 Jun 2019
Evaluating Protein Transfer Learning with TAPE
bioRxiv (bioRxiv), 2019
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
170
901
0
19 Jun 2019
Dynamic Evaluation of Transformer Language Models
Ben Krause
Emmanuel Kahembwe
Iain Murray
Steve Renals
176
45
0
17 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
216
27
0
09 Apr 2019
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
IEEE International Conference on Robotics and Automation (ICRA), 2019
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
142
24
0
28 Feb 2019
Compressing Gradient Optimizers via Count-Sketches
International Conference on Machine Learning (ICML), 2019
Ryan Spring
Anastasios Kyrillidis
Vijai Mohan
Anshumali Shrivastava
132
37
0
01 Feb 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
580
4,071
0
09 Jan 2019
Deep Online Learning via Meta-Learning: Continual Adaptation for Model-Based RL
Anusha Nagabandi
Chelsea Finn
Sergey Levine
OffRL
CLL
187
200
0
18 Dec 2018
1
2
Next