ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.07959
  4. Cited By
Multiplicative LSTM for sequence modelling
v1v2v3 (latest)

Multiplicative LSTM for sequence modelling

26 September 2016
Ben Krause
Liang Lu
Iain Murray
Steve Renals
ArXiv (abs)PDFHTML

Papers citing "Multiplicative LSTM for sequence modelling"

50 / 77 papers shown
Title
Unified Implementations of Recurrent Neural Networks in Multiple Deep Learning Frameworks
Unified Implementations of Recurrent Neural Networks in Multiple Deep Learning Frameworks
Francesco Martinuzzi
AI4TS
120
0
0
24 Oct 2025
An Explainable Neural Radiomic Sequence Model with Spatiotemporal Continuity for Quantifying 4DCT-based Pulmonary Ventilation
An Explainable Neural Radiomic Sequence Model with Spatiotemporal Continuity for Quantifying 4DCT-based Pulmonary Ventilation
Rihui Zhang
Haiming Zhu
Jingtong Zhao
Lei Zhang
F. Yin
Chunhao Wang
Zhenyu Yang
226
0
0
31 Mar 2025
Machine Learning-Based Estimation Of Wave Direction For Unmanned Surface Vehicles
Machine Learning-Based Estimation Of Wave Direction For Unmanned Surface Vehicles
Manele Ait Habouche
Mickaël Kerboeuf
Goulven Guillou
Jean-Philippe Babau
158
0
0
17 Dec 2024
Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers
Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection LayersbioRxiv (bioRxiv), 2024
Yingheng Wang
Zichen Wang
Gil Sadeh
Luca Zancato
Alessandro Achille
George Karypis
Huzefa Rangwala
303
1
0
29 Oct 2024
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Wei Wu
Chao Wang
L. Chen
Mingze Yin
Yiheng Zhu
Kun Fu
Jieping Ye
Hui Xiong
Zheng Wang
321
3
0
04 Oct 2024
A Tensor Decomposition Perspective on Second-order RNNs
A Tensor Decomposition Perspective on Second-order RNNsInternational Conference on Machine Learning (ICML), 2024
M. Lizaire
Michael Rizvi-Martel
Marawan Gamal Abdel Hameed
Guillaume Rabusseau
238
2
0
07 Jun 2024
Exploiting Hierarchical Interactions for Protein Surface Learning
Exploiting Hierarchical Interactions for Protein Surface LearningIEEE journal of biomedical and health informatics (IEEE JBHI), 2024
Yiqun Lin
Liang Pan
Yi Li
Ziwei Liu
Xiaomeng Li
128
2
0
17 Jan 2024
Contractive error feedback for gradient compression
Contractive error feedback for gradient compression
Bingcong Li
Shuai Zheng
Parameswaran Raman
Anshumali Shrivastava
G. Giannakis
135
0
0
13 Dec 2023
Deep Learning-based Sentiment Classification: A Comparative Survey
Deep Learning-based Sentiment Classification: A Comparative SurveyIEEE Access (IEEE Access), 2023
Mohammed Kayed
R. Redondo
Alhassan Mabrouk
156
45
0
12 Dec 2023
Memory-efficient Stochastic methods for Memory-based Transformers
Memory-efficient Stochastic methods for Memory-based Transformers
Vishwajit Kumar Vishnu
C. Sekhar
78
0
0
14 Nov 2023
A Survey of AI Music Generation Tools and Models
A Survey of AI Music Generation Tools and Models
Yueyue Zhu
Jared Baca
Banafsheh Rekabdar
Reza Rawassizadeh
MGen
223
20
0
24 Aug 2023
A Quantitative Review on Language Model Efficiency Research
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
159
0
0
28 May 2023
Extending Memory for Language Modelling
Extending Memory for Language Modelling
A. Nugaliyadde
KELMCLLVLM
112
1
0
19 May 2023
Protein Language Models and Structure Prediction: Connection and
  Progression
Protein Language Models and Structure Prediction: Connection and Progression
Bozhen Hu
Jun Xia
Jiangbin Zheng
Cheng Tan
Yufei Huang
Yongjie Xu
Stan Z. Li
170
45
0
30 Nov 2022
Attribution and Obfuscation of Neural Text Authorship: A Data Mining
  Perspective
Attribution and Obfuscation of Neural Text Authorship: A Data Mining PerspectiveSIGKDD Explorations (SIGKDD Explor.), 2022
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
253
51
0
19 Oct 2022
An Embarrassingly Simple Approach for Intellectual Property Rights
  Protection on Recurrent Neural Networks
An Embarrassingly Simple Approach for Intellectual Property Rights Protection on Recurrent Neural Networks
Zhi Qin Tan
H. P. Wong
Chee Seng Chan
162
2
0
03 Oct 2022
Gates Are Not What You Need in RNNs
Gates Are Not What You Need in RNNs
Ronalds Zakovskis
Andis Draguns
Eliza Gaile
Emīls Ozoliņš
Kārlis Freivalds
170
1
0
01 Aug 2021
Poly-NL: Linear Complexity Non-local Layers with Polynomials
Poly-NL: Linear Complexity Non-local Layers with Polynomials
F. Babiloni
Ioannis Marras
Filippos Kokkinos
Jiankang Deng
Grigorios G. Chrysos
Stefanos Zafeiriou
103
7
0
06 Jul 2021
Top-KAST: Top-K Always Sparse Training
Top-KAST: Top-K Always Sparse TrainingNeural Information Processing Systems (NeurIPS), 2021
Siddhant M. Jayakumar
Razvan Pascanu
Jack W. Rae
Simon Osindero
Erich Elsen
284
105
0
07 Jun 2021
Recognizing and Verifying Mathematical Equations using Multiplicative
  Differential Neural Units
Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural UnitsAAAI Conference on Artificial Intelligence (AAAI), 2021
A. Mali
Alexander Ororbia
Daniel Kifer
C. Lee Giles
81
16
0
07 Apr 2021
Self-Supervised Test-Time Learning for Reading Comprehension
Self-Supervised Test-Time Learning for Reading ComprehensionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Pratyay Banerjee
Tejas Gokhale
Chitta Baral
SSL
146
31
0
20 Mar 2021
Modeling Multivariate Cyber Risks: Deep Learning Dating Extreme Value
  Theory
Modeling Multivariate Cyber Risks: Deep Learning Dating Extreme Value TheoryJournal of Applied Statistics (J. Appl. Stat.), 2021
Mingyue Zhang Wu
Jinzhu Luo
Xing Fang
Maochao Xu
Peng Zhao
89
12
0
15 Mar 2021
Learning to Generate Music With Sentiment
Learning to Generate Music With SentimentInternational Society for Music Information Retrieval Conference (ISMIR), 2021
Lucas N. Ferreira
E. Whitehead
144
96
0
09 Mar 2021
Long Short Term Memory Networks for Bandwidth Forecasting in Mobile
  Broadband Networks under Mobility
Long Short Term Memory Networks for Bandwidth Forecasting in Mobile Broadband Networks under Mobility
Konstantinos Kousias
A. Pappas
Özgü Alay
A. Argyriou
Michael Riegler
79
1
0
20 Nov 2020
Music Classification in MIDI Format based on LSTM Mdel
Music Classification in MIDI Format based on LSTM Mdel
Yiting Xia
Yiwei Jiang
Tao Ye
MGenVLM
73
1
0
15 Oct 2020
Melody Classification based on Performance Event Vector and BRNN
Melody Classification based on Performance Event Vector and BRNN
Jinyue Guo
Aozhi Liu
Jing Xiao
59
1
0
15 Oct 2020
Sparse Meta Networks for Sequential Adaptation and its Application to
  Adaptive Language Modelling
Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling
Tsendsuren Munkhdalai
CLLOffRL
130
5
0
03 Sep 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
273
29
0
31 Jul 2020
Deep Learning in Protein Structural Modeling and Design
Deep Learning in Protein Structural Modeling and Design
Wenhao Gao
S. Mahajan
Jeremias Sulam
Jeffrey J. Gray
161
178
0
16 Jul 2020
Recognizing Long Grammatical Sequences Using Recurrent Networks
  Augmented With An External Differentiable Stack
Recognizing Long Grammatical Sequences Using Recurrent Networks Augmented With An External Differentiable StackInternational Conference on Graphics and Interaction (GI), 2020
A. Mali
Alexander Ororbia
Daniel Kifer
C. Lee Giles
155
14
0
04 Apr 2020
Encoding word order in complex embeddings
Encoding word order in complex embeddingsInternational Conference on Learning Representations (ICLR), 2019
Benyou Wang
Donghao Zhao
Christina Lioma
Qiuchi Li
Peng Zhang
J. Simonsen
207
124
0
27 Dec 2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit
  Selection
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
Guangxiang Zhao
Junyang Lin
Zhiyuan Zhang
Xuancheng Ren
Qi Su
Xu Sun
138
133
0
25 Dec 2019
Pre-Training of Deep Bidirectional Protein Sequence Representations with
  Structural Information
Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural InformationIEEE Access (IEEE Access), 2019
Seonwoo Min
Seunghyun Park
Siwon Kim
Hyun-Soo Choi
Byunghan Lee
Sungroh Yoon
SSL
277
63
0
25 Nov 2019
Multi-Zone Unit for Recurrent Neural Networks
Multi-Zone Unit for Recurrent Neural NetworksAAAI Conference on Artificial Intelligence (AAAI), 2019
Fandong Meng
Jinchao Zhang
Yang Liu
Jie Zhou
AI4CE
128
2
0
17 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Compressive Transformers for Long-Range Sequence ModellingInternational Conference on Learning Representations (ICLR), 2019
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALMVLMKELM
255
755
0
13 Nov 2019
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
Zihao Ye
Qipeng Guo
Quan Gan
Xipeng Qiu
Zheng Zhang
186
84
0
11 Nov 2019
Stabilizing Transformers for Reinforcement Learning
Stabilizing Transformers for Reinforcement LearningInternational Conference on Machine Learning (ICML), 2019
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
250
421
0
13 Oct 2019
A Neural Virtual Anchor Synthesizer based on Seq2Seq and GAN Models
A Neural Virtual Anchor Synthesizer based on Seq2Seq and GAN Models
Ning Wang
Zhaoxiang Liu
Zezhou Chen
Huan Hu
Kai Wang
CVBM
139
9
0
20 Aug 2019
Generating Sentiment-Preserving Fake Online Reviews Using Neural
  Language Models and Their Human- and Machine-based Detection
Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based DetectionInternational Conference on Advanced Information Networking and Applications (AINA), 2019
David Ifeoluwa Adelani
H. Mai
Fuming Fang
H. Nguyen
Junichi Yamagishi
Isao Echizen
DeLMO
253
134
0
22 Jul 2019
A Scalable Framework for Multilevel Streaming Data Analytics using Deep
  Learning
A Scalable Framework for Multilevel Streaming Data Analytics using Deep LearningAnnual International Computer Software and Applications Conference (COMPSAC), 2019
Shihao Ge
Haruna Isah
F. Zulkernine
Shahzad Khan
211
14
0
15 Jul 2019
Augmenting Self-attention with Persistent Memory
Augmenting Self-attention with Persistent Memory
Sainbayar Sukhbaatar
Edouard Grave
Guillaume Lample
Edouard Grave
Armand Joulin
RALMKELM
153
149
0
02 Jul 2019
Inter and Intra Document Attention for Depression Risk Assessment
Inter and Intra Document Attention for Depression Risk Assessment
Diego Maupomé
Marc Queudot
Marie-Jean Meurs
44
7
0
30 Jun 2019
Multiplicative Models for Recurrent Language Modeling
Multiplicative Models for Recurrent Language ModelingConference on Intelligent Text Processing and Computational Linguistics (CICLing), 2019
Diego Maupomé
Marie-Jean Meurs
KELM
78
1
0
30 Jun 2019
Evaluating Protein Transfer Learning with TAPE
Evaluating Protein Transfer Learning with TAPEbioRxiv (bioRxiv), 2019
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
170
901
0
19 Jun 2019
Dynamic Evaluation of Transformer Language Models
Dynamic Evaluation of Transformer Language Models
Ben Krause
Emmanuel Kahembwe
Iain Murray
Steve Renals
176
45
0
17 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
216
27
0
09 Apr 2019
Incorporating End-to-End Speech Recognition Models for Sentiment
  Analysis
Incorporating End-to-End Speech Recognition Models for Sentiment AnalysisIEEE International Conference on Robotics and Automation (ICRA), 2019
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
142
24
0
28 Feb 2019
Compressing Gradient Optimizers via Count-Sketches
Compressing Gradient Optimizers via Count-SketchesInternational Conference on Machine Learning (ICML), 2019
Ryan Spring
Anastasios Kyrillidis
Vijai Mohan
Anshumali Shrivastava
132
37
0
01 Feb 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
580
4,071
0
09 Jan 2019
Deep Online Learning via Meta-Learning: Continual Adaptation for
  Model-Based RL
Deep Online Learning via Meta-Learning: Continual Adaptation for Model-Based RL
Anusha Nagabandi
Chelsea Finn
Sergey Levine
OffRLCLL
187
200
0
18 Dec 2018
12
Next