Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.21064
Cited By
Recurrent neural networks: vanishing and exploding gradients are not the end of the story
31 May 2024
Nicolas Zucchet
Antonio Orvieto
ODL
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recurrent neural networks: vanishing and exploding gradients are not the end of the story"
2 / 2 papers shown
Title
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Soham De
Samuel L. Smith
Anushan Fernando
Aleksandar Botev
George-Christian Muraru
...
David Budden
Yee Whye Teh
Razvan Pascanu
Nando de Freitas
Çağlar Gülçehre
Mamba
51
116
0
29 Feb 2024
Resurrecting Recurrent Neural Networks for Long Sequences
Antonio Orvieto
Samuel L. Smith
Albert Gu
Anushan Fernando
Çağlar Gülçehre
Razvan Pascanu
Soham De
83
258
0
11 Mar 2023
1