Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.07153
Cited By
Escaping the Gradient Vanishing: Periodic Alternatives of Softmax in Attention Mechanism
16 August 2021
Shulun Wang
Bin Liu
Feng Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Escaping the Gradient Vanishing: Periodic Alternatives of Softmax in Attention Mechanism"
1 / 1 papers shown
Title
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
Weronika Ormaniec
Felix Dangel
Sidak Pal Singh
33
6
0
14 Oct 2024
1