Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.17174
Cited By
From Attention to Activation: Unravelling the Enigmas of Large Language Models
22 October 2024
Prannay Kaul
Chengcheng Ma
Ismail Elezi
Jiankang Deng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"From Attention to Activation: Unravelling the Enigmas of Large Language Models"
1 / 1 papers shown
Title
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
29
0
0
29 Apr 2025
1