ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.17174
  4. Cited By
From Attention to Activation: Unravelling the Enigmas of Large Language
  Models

From Attention to Activation: Unravelling the Enigmas of Large Language Models

22 October 2024
Prannay Kaul
Chengcheng Ma
Ismail Elezi
Jiankang Deng
ArXivPDFHTML

Papers citing "From Attention to Activation: Unravelling the Enigmas of Large Language Models"

1 / 1 papers shown
Title
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
24
0
0
29 Apr 2025
1