Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.13835
Cited By
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
17 October 2024
Tianyu Guo
Druv Pai
Yu Bai
Jiantao Jiao
Michael I. Jordan
Song Mei
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
2 / 2 papers shown
Title
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
24
0
0
29 Apr 2025
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Prateek Chhikara
Dev Khant
Saket Aryan
Taranjeet Singh
Deshraj Yadav
LLMAG
RALM
52
0
0
28 Apr 2025
1