Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2602.01203
Cited By
Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse
1 February 2026
Zizhuo Fu
Wenxuan Zeng
Runsheng Wang
Meng Li
MoE
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse"
0 / 0 papers shown
No papers found
Page 1 of 0