arXiv: 2403.09635
Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models
International Conference on Machine Learning (ICML), 2024
14 March 2024
Authors: Akhil Kedia, Mohd Abbas Zaidi, Sushil Khyalia, Jungho Jung, Harshith Goka, Haejun Lee
Links: arXiv (abs) · PDF · HTML · HuggingFace (1 upvote) · GitHub (10★)
Papers citing "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" (4 of 4 papers shown)
Normalization in Attention Dynamics
Nikita Karagodin, Shu Ge, Yury Polyanskiy, Philippe Rigollet
24 Oct 2025 · 274 · 3 · 0
Stability of Transformers under Layer Normalization
Kelvin Kan, Xingjian Li, Benjamin J. Zhang, Tuhin Sahai, Stanley Osher, Krishna Kumar, Markos A. Katsoulakis
10 Oct 2025 · 178 · 3 · 0
Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution
Suvadeep Hajra
21 May 2025 · 323 · 1 · 0
Don't be lazy: CompleteP enables compute-efficient deep transformers
Nolan Dey, Bin Claire Zhang, Lorenzo Noci, Mufan Li, Blake Bordelon, Shane Bergsma, Cengiz Pehlevan, Boris Hanin, Joel Hestness
02 May 2025 · 702 · 38 · 0