Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.10794
Cited By
A mathematical perspective on Transformers
17 December 2023
Borjan Geshkovski
Cyril Letrouit
Yury Polyanskiy
Philippe Rigollet
EDL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A mathematical perspective on Transformers"
8 / 8 papers shown
Title
Dual Filter: A Mathematical Framework for Inference using Transformer-like Architectures
Heng-Sheng Chang
P. Mehta
34
0
0
01 May 2025
Quantitative Clustering in Mean-Field Transformer Models
Shi Chen
Zhengjiang Lin
Yury Polyanskiy
Philippe Rigollet
26
0
0
20 Apr 2025
OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization
Kelvin Kan
Xingjian Li
Stanley Osher
89
2
0
30 Jan 2025
Emergence of meta-stable clustering in mean-field transformer models
Giuseppe Bruno
Federico Pasqualotto
Andrea Agazzi
39
6
0
30 Oct 2024
Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
Lorenzo Tiberi
Francesca Mignacco
Kazuki Irie
H. Sompolinsky
34
5
0
24 May 2024
Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems
Subhabrata Dutta
Tanya Gautam
Soumen Chakrabarti
Tanmoy Chakraborty
41
15
0
30 Sep 2021
A Class of Dimension-free Metrics for the Convergence of Empirical Measures
Jiequn Han
Ruimeng Hu
Jihao Long
14
3
0
24 Apr 2021
Trainability and Accuracy of Neural Networks: An Interacting Particle System Approach
Grant M. Rotskoff
Eric Vanden-Eijnden
56
114
0
02 May 2018
1