Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.17335
Cited By
Non-asymptotic Convergence of Training Transformers for Next-token Prediction
25 September 2024
Ruiquan Huang
Yingbin Liang
Jing Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Non-asymptotic Convergence of Training Transformers for Next-token Prediction"
2 / 2 papers shown
Title
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
Ruiquan Huang
Yingbin Liang
Jing Yang
46
0
0
02 May 2025
On the Learn-to-Optimize Capabilities of Transformers in In-Context Sparse Recovery
Renpu Liu
Ruida Zhou
Cong Shen
Jing Yang
23
0
0
17 Oct 2024
1