Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.15850
Cited By
Stochastic Modified Equations and Dynamics of Dropout Algorithm
25 May 2023
Zhongwang Zhang
Yuqing Li
Yaoyu Zhang
Z. Xu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stochastic Modified Equations and Dynamics of Dropout Algorithm"
6 / 6 papers shown
Title
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
41
0
0
29 May 2025
An overview of condensation phenomenon in deep learning
Zhi-Qin John Xu
Yaoyu Zhang
Zhangchen Zhou
AI4CE
66
4
0
13 Apr 2025
Reasoning Bias of Next Token Prediction Training
Pengxiao Lin
Zhongwang Zhang
Zhi-Qin John Xu
LRM
191
2
0
21 Feb 2025
Spring-block theory of feature learning in deep neural networks
Chengzhi Shi
Liming Pan
Ivan Dokmanić
AI4CE
132
1
0
28 Jul 2024
Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or Memorizing
Zhongwang Zhang
Pengxiao Lin
Zhiwei Wang
Yaoyu Zhang
Z. Xu
129
5
0
08 May 2024
Implicit regularization of dropout
Zhongwang Zhang
Zhi-Qin John Xu
70
29
0
13 Jul 2022
1