Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2001.07384
Cited By
v1
v2 (latest)
Understanding Why Neural Networks Generalize Well Through GSNR of Parameters
International Conference on Learning Representations (ICLR), 2020
21 January 2020
Jinlong Liu
Guo-qing Jiang
Yunzhi Bai
Ting Chen
Huayan Wang
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Understanding Why Neural Networks Generalize Well Through GSNR of Parameters"
30 / 30 papers shown
GRADSTOP: Early Stopping of Gradient Descent via Posterior Sampling
Arash Jamshidi
Lauri Seppäläinen
Katsiaryna Haitsiukevich
Hoang Phuc Hau Luu
Anton Björklund
Kai Puolamäki
BDL
221
0
0
26 Aug 2025
SGD as Free Energy Minimization: A Thermodynamic View on Neural Network Training
Ildus Sadrtdinov
Ivan Klimov
E. Lobacheva
Dmitry Vetrov
290
3
0
29 May 2025
DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Haiduo Huang
Jiangcheng Song
Yadong Zhang
Pengju Ren
439
0
0
21 May 2025
CGLearn: Consistent Gradient-Based Learning for Out-of-Distribution Generalization
International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2024
Jawad Chowdhury
G. Terejanu
AI4CE
BDL
OOD
OODD
429
1
0
09 Nov 2024
Adversarial Vulnerability as a Consequence of On-Manifold Inseparibility
Rajdeep Haldar
Yue Xing
Qifan Song
Guang Lin
265
0
0
09 Oct 2024
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yongheng Zhang
Qiguang Chen
Jingxuan Zhou
Peng Wang
Jiasheng Si
Jin Wang
Wenpeng Lu
Libo Qin
LRM
398
13
0
06 Oct 2024
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
Zixuan Li
Jing Xiong
Fanghua Ye
Chuanyang Zheng
Xun Wu
...
Xiaodan Liang
Chengming Li
Zhenan Sun
Lingpeng Kong
Ngai Wong
RALM
UQLM
421
9
0
03 Oct 2024
Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models
Yuji Zhang
Sha Li
Jiateng Liu
Pengfei Yu
Yi R. Fung
Jing Li
Pengfei Yu
Heng Ji
426
29
0
10 Jul 2024
Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural Networks
Amit Peleg
Matthias Hein
407
0
0
04 Jul 2024
Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability
Rajdeep Haldar
Yue Xing
Qifan Song
368
6
0
06 Mar 2024
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning
International Journal of Computer Vision (IJCV), 2024
Baoyuan Wu
Hongrui Chen
Ruotong Wang
Zihao Zhu
Shaokui Wei
Danni Yuan
Mingli Zhu
Ke Xu
Li Liu
Chaoxiao Shen
AAML
ELM
355
25
0
26 Jan 2024
Domain Generalization Guided by Gradient Signal to Noise Ratio of Parameters
IEEE International Conference on Computer Vision (ICCV), 2023
Mateusz Michalkiewicz
M. Faraki
Xiang Yu
Manmohan Chandraker
Mahsa Baktash
382
9
0
11 Oct 2023
Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)
Guo-qing Jiang
Jinlong Liu
Zixiang Ding
Lin Guo
W. Lin
AI4CE
253
2
0
24 Sep 2023
Gradient Mask: Lateral Inhibition Mechanism Improves Performance in Artificial Neural Networks
Lei Jiang
Yongqing Liu
Shihai Xiao
Yansong Chua
223
1
0
14 Aug 2022
Deep Learning and Symbolic Regression for Discovering Parametric Equations
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Michael Zhang
Samuel Kim
Peter Y. Lu
M. Soljavcić
332
39
0
01 Jul 2022
BackdoorBench: A Comprehensive Benchmark of Backdoor Learning
Neural Information Processing Systems (NeurIPS), 2022
Baoyuan Wu
Hongrui Chen
Ruotong Wang
Zihao Zhu
Shaokui Wei
Danni Yuan
Chaoxiao Shen
ELM
AAML
374
213
0
25 Jun 2022
Exploiting Explainable Metrics for Augmented SGD
Computer Vision and Pattern Recognition (CVPR), 2022
Mahdi S. Hosseini
Mathieu Tuli
Konstantinos N. Plataniotis
AAML
318
3
0
31 Mar 2022
On the Generalization Mystery in Deep Learning
S. Chatterjee
Piotr Zielinski
OOD
355
47
0
18 Mar 2022
Confidence Dimension for Deep Learning based on Hoeffding Inequality and Relative Evaluation
Runqi Wang
Linlin Yang
Baochang Zhang
Wentao Zhu
David Doermann
Guodong Guo
183
1
0
17 Mar 2022
Dataset Condensation with Contrastive Signals
International Conference on Machine Learning (ICML), 2022
Saehyung Lee
Sanghyuk Chun
Sangwon Jung
Sangdoo Yun
Sung-Hoon Yoon
DD
402
137
0
07 Feb 2022
In Search of Probeable Generalization Measures
International Conference on Machine Learning and Applications (ICMLA), 2021
Jonathan Jaegerman
Khalil Damouni
M. M. Ankaralı
Konstantinos N. Plataniotis
231
2
0
23 Oct 2021
Taxonomizing local versus global structure in neural network loss landscapes
Neural Information Processing Systems (NeurIPS), 2021
Yaoqing Yang
Liam Hodgkinson
Ryan Theisen
Joe Zou
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
410
46
0
23 Jul 2021
Disparity Between Batches as a Signal for Early Stopping
Mahsa Forouzesh
Patrick Thiran
370
12
0
14 Jul 2021
Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks
Neural Information Processing Systems (NeurIPS), 2021
Frank Schneider
Felix Dangel
Philipp Hennig
273
13
0
12 Feb 2021
The training accuracy of two-layer neural networks: its estimation and understanding using random datasets
International Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2020
Shuyue Guan
Murray H. Loew
166
0
0
26 Oct 2020
Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Wenxiang Jiao
Xing Wang
Shilin He
Irwin King
Michael R. Lyu
Zhaopeng Tu
222
26
0
06 Oct 2020
Analysis of Generalizability of Deep Neural Networks Based on the Complexity of Decision Boundary
International Conference on Machine Learning and Applications (ICMLA), 2020
Shuyue Guan
Murray H. Loew
301
37
0
16 Sep 2020
A Vision-based Social Distancing and Critical Density Detection System for COVID-19
Dongfang Yang
Ekim Yurtsever
Vishnu Renganathan
Keith A. Redmill
Ü. Özgüner
259
171
0
07 Jul 2020
Dynamic of Stochastic Gradient Descent with State-Dependent Noise
Qi Meng
Shiqi Gong
Wei Chen
Zhi-Ming Ma
Tie-Yan Liu
380
16
0
24 Jun 2020
The Break-Even Point on Optimization Trajectories of Deep Neural Networks
International Conference on Learning Representations (ICLR), 2020
Stanislaw Jastrzebski
Maciej Szymczak
Stanislav Fort
Devansh Arpit
Jacek Tabor
Dong Wang
Krzysztof J. Geras
332
195
0
21 Feb 2020
1
Page 1 of 1