Understanding Why Neural Networks Generalize Well Through GSNR of Parameters

International Conference on Learning Representations (ICLR), 2020
21 January 2020
Jinlong Liu
Guo-qing Jiang
Yunzhi Bai
Ting Chen
Huayan Wang
    AI4CE

Papers citing "Understanding Why Neural Networks Generalize Well Through GSNR of Parameters"

30 / 30 papers shown
GRADSTOP: Early Stopping of Gradient Descent via Posterior Sampling
Arash Jamshidi
Lauri Seppäläinen
Katsiaryna Haitsiukevich
Hoang Phuc Hau Luu
Anton Björklund
Kai Puolamäki
BDL
221
0
0
26 Aug 2025
SGD as Free Energy Minimization: A Thermodynamic View on Neural Network Training
Ildus Sadrtdinov
Ivan Klimov
E. Lobacheva
Dmitry Vetrov
290
3
0
29 May 2025
DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Haiduo Huang
Jiangcheng Song
Yadong Zhang
Pengju Ren
439
0
0
21 May 2025
CGLearn: Consistent Gradient-Based Learning for Out-of-Distribution Generalization
International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2024
Jawad Chowdhury
G. Terejanu
AI4CE BDL OOD OODD
429
1
0
09 Nov 2024
Adversarial Vulnerability as a Consequence of On-Manifold Inseparibility
Rajdeep Haldar
Yue Xing
Qifan Song
Guang Lin
265
0
0
09 Oct 2024
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yongheng Zhang
Qiguang Chen
Jingxuan Zhou
Peng Wang
Jiasheng Si
Jin Wang
Wenpeng Lu
Libo Qin
LRM
398
13
0
06 Oct 2024
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
Zixuan Li
Jing Xiong
Fanghua Ye
Chuanyang Zheng
Xun Wu
...
Xiaodan Liang
Chengming Li
Zhenan Sun
Lingpeng Kong
Ngai Wong
RALM UQLM
421
9
0
03 Oct 2024
Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models
Yuji Zhang
Sha Li
Jiateng Liu
Pengfei Yu
Yi R. Fung
Jing Li
Manling Li
Heng Ji
426
29
0
10 Jul 2024
Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural Networks
Amit Peleg
Matthias Hein
407
0
0
04 Jul 2024
Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability
Rajdeep Haldar
Yue Xing
Qifan Song
368
6
0
06 Mar 2024
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning
International Journal of Computer Vision (IJCV), 2024
Baoyuan Wu
Hongrui Chen
Ruotong Wang
Zihao Zhu
Shaokui Wei
Danni Yuan
Mingli Zhu
Ke Xu
Li Liu
Chaoxiao Shen
AAML ELM
355
25
0
26 Jan 2024
Domain Generalization Guided by Gradient Signal to Noise Ratio of Parameters
IEEE International Conference on Computer Vision (ICCV), 2023
Mateusz Michalkiewicz
M. Faraki
Xiang Yu
Manmohan Chandraker
Mahsa Baktash
382
9
0
11 Oct 2023
Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)
Guo-qing Jiang
Jinlong Liu
Zixiang Ding
Lin Guo
W. Lin
AI4CE
253
2
0
24 Sep 2023
Gradient Mask: Lateral Inhibition Mechanism Improves Performance in Artificial Neural Networks
Lei Jiang
Yongqing Liu
Shihai Xiao
Yansong Chua
223
1
0
14 Aug 2022
Deep Learning and Symbolic Regression for Discovering Parametric Equations
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Michael Zhang
Samuel Kim
Peter Y. Lu
M. Soljačić
332
39
0
01 Jul 2022
BackdoorBench: A Comprehensive Benchmark of Backdoor Learning
Neural Information Processing Systems (NeurIPS), 2022
Baoyuan Wu
Hongrui Chen
Ruotong Wang
Zihao Zhu
Shaokui Wei
Danni Yuan
Chaoxiao Shen
ELM AAML
374
213
0
25 Jun 2022
Exploiting Explainable Metrics for Augmented SGD
Computer Vision and Pattern Recognition (CVPR), 2022
Mahdi S. Hosseini
Mathieu Tuli
Konstantinos N. Plataniotis
AAML
318
3
0
31 Mar 2022
On the Generalization Mystery in Deep Learning
S. Chatterjee
Piotr Zielinski
OOD
355
47
0
18 Mar 2022
Confidence Dimension for Deep Learning based on Hoeffding Inequality and Relative Evaluation
Runqi Wang
Linlin Yang
Baochang Zhang
Wentao Zhu
David Doermann
Guodong Guo
183
1
0
17 Mar 2022
Dataset Condensation with Contrastive Signals
International Conference on Machine Learning (ICML), 2022
Saehyung Lee
Sanghyuk Chun
Sangwon Jung
Sangdoo Yun
Sung-Hoon Yoon
DD
402
137
0
07 Feb 2022
In Search of Probeable Generalization Measures
International Conference on Machine Learning and Applications (ICMLA), 2021
Jonathan Jaegerman
Khalil Damouni
M. M. Ankaralı
Konstantinos N. Plataniotis
231
2
0
23 Oct 2021
Taxonomizing local versus global structure in neural network loss landscapes
Neural Information Processing Systems (NeurIPS), 2021
Yaoqing Yang
Liam Hodgkinson
Ryan Theisen
Joe Zou
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
410
46
0
23 Jul 2021
Disparity Between Batches as a Signal for Early Stopping
Mahsa Forouzesh
Patrick Thiran
370
12
0
14 Jul 2021
Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks
Neural Information Processing Systems (NeurIPS), 2021
Frank Schneider
Felix Dangel
Philipp Hennig
273
13
0
12 Feb 2021
The training accuracy of two-layer neural networks: its estimation and understanding using random datasets
International Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2020
Shuyue Guan
Murray H. Loew
166
0
0
26 Oct 2020
Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Wenxiang Jiao
Xing Wang
Shilin He
Irwin King
Michael R. Lyu
Zhaopeng Tu
222
26
0
06 Oct 2020
Analysis of Generalizability of Deep Neural Networks Based on the Complexity of Decision Boundary
International Conference on Machine Learning and Applications (ICMLA), 2020
Shuyue Guan
Murray H. Loew
301
37
0
16 Sep 2020
A Vision-based Social Distancing and Critical Density Detection System for COVID-19
Dongfang Yang
Ekim Yurtsever
Vishnu Renganathan
Keith A. Redmill
Ü. Özgüner
259
171
0
07 Jul 2020
Dynamic of Stochastic Gradient Descent with State-Dependent Noise
Qi Meng
Shiqi Gong
Wei Chen
Zhi-Ming Ma
Tie-Yan Liu
380
16
0
24 Jun 2020
The Break-Even Point on Optimization Trajectories of Deep Neural Networks
International Conference on Learning Representations (ICLR), 2020
Stanislaw Jastrzebski
Maciej Szymczak
Stanislav Fort
Devansh Arpit
Jacek Tabor
Dong Wang
Krzysztof J. Geras
332
195
0
21 Feb 2020