ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.08947
  4. Cited By
Exploring Generalization in Deep Learning

Exploring Generalization in Deep Learning

27 June 2017
Behnam Neyshabur
Srinadh Bhojanapalli
David A. McAllester
Nathan Srebro
    FAtt
ArXivPDFHTML

Papers citing "Exploring Generalization in Deep Learning"

50 / 299 papers shown
Title
Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks
Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks
Juyoung Yun
38
0
0
05 May 2025
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
Ashkan Shakarami
Yousef Yeganeh
Azade Farshad
Lorenzo Nicolè
Stefano Ghidoni
Nassir Navab
52
0
0
21 Apr 2025
Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions
Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions
Tahmid Hasan Prato
Seijoon Kim
Lizhong Chen
Sanghyun Hong
AAML
35
0
0
02 Apr 2025
DGSAM: Domain Generalization via Individual Sharpness-Aware Minimization
DGSAM: Domain Generalization via Individual Sharpness-Aware Minimization
Youngjun Song
Youngsik Hwang
Jonghun Lee
Heechang Lee
Dong-Young Lim
AAML
49
0
0
30 Mar 2025
ZeroLM: Data-Free Transformer Architecture Search for Language Models
ZeroLM: Data-Free Transformer Architecture Search for Language Models
Zhen-Song Chen
Hong-Wei Ding
Xian-Jia Wang
Witold Pedrycz
53
0
0
24 Mar 2025
Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers
Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers
Gaojie Jin
Tianjin Huang
Ronghui Mu
Xiaowei Huang
AAML
46
0
0
21 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
X. Zhang
Yue Shang
Ge Zhang
AI4CE
63
0
0
17 Mar 2025
Analyzing the Role of Permutation Invariance in Linear Mode Connectivity
Keyao Zhan
Puheng Li
Lei Wu
MoMe
82
0
0
13 Mar 2025
Hamiltonian Neural Networks for Robust Out-of-Time Credit Scoring
Hamiltonian Neural Networks for Robust Out-of-Time Credit Scoring
Javier Marín
78
0
0
13 Mar 2025
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability
Lijia Yu
Yibo Miao
Yifan Zhu
Xiao-Shan Gao
Lijun Zhang
48
0
0
06 Mar 2025
Sharpness-Aware Minimization: General Analysis and Improved Rates
Dimitris Oikonomou
Nicolas Loizou
65
0
0
04 Mar 2025
A Near Complete Nonasymptotic Generalization Theory For Multilayer Neural Networks: Beyond the Bias-Variance Tradeoff
Hao Yu
Xiangyang Ji
AI4CE
58
0
0
03 Mar 2025
SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation
SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation
Dahun Shin
Dongyeop Lee
Jinseok Chung
Namhoon Lee
ODL
AAML
177
0
0
25 Feb 2025
Unveiling Mode Connectivity in Graph Neural Networks
Unveiling Mode Connectivity in Graph Neural Networks
Bingheng Li
Z. Chen
Haoyu Han
Shenglai Zeng
J. Liu
Jiliang Tang
48
0
0
18 Feb 2025
On Space Folds of ReLU Neural Networks
On Space Folds of ReLU Neural Networks
Michal Lewandowski
Hamid Eghbalzadeh
Bernhard Heinzl
Raphael Pisoni
Bernhard A.Moser
MLT
78
1
0
17 Feb 2025
Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning
Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning
Rémy Hosseinkhan Boucher
Onofrio Semeraro
L. Mathelin
74
0
0
28 Jan 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou
Suvrit Sra
A. Yurtsever
29
0
0
28 Jan 2025
Enhancing Robust Fairness via Confusional Spectral Regularization
Enhancing Robust Fairness via Confusional Spectral Regularization
Gaojie Jin
Sihao Wu
Jiaxu Liu
Tianjin Huang
Ronghui Mu
76
1
0
22 Jan 2025
MOFHEI: Model Optimizing Framework for Fast and Efficient
  Homomorphically Encrypted Neural Network Inference
MOFHEI: Model Optimizing Framework for Fast and Efficient Homomorphically Encrypted Neural Network Inference
Parsa Ghazvinian
Robert Podschwadt
Prajwal Panzade
Mohammad H. Rafiei
Daniel Takabi
72
0
0
10 Dec 2024
NoLoR: An ASR-Based Framework for Expedited Endangered Language
  Documentation with Neo-Aramaic as a Case Study
NoLoR: An ASR-Based Framework for Expedited Endangered Language Documentation with Neo-Aramaic as a Case Study
Matthew Nazari
65
0
0
06 Dec 2024
GradAlign for Training-free Model Performance Inference
GradAlign for Training-free Model Performance Inference
Yuxuan Li
Yunhui Guo
62
0
0
29 Nov 2024
An In-depth Investigation of Sparse Rate Reduction in Transformer-like
  Models
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models
Yunzhe Hu
Difan Zou
Dong Xu
74
1
0
26 Nov 2024
Re-examining learning linear functions in context
Omar Naim
Guilhem Fouilhé
Nicholas Asher
63
0
0
18 Nov 2024
Understanding Generalization in Quantum Machine Learning with Margins
Understanding Generalization in Quantum Machine Learning with Margins
Tak Hur
Daniel K. Park
AI4CE
26
1
0
11 Nov 2024
Slowing Down Forgetting in Continual Learning
Slowing Down Forgetting in Continual Learning
Pascal Janetzky
Tobias Schlagenhauf
Stefan Feuerriegel
CLL
34
0
0
11 Nov 2024
ELU-GCN: Effectively Label-Utilizing Graph Convolutional Network
ELU-GCN: Effectively Label-Utilizing Graph Convolutional Network
Jincheng Huang
Yujie Mo
Xiaoshuang Shi
Lei Feng
Xiaofeng Zhu
39
0
0
04 Nov 2024
1st-Order Magic: Analysis of Sharpness-Aware Minimization
1st-Order Magic: Analysis of Sharpness-Aware Minimization
Nalin Tiwary
Siddarth Aananth
23
0
0
03 Nov 2024
Generalizability of Memorization Neural Networks
Generalizability of Memorization Neural Networks
Lijia Yu
Xiao-Shan Gao
Lijun Zhang
Yibo Miao
28
1
0
01 Nov 2024
FuseFL: One-Shot Federated Learning through the Lens of Causality with
  Progressive Model Fusion
FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion
Zhenheng Tang
Yonggang Zhang
Peijie Dong
Y. Cheung
Amelie Chi Zhou
Bo Han
Xiaowen Chu
FedML
MoMe
AI4CE
49
6
0
27 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios
  and over-parameterized regimes
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
26
0
0
22 Oct 2024
Simplicity Bias via Global Convergence of Sharpness Minimization
Simplicity Bias via Global Convergence of Sharpness Minimization
Khashayar Gatmiry
Zhiyuan Li
Sashank J. Reddi
Stefanie Jegelka
29
1
0
21 Oct 2024
Implicit Regularization of Sharpness-Aware Minimization for
  Scale-Invariant Problems
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems
Bingcong Li
Liang Zhang
Niao He
41
3
0
18 Oct 2024
Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating
  the Accuracy-Robustness Tradeoff
Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating the Accuracy-Robustness Tradeoff
Peimeng Guan
Mark A. Davenport
28
0
0
18 Oct 2024
Sharpness-Aware Black-Box Optimization
Sharpness-Aware Black-Box Optimization
Feiyang Ye
Yueming Lyu
Xuehao Wang
Masashi Sugiyama
Yu-Jie Zhang
Ivor W. Tsang
AAML
42
0
0
16 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex
  Q-functions
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
16
0
0
15 Oct 2024
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture
Sajad Movahedi
Antonio Orvieto
Seyed-Mohsen Moosavi-Dezfooli
AI4CE
AAML
142
0
0
15 Oct 2024
MoTE: Reconciling Generalization with Specialization for Visual-Language
  to Video Knowledge Transfer
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
Minghao Zhu
Zhengpu Wang
Mengxian Hu
Ronghao Dang
Xiao Lin
Xun Zhou
Chengju Liu
Qijun Chen
37
1
0
14 Oct 2024
Learning via Surrogate PAC-Bayes
Learning via Surrogate PAC-Bayes
Antoine Picard-Weibel
Roman Moscoviz
Benjamin Guedj
23
0
0
14 Oct 2024
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Zhanpeng Zhou
Mingze Wang
Yuchen Mao
Bingrui Li
Junchi Yan
AAML
62
0
0
14 Oct 2024
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes
  Fail to Improve?
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Fırat Öncel
Matthias Bethge
B. Ermiş
Mirco Ravanelli
Cem Subakan
Çağatay Yıldız
27
1
0
08 Oct 2024
LOTOS: Layer-wise Orthogonalization for Training Robust Ensembles
LOTOS: Layer-wise Orthogonalization for Training Robust Ensembles
A. Boroojeny
Hari Sundaram
Varun Chandrasekaran
AAML
34
1
0
07 Oct 2024
Improving Generalization with Flat Hilbert Bayesian Inference
Improving Generalization with Flat Hilbert Bayesian Inference
Tuan Truong
Quyen Tran
Quan Pham-Ngoc
Nhat Ho
Dinh Q. Phung
Trung Le
26
0
0
05 Oct 2024
Simplicity bias and optimization threshold in two-layer ReLU networks
Simplicity bias and optimization threshold in two-layer ReLU networks
Etienne Boursier
Nicolas Flammarion
31
2
0
03 Oct 2024
Towards Better Generalization: Weight Decay Induces Low-rank Bias for
  Neural Networks
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
Ke Chen
Chugang Yi
Haizhao Yang
MLT
21
0
0
03 Oct 2024
Model X-Ray: Detection of Hidden Malware in AI Model Weights using Few
  Shot Learning
Model X-Ray: Detection of Hidden Malware in AI Model Weights using Few Shot Learning
Daniel Gilkarov
Ran Dubin
22
0
0
28 Sep 2024
Super Level Sets and Exponential Decay: A Synergistic Approach to Stable
  Neural Network Training
Super Level Sets and Exponential Decay: A Synergistic Approach to Stable Neural Network Training
J. Chaudhary
Dipak Nidhi
J. Heikkonen
H. Merisaari
R. Kanth
21
0
0
25 Sep 2024
Revisiting Video Quality Assessment from the Perspective of
  Generalization
Revisiting Video Quality Assessment from the Perspective of Generalization
Xinli Yue
Jianhui Sun
Liangchao Yao
Fan Xia
Yuetang Deng
...
Lei Li
Fengyun Rao
Jing Lv
Qian Wang
Lingchen Zhao
MoMe
32
0
0
23 Sep 2024
Bilateral Sharpness-Aware Minimization for Flatter Minima
Bilateral Sharpness-Aware Minimization for Flatter Minima
Jiaxin Deng
Junbiao Pang
Baochang Zhang
Qingming Huang
AAML
116
0
0
20 Sep 2024
Unraveling the Hessian: A Key to Smooth Convergence in Loss Function
  Landscapes
Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes
Nikita Kiselev
Andrey Grabovoy
51
1
0
18 Sep 2024
The Optimality of (Accelerated) SGD for High-Dimensional Quadratic
  Optimization
The Optimality of (Accelerated) SGD for High-Dimensional Quadratic Optimization
Haihan Zhang
Yuanshi Liu
Qianwen Chen
Cong Fang
38
0
0
15 Sep 2024
123456
Next