ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.05947
  4. Cited By
Generalization Bounds of SGLD for Non-convex Learning: Two Theoretical
  Viewpoints

Generalization Bounds of SGLD for Non-convex Learning: Two Theoretical Viewpoints

19 July 2017
Wenlong Mou
Liwei Wang
Xiyu Zhai
Kai Zheng
    MLT
ArXiv (abs)PDFHTML

Papers citing "Generalization Bounds of SGLD for Non-convex Learning: Two Theoretical Viewpoints"

50 / 107 papers shown
Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size?
Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size?
Xuanyu Chen
Nan Yang
Shuai Wang
Dong Yuan
FedML
329
0
0
15 Nov 2025
On the Alignment Between Supervised and Self-Supervised Contrastive Learning
On the Alignment Between Supervised and Self-Supervised Contrastive Learning
Achleshwar Luthra
Priyadarsi Mishra
Tomer Galanti
SSL
211
1
0
09 Oct 2025
RamPINN: Recovering Raman Spectra From Coherent Anti-Stokes Spectra Using Embedded Physics
RamPINN: Recovering Raman Spectra From Coherent Anti-Stokes Spectra Using Embedded Physics
Sai Karthikeya Vemuri
Adithya Ashok Chalain Valapil
Tim Buchner
Joachim Denzler
151
0
0
07 Oct 2025
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen
Zhichao Wang
Wei Huang
Andi Han
Taiji Suzuki
Arya Mazumdar
MLT
305
1
0
12 Jun 2025
Algorithm- and Data-Dependent Generalization Bounds for Diffusion Models
Algorithm- and Data-Dependent Generalization Bounds for Diffusion Models
Benjamin Dupuis
Dario Shariatian
Maxime Haddouche
Alain Durmus
Umut Simsekli
248
1
0
04 Jun 2025
Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables
Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables
Futoshi Futami
Masahiro Fujisawa
DRLCML
526
0
0
26 May 2025
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
I. Harel
Yonathan Wolanowsky
Gal Vardi
Nathan Srebro
Daniel Soudry
AI4CE
471
2
0
25 May 2025
Randomized Pairwise Learning with Adaptive Sampling: A PAC-Bayes Analysis
Randomized Pairwise Learning with Adaptive Sampling: A PAC-Bayes Analysis
Sijia Zhou
Yunwen Lei
Ata Kabán
400
0
0
03 Apr 2025
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from GeneralizationInternational Conference on Learning Representations (ICLR), 2025
Zixuan Gong
Xiaolin Hu
Huayi Tang
Yong Liu
360
3
0
24 Feb 2025
Understanding Generalization in Transformers: Error Bounds and Training Dynamics Under Benign and Harmful Overfitting
Understanding Generalization in Transformers: Error Bounds and Training Dynamics Under Benign and Harmful Overfitting
Yingying Zhang
Zhikai Wu
Jian Li
Wenshu Fan
MLTAI4CE
273
3
0
18 Feb 2025
Generalization Bounds for Markov Algorithms through Entropy Flow Computations
Generalization Bounds for Markov Algorithms through Entropy Flow Computations
Benjamin Dupuis
Maxime Haddouche
George Deligiannidis
Umut Simsekli
369
1
0
11 Feb 2025
Rethinking generalization of classifiers in separable classes scenarios
  and over-parameterized regimes
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimesIEEE International Joint Conference on Neural Network (IJCNN), 2024
Julius Martinetz
C. Linse
Thomas Martinetz
395
0
0
22 Oct 2024
Generalization and Robustness of the Tilted Empirical Risk
Generalization and Robustness of the Tilted Empirical Risk
Gholamali Aminian
Amir R. Asadi
Tian Li
Ahmad Beirami
Gesine Reinert
Samuel N. Cohen
517
1
0
28 Sep 2024
Information-theoretic Generalization Analysis for Expected Calibration Error
Information-theoretic Generalization Analysis for Expected Calibration Error
Futoshi Futami
Masahiro Fujisawa
UQCVCML
400
15
0
24 May 2024
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Benjamin Dupuis
Paul Viallard
George Deligiannidis
Umut Simsekli
382
6
0
26 Apr 2024
Tighter Generalisation Bounds via Interpolation
Tighter Generalisation Bounds via Interpolation
Paul Viallard
Maxime Haddouche
Umut Simsekli
Benjamin Guedj
355
4
0
07 Feb 2024
Subsampling is not Magic: Why Large Batch Sizes Work for Differentially
  Private Stochastic Optimisation
Subsampling is not Magic: Why Large Batch Sizes Work for Differentially Private Stochastic Optimisation
Ossi Raisa
Hibiki Ito
Antti Honkela
316
9
0
06 Feb 2024
Adam-like Algorithm with Smooth Clipping Attains Global Minima: Analysis
  Based on Ergodicity of Functional SDEs
Adam-like Algorithm with Smooth Clipping Attains Global Minima: Analysis Based on Ergodicity of Functional SDEs
Keisuke Suzuki
234
0
0
29 Nov 2023
Using Stochastic Gradient Descent to Smooth Nonconvex Functions:
  Analysis of Implicit Graduated Optimization with Optimal Noise Scheduling
Using Stochastic Gradient Descent to Smooth Nonconvex Functions: Analysis of Implicit Graduated Optimization with Optimal Noise Scheduling
Naoki Sato
Hideaki Iiduka
437
4
0
15 Nov 2023
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Huayi Tang
Yong Liu
553
3
0
08 Nov 2023
Time-Independent Information-Theoretic Generalization Bounds for SGLD
Time-Independent Information-Theoretic Generalization Bounds for SGLDNeural Information Processing Systems (NeurIPS), 2023
Futoshi Futami
Masahiro Fujisawa
452
12
0
02 Nov 2023
Generalization Bounds for Label Noise Stochastic Gradient Descent
Generalization Bounds for Label Noise Stochastic Gradient DescentInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jung Eun Huh
Patrick Rebeschini
377
3
0
01 Nov 2023
Sample-Conditioned Hypothesis Stability Sharpens Information-Theoretic
  Generalization Bounds
Sample-Conditioned Hypothesis Stability Sharpens Information-Theoretic Generalization BoundsNeural Information Processing Systems (NeurIPS), 2023
Ziqiao Wang
Yongyi Mao
402
7
0
31 Oct 2023
Minibatch and Local SGD: Algorithmic Stability and Linear Speedup in Generalization
Minibatch and Local SGD: Algorithmic Stability and Linear Speedup in Generalization
Yunwen Lei
Tao Sun
Mingrui Liu
573
4
0
02 Oct 2023
A Survey on Privacy in Graph Neural Networks: Attacks, Preservation, and
  Applications
A Survey on Privacy in Graph Neural Networks: Attacks, Preservation, and ApplicationsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Yi Zhang
Yuying Zhao
Zhaoqing Li
Xueqi Cheng
Yu Wang
Olivera Kotevska
Philip S. Yu
Hanyu Wang
464
40
0
31 Aug 2023
Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent
Towards Understanding the Generalizability of Delayed Stochastic Gradient DescentIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiaoge Deng
Li Shen
Shengwei Li
Tao Sun
Dongsheng Li
Dacheng Tao
492
3
0
18 Aug 2023
Power-law Dynamic arising from machine learning
Power-law Dynamic arising from machine learning
Wei Chen
Weitao Du
Zhi-Ming Ma
Qi Meng
244
0
0
16 Jun 2023
Understanding Generalization of Federated Learning via Stability:
  Heterogeneity Matters
Understanding Generalization of Federated Learning via Stability: Heterogeneity MattersInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Zhenyu Sun
Xiaochun Niu
Ermin Wei
FedMLMLT
288
35
0
06 Jun 2023
Stochastic Gradient Langevin Dynamics Based on Quantization with
  Increasing Resolution
Stochastic Gradient Langevin Dynamics Based on Quantization with Increasing Resolution
Jinwuk Seok
Chang-Jae Cho
376
0
0
30 May 2023
Generalization Guarantees of Gradient Descent for Multi-Layer Neural
  Networks
Generalization Guarantees of Gradient Descent for Multi-Layer Neural NetworksNeural Computation (Neural Comput.), 2023
Puyu Wang
Yunwen Lei
Di Wang
Yiming Ying
Ding-Xuan Zhou
MLT
366
8
0
26 May 2023
Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic
  Gradient Descent
Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic Gradient DescentNeural Information Processing Systems (NeurIPS), 2023
Lingjiong Zhu
Mert Gurbuzbalaban
Anant Raj
Umut Simsekli
321
8
0
20 May 2023
Select without Fear: Almost All Mini-Batch Schedules Generalize
  Optimally
Select without Fear: Almost All Mini-Batch Schedules Generalize Optimally
Konstantinos E. Nikolakakis
Amin Karbasi
Dionysis Kalogerias
399
7
0
03 May 2023
AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in
  GNNs
AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in GNNsAAAI Conference on Artificial Intelligence (AAAI), 2023
Shengrui Li
Xueting Han
Jing Bai
AI4CE
229
26
0
19 Apr 2023
Benign Overfitting for Two-layer ReLU Convolutional Neural Networks
Benign Overfitting for Two-layer ReLU Convolutional Neural NetworksInternational Conference on Machine Learning (ICML), 2023
Yiwen Kou
Zi-Yuan Chen
Yuanzhou Chen
Quanquan Gu
MLT
261
32
0
07 Mar 2023
Geometric ergodicity of SGLD via reflection coupling
Geometric ergodicity of SGLD via reflection couplingStochastics and Dynamics (SD), 2023
Lei Li
Jian‐Guo Liu
Yuliang Wang
182
4
0
17 Jan 2023
Do highly over-parameterized neural networks generalize since bad
  solutions are rare?
Do highly over-parameterized neural networks generalize since bad solutions are rare?IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Julius Martinetz
T. Martinetz
531
2
0
07 Nov 2022
On the Importance of Gradient Norm in PAC-Bayesian Bounds
On the Importance of Gradient Norm in PAC-Bayesian BoundsNeural Information Processing Systems (NeurIPS), 2022
Itai Gat
Yossi Adi
Alex Schwing
Tamir Hazan
BDL
326
6
0
12 Oct 2022
Differentially Private Deep Learning with ModelMix
Differentially Private Deep Learning with ModelMix
Hanshen Xiao
Jun Wan
S. Devadas
280
5
0
07 Oct 2022
Exploring the Algorithm-Dependent Generalization of AUPRC Optimization
  with List Stability
Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List StabilityNeural Information Processing Systems (NeurIPS), 2022
Peisong Wen
Qianqian Xu
Zhiyong Yang
Yuan He
Qingming Huang
335
15
0
27 Sep 2022
Generalisation under gradient descent via deterministic PAC-Bayes
Generalisation under gradient descent via deterministic PAC-BayesInternational Conference on Algorithmic Learning Theory (ALT), 2022
Eugenio Clerico
Tyler Farghly
George Deligiannidis
Benjamin Guedj
Arnaud Doucet
504
6
0
06 Sep 2022
Super-model ecosystem: A domain-adaptation perspective
Super-model ecosystem: A domain-adaptation perspective
Fengxiang He
Dacheng Tao
DiffM
208
1
0
30 Aug 2022
A sharp uniform-in-time error estimate for Stochastic Gradient Langevin Dynamics
A sharp uniform-in-time error estimate for Stochastic Gradient Langevin DynamicsCSIAM Transactions on Applied Mathematics (TCAM), 2022
Lei Li
Yuliang Wang
439
16
0
19 Jul 2022
Max-Margin Works while Large Margin Fails: Generalization without
  Uniform Convergence
Max-Margin Works while Large Margin Fails: Generalization without Uniform ConvergenceInternational Conference on Learning Representations (ICLR), 2022
Margalit Glasgow
Colin Wei
Mary Wootters
Tengyu Ma
372
6
0
16 Jun 2022
Stability and Generalization of Stochastic Optimization with Nonconvex
  and Nonsmooth Problems
Stability and Generalization of Stochastic Optimization with Nonconvex and Nonsmooth ProblemsAnnual Conference Computational Learning Theory (COLT), 2022
Yunwen Lei
322
32
0
14 Jun 2022
Generalization Error Bounds for Deep Neural Networks Trained by SGD
Generalization Error Bounds for Deep Neural Networks Trained by SGD
Mingze Wang
Chao Ma
174
21
0
07 Jun 2022
Algorithmic Stability of Heavy-Tailed Stochastic Gradient Descent on
  Least Squares
Algorithmic Stability of Heavy-Tailed Stochastic Gradient Descent on Least SquaresInternational Conference on Algorithmic Learning Theory (ALT), 2022
Anant Raj
Melih Barsbey
Mert Gurbuzbalaban
Lingjiong Zhu
Umut Simsekli
310
13
0
02 Jun 2022
Generalization Bounds for Gradient Methods via Discrete and Continuous
  Prior
Generalization Bounds for Gradient Methods via Discrete and Continuous PriorNeural Information Processing Systems (NeurIPS), 2022
Jun Yu Li
Xu Luo
Jian Li
389
4
0
27 May 2022
Uniform Generalization Bound on Time and Inverse Temperature for
  Gradient Descent Algorithm and its Application to Analysis of Simulated
  Annealing
Uniform Generalization Bound on Time and Inverse Temperature for Gradient Descent Algorithm and its Application to Analysis of Simulated Annealing
Keisuke Suzuki
AI4CE
279
0
0
25 May 2022
Weak Convergence of Approximate reflection coupling and its Application
  to Non-convex Optimization
Weak Convergence of Approximate reflection coupling and its Application to Non-convex Optimization
Keisuke Suzuki
318
6
0
24 May 2022
Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for
  Full-Batch GD
Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GDInternational Conference on Learning Representations (ICLR), 2022
Konstantinos E. Nikolakakis
Farzin Haddadpour
Amin Karbasi
Dionysios S. Kalogerias
555
23
0
26 Apr 2022
123
Next
Page 1 of 3