Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2006.02409
Cited By

On the Promise of the Stochastic Generalized Gauss-Newton Method for
Training DNNs

v1v2v3v4 (latest)

On the Promise of the Stochastic Generalized Gauss-Newton Method for Training DNNs

3 June 2020

Matilde Gargiani

Katharina Eggensperger

ArXiv (abs)PDF HTML

Papers citing "On the Promise of the Stochastic Generalized Gauss-Newton Method for Training DNNs"

12 / 12 papers shown

Incremental Gauss-Newton Descent for Machine Learning

Incremental Gauss-Newton Descent for Machine Learning

232

1

0

10 Aug 2024

Exact Gauss-Newton Optimization for Training Deep Neural Networks

Exact Gauss-Newton Optimization for Training Deep Neural Networks

Adeyemi Damilare Adeoye

Alberto Bemporad

408

9

0

23 May 2024

Thermodynamic Natural Gradient Descent

Thermodynamic Natural Gradient Descent

Kaelan Donatella

Samuel Duffield

Patrick J. Coles

198

5

0

22 May 2024

Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization

Dynamic Anisotropic Smoothing for Noisy Derivative-Free OptimizationInternational Conference on Machine Learning (ICML), 2024

Yoshihisa Yamamoto

299

3

0

02 May 2024

A Selective Review on Statistical Methods for Massive Data Computation:
Distributed Computing, Subsampling, and Minibatch Techniques

A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

...

234

18

0

17 Mar 2024

Dual Gauss-Newton Directions for Deep Learning

Dual Gauss-Newton Directions for Deep Learning

Mathieu Blondel

218

0

0

17 Aug 2023

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model
Pre-training

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-trainingInternational Conference on Learning Representations (ICLR), 2023

David Leo Wright Hall

Abigail Z. Jacobs

726

259

0

23 May 2023

Achieving High Accuracy with PINNs via Energy Natural Gradients

Achieving High Accuracy with PINNs via Energy Natural GradientsInternational Conference on Machine Learning (ICML), 2023

Johannes Müller

Marius Zeinhofer

381

12

0

25 Feb 2023

Efficient first-order predictor-corrector multiple objective
optimization for fair misinformation detection

Efficient first-order predictor-corrector multiple objective optimization for fair misinformation detection

Katja Mathesius

Arielle K. Carr

115

2

0

15 Sep 2022

PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method
with Probabilistic Gradient Estimation

PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient EstimationInternational Conference on Machine Learning (ICML), 2022

Matilde Gargiani

Andrea Martinelli

Tyler H. Summers

184

17

0

01 Feb 2022

Inexact bilevel stochastic gradient methods for constrained and
unconstrained lower-level problems

Inexact bilevel stochastic gradient methods for constrained and unconstrained lower-level problems

Tommaso Giovannelli

Luis Nunes Vicente

393

16

0

01 Oct 2021

ViViT: Curvature access through the generalized Gauss-Newton's low-rank
structure

ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure

239

15

0

04 Jun 2021

Page 1 of 1