Layer-Parallel Training of Deep Residual Neural Networks

11 December 2018

Papers citing "Layer-Parallel Training of Deep Residual Neural Networks"

Optimal Control Theoretic Neural Optimizer: From Backpropagation to Dynamic Programming

Guan-Horng Liu

Tianrong Chen

Evangelos A. Theodorou

AI4CE

104

15 Oct 2025

OCTANE -- Optimal Control for Tensor-based Autoencoder Network Emergence: Explicit Case

09 Sep 2025

A Nonoverlapping Domain Decomposition Method for Extreme Learning Machines: Elliptic Problems

Chang-Ock Lee

Youngkyu Lee

Byungeun Ryoo

190

22 Jun 2024

Two-level overlapping additive Schwarz preconditioner for training scientific machine learning applications

241

16 Jun 2024

Rethinking the Relationship between Recurrent and Non-Recurrent Neural Networks: A Study in Sparsity

338

01 Apr 2024

Machine learning and domain decomposition methods -- a survey

204

21 Dec 2023

Parallel Trust-Region Approaches in Neural Network Training: Beyond Traditional Methods

Ken Trotti

Samuel A. Cruz Alegría

Alena Kopanicáková

Rolf Krause

215

21 Dec 2023

Fast Multipole Attention: A Scalable Multilevel Attention Mechanism for Text and Images

Yanming Kang

Giang Tran

H. Sterck

321

18 Oct 2023

DeepPCR: Parallelizing Sequential Operations in Neural Networks

Neural Information Processing Systems (NeurIPS), 2023

Federico Danieli

Miguel Sarabia

Xavier Suau

Yuan-Sen Ting

Luca Zappella

227

28 Sep 2023

Parallelizing non-linear sequential models over the sequence length

International Conference on Learning Representations (ICLR), 2023

419

21 Sep 2023

Enhancing training of physics-informed neural networks using domain-decomposition based preconditioning strategies

SIAM Journal on Scientific Computing (SISC), 2023

227

30 Jun 2023

Parareal with a physics-informed neural network as coarse propagator

European Conference on Parallel Processing (Euro-Par), 2023

A. Ibrahim

Sebastian Götschel

Daniel Ruprecht

233

07 Mar 2023

Multilevel-in-Layer Training for Deep Neural Network Regression

116

11 Nov 2022

The phase unwrapping of under-sampled interferograms using radial basis function neural networks

P. Gourdain

Aidan Bachmann

19 Oct 2022

An Optimal Time Variable Learning Framework for Deep Neural Networks

Annals of Mathematical Sciences and Applications (AMSA), 2022

Harbir Antil

Hugo Díaz

Evelyn Herberg

119

18 Apr 2022

TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed

Computer Vision and Pattern Recognition (CVPR), 2022

259

19 Mar 2022

Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences

International Conference on Learning Representations (ICLR), 2022

G. Moon

E. Cyr

143

07 Mar 2022

Layer-Parallel Training of Residual Networks with Auxiliary-Variable Networks

207

10 Dec 2021

Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations

Research in the Mathematical Sciences (Res. Math. Sci.), 2021

263

31 Aug 2021

Connections between Numerical Algorithms for PDEs and Neural Networks

Journal of Mathematical Imaging and Vision (JMIV), 2021

262

30 Jul 2021

Globally Convergent Multilevel Training of Deep Residual Networks

Alena Kopanicáková

Rolf Krause

335

15 Jul 2021

ResIST: Layer-Wise Decomposition of ResNets for Distributed Training

Chen Dun

Cameron R. Wolfe

C. Jermaine

Anastasios Kyrillidis

325

02 Jul 2021

Differentiable Multiple Shooting Layers

Neural Information Processing Systems (NeurIPS), 2021

Jinkyoo Park

140

07 Jun 2021

Dynamic Game Theoretic Neural Optimizer

International Conference on Machine Learning (ICML), 2021

Guan-Horng Liu

T. Chen

Evangelos A. Theodorou

AI4CE

279

08 May 2021

Parareal Neural Networks Emulating a Parallel-in-time Algorithm

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

196

16 Mar 2021

Spline parameterization of neural network controls for deep learning

Stefanie Günther

Will Pazner

Dongping Qi

120

27 Feb 2021

GIST: Distributed Training for Large-Scale Graph Convolutional Networks

Journal of Applied and Computational Topology (JACT), 2021

Chen Dun

Anastasios Kyrillidis

BDL GNN LRM

295

20 Feb 2021

Novel Deep neural networks for solving Bayesian statistical inverse

130

08 Feb 2021

Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression

IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020

Cody Blakeney

Xiaomin Li

Yan Yan

Ziliang Zong

262

05 Dec 2020

MGIC: Multigrid-in-Channels Neural Network Architectures

SIAM Journal on Scientific Computing (SIAM J. Sci. Comput.), 2020

406

17 Nov 2020

A Practical Layer-Parallel Training Algorithm for Residual Networks

255

03 Sep 2020

A Differential Game Theoretic Neural Optimizer for Training Residual Networks

Guan-Horng Liu

T. Chen

Evangelos A. Theodorou

156

17 Jul 2020

Layer-Parallel Training with GPU Concurrency of Deep Residual Neural Networks via Nonlinear Multigrid

IEEE Conference on High Performance Extreme Computing (HPEC), 2020

Andrew Kirby

S. Samsi

Michael Jones

Albert Reuther

J. Kepner

V. Gadepally

156

14 Jul 2020

Multigrid-in-Channels Architectures for Wide Convolutional Neural Networks

Jonathan Ephrath

Lars Ruthotto

Eran Treister

161

11 Jun 2020

Structure preserving deep learning

E. Celledoni

Matthias Joachim Ehrhardt

Christian Etmann

R. McLachlan

B. Owren

Carola-Bibiane Schönlieb

Ferdia Sherry

AI4CE

217

05 Jun 2020

Discretize-Optimize vs. Optimize-Discretize for Time-Series Regression and Continuous Normalizing Flows

Derek Onken

Lars Ruthotto

BDL

265

27 May 2020

Multilevel Minimization for Deep Residual Networks

ESAIM Proceedings and Surveys (ESAIM Proc. Surv.), 2020

Lisa Gaedke-Merzhäuser

Alena Kopanicáková

Rolf Krause

204

13 Apr 2020

Fractional Deep Neural Network via Constrained Optimization

160

01 Apr 2020

Deep connections between learning from limited labels & physical parameter estimation -- inspiration for regularization

Bas Peters

AI4CE

132

17 Mar 2020

DDPNOpt: Differential Dynamic Programming Neural Optimizer

International Conference on Learning Representations (ICLR), 2020

Guan-Horng Liu

T. Chen

Evangelos A. Theodorou

272

20 Feb 2020

Hamiltonian neural networks for solving equations of motion

Physical Review E (PRE), 2020

474

29 Jan 2020

Multilevel Initialization for Layer-Parallel Deep Neural Network Training

122

19 Dec 2019

A literature survey of matrix methods for data science

GAMM-Mitteilungen (GAMM), 2019

Martin Stoll

225

17 Dec 2019

Parareal with a Learned Coarse Model for Robotic Manipulation

220

12 Dec 2019

A Machine Learning Framework for Solving High-Dimensional Mean Field Game and Mean Field Control Problems

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2019

Lars Ruthotto

Stanley Osher

Wuchen Li

L. Nurbekyan

Samy Wu Fung

AI4CE

363

257

04 Dec 2019

Predict Globally, Correct Locally: Parallel-in-Time Optimal Control of Neural Networks

P. Parpas

Corey Muir

OOD

156

07 Feb 2019