Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1812.04352
Cited By
v1
v2
v3 (latest)
Layer-Parallel Training of Deep Residual Neural Networks
11 December 2018
Stefanie Günther
Lars Ruthotto
J. Schroder
E. Cyr
N. Gauger
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Layer-Parallel Training of Deep Residual Neural Networks"
46 / 46 papers shown
Optimal Control Theoretic Neural Optimizer: From Backpropagation to Dynamic Programming
Guan-Horng Liu
Tianrong Chen
Evangelos A. Theodorou
AI4CE
104
0
0
15 Oct 2025
OCTANE -- Optimal Control for Tensor-based Autoencoder Network Emergence: Explicit Case
R. Khatri
Anthony Kolshorn
Colin Olson
Harbir Antil
72
0
0
09 Sep 2025
A Nonoverlapping Domain Decomposition Method for Extreme Learning Machines: Elliptic Problems
Chang-Ock Lee
Youngkyu Lee
Byungeun Ryoo
190
6
0
22 Jun 2024
Two-level overlapping additive Schwarz preconditioner for training scientific machine learning applications
Youngkyu Lee
Alena Kopanicáková
George Karniadakis
AI4CE
241
3
0
16 Jun 2024
Rethinking the Relationship between Recurrent and Non-Recurrent Neural Networks: A Study in Sparsity
Quincy Hershey
Randy Paffenroth
Harsh Nilesh Pathak
Simon Tavener
338
5
0
01 Apr 2024
Machine learning and domain decomposition methods -- a survey
A. Klawonn
M. Lanser
J. Weber
AI4CE
204
21
0
21 Dec 2023
Parallel Trust-Region Approaches in Neural Network Training: Beyond Traditional Methods
Ken Trotti
Samuel A. Cruz Alegría
Alena Kopanicáková
Rolf Krause
215
2
0
21 Dec 2023
Fast Multipole Attention: A Scalable Multilevel Attention Mechanism for Text and Images
Yanming Kang
Giang Tran
H. Sterck
321
9
0
18 Oct 2023
DeepPCR: Parallelizing Sequential Operations in Neural Networks
Neural Information Processing Systems (NeurIPS), 2023
Federico Danieli
Miguel Sarabia
Xavier Suau
Yuan-Sen Ting
Luca Zappella
227
6
0
28 Sep 2023
Parallelizing non-linear sequential models over the sequence length
International Conference on Learning Representations (ICLR), 2023
Yi Heng Lim
Qi Zhu
Joshua Selfridge
M. F. Kasim
419
28
0
21 Sep 2023
Enhancing training of physics-informed neural networks using domain-decomposition based preconditioning strategies
SIAM Journal on Scientific Computing (SISC), 2023
Alena Kopanicáková
Hardik Kothari
George Karniadakis
Rolf Krause
AI4CE
227
26
0
30 Jun 2023
Parareal with a physics-informed neural network as coarse propagator
European Conference on Parallel Processing (Euro-Par), 2023
A. Ibrahim
Sebastian Götschel
Daniel Ruprecht
233
13
0
07 Mar 2023
Multilevel-in-Layer Training for Deep Neural Network Regression
Colin Ponce
Ruipeng Li
Christina Mao
P. Vassilevski
AI4CE
116
1
0
11 Nov 2022
The phase unwrapping of under-sampled interferograms using radial basis function neural networks
P. Gourdain
Aidan Bachmann
79
1
0
19 Oct 2022
An Optimal Time Variable Learning Framework for Deep Neural Networks
Annals of Mathematical Sciences and Applications (AMSA), 2022
Harbir Antil
Hugo Díaz
Evelyn Herberg
119
4
0
18 Apr 2022
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed
Computer Vision and Pattern Recognition (CVPR), 2022
Shian Du
Yihong Luo
Wei Chen
Jian Xu
Delu Zeng
259
9
0
19 Mar 2022
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
International Conference on Learning Representations (ICLR), 2022
G. Moon
E. Cyr
143
8
0
07 Mar 2022
Layer-Parallel Training of Residual Networks with Auxiliary-Variable Networks
Qi Sun
Hexin Dong
Zewei Chen
Jiacheng Sun
Zhenguo Li
Bin Dong
207
3
0
10 Dec 2021
Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations
Research in the Mathematical Sciences (Res. Math. Sci.), 2021
Ido Ben-Yair
Gil Ben Shalom
Moshe Eliasof
Eran Treister
MQ
263
5
0
31 Aug 2021
Connections between Numerical Algorithms for PDEs and Neural Networks
Journal of Mathematical Imaging and Vision (JMIV), 2021
Tobias Alt
Karl Schrader
M. Augustin
Pascal Peter
Joachim Weickert
PINN
262
26
0
30 Jul 2021
Globally Convergent Multilevel Training of Deep Residual Networks
Alena Kopanicáková
Rolf Krause
335
19
0
15 Jul 2021
ResIST: Layer-Wise Decomposition of ResNets for Distributed Training
Chen Dun
Cameron R. Wolfe
C. Jermaine
Anastasios Kyrillidis
325
23
0
02 Jul 2021
Differentiable Multiple Shooting Layers
Neural Information Processing Systems (NeurIPS), 2021
Stefano Massaroli
Michael Poli
Sho Sonoda
Taji Suzuki
Jinkyoo Park
Atsushi Yamashita
Hajime Asama
AI4CE
140
20
0
07 Jun 2021
Dynamic Game Theoretic Neural Optimizer
International Conference on Machine Learning (ICML), 2021
Guan-Horng Liu
T. Chen
Evangelos A. Theodorou
AI4CE
279
6
0
08 May 2021
Parareal Neural Networks Emulating a Parallel-in-time Algorithm
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Zhanyu Ma
Jiyang Xie
Jingyi Yu
AI4CE
196
12
0
16 Mar 2021
Spline parameterization of neural network controls for deep learning
Stefanie Günther
Will Pazner
Dongping Qi
120
4
0
27 Feb 2021
GIST: Distributed Training for Large-Scale Graph Convolutional Networks
Journal of Applied and Computational Topology (JACT), 2021
Cameron R. Wolfe
Jingkang Yang
Arindam Chowdhury
Chen Dun
Artun Bayer
Santiago Segarra
Anastasios Kyrillidis
BDL
GNN
LRM
295
11
0
20 Feb 2021
Novel Deep neural networks for solving Bayesian statistical inverse
Harbir Antil
H. Elman
Akwum Onwunta
Deepanshu Verma
BDL
130
18
0
08 Feb 2021
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020
Cody Blakeney
Xiaomin Li
Yan Yan
Ziliang Zong
262
45
0
05 Dec 2020
MGIC: Multigrid-in-Channels Neural Network Architectures
SIAM Journal on Scientific Computing (SIAM J. Sci. Comput.), 2020
Moshe Eliasof
Jonathan Ephrath
Lars Ruthotto
Eran Treister
406
8
0
17 Nov 2020
A Practical Layer-Parallel Training Algorithm for Residual Networks
Qi Sun
Hexin Dong
Zewei Chen
Weizhen Dian
Jiacheng Sun
Yitong Sun
Zhenguo Li
Bin Dong
ODL
255
2
0
03 Sep 2020
A Differential Game Theoretic Neural Optimizer for Training Residual Networks
Guan-Horng Liu
T. Chen
Evangelos A. Theodorou
156
2
0
17 Jul 2020
Layer-Parallel Training with GPU Concurrency of Deep Residual Neural Networks via Nonlinear Multigrid
IEEE Conference on High Performance Extreme Computing (HPEC), 2020
Andrew Kirby
S. Samsi
Michael Jones
Albert Reuther
J. Kepner
V. Gadepally
156
15
0
14 Jul 2020
Multigrid-in-Channels Architectures for Wide Convolutional Neural Networks
Jonathan Ephrath
Lars Ruthotto
Eran Treister
161
1
0
11 Jun 2020
Structure preserving deep learning
E. Celledoni
Matthias Joachim Ehrhardt
Christian Etmann
R. McLachlan
B. Owren
Carola-Bibiane Schönlieb
Ferdia Sherry
AI4CE
217
47
0
05 Jun 2020
Discretize-Optimize vs. Optimize-Discretize for Time-Series Regression and Continuous Normalizing Flows
Derek Onken
Lars Ruthotto
BDL
265
59
0
27 May 2020
Multilevel Minimization for Deep Residual Networks
ESAIM Proceedings and Surveys (ESAIM Proc. Surv.), 2020
Lisa Gaedke-Merzhäuser
Alena Kopanicáková
Rolf Krause
204
17
0
13 Apr 2020
Fractional Deep Neural Network via Constrained Optimization
Harbir Antil
R. Khatri
R. Löhner
Deepanshu Verma
160
32
0
01 Apr 2020
Deep connections between learning from limited labels & physical parameter estimation -- inspiration for regularization
Bas Peters
AI4CE
132
0
0
17 Mar 2020
DDPNOpt: Differential Dynamic Programming Neural Optimizer
International Conference on Learning Representations (ICLR), 2020
Guan-Horng Liu
T. Chen
Evangelos A. Theodorou
272
7
0
20 Feb 2020
Hamiltonian neural networks for solving equations of motion
Physical Review E (PRE), 2020
M. Mattheakis
David Sondak
Akshunna S. Dogra
P. Protopapas
474
85
0
29 Jan 2020
Multilevel Initialization for Layer-Parallel Deep Neural Network Training
E. Cyr
Stefanie Günther
J. Schroder
AI4CE
122
12
0
19 Dec 2019
A literature survey of matrix methods for data science
GAMM-Mitteilungen (GAMM), 2019
Martin Stoll
225
22
0
17 Dec 2019
Parareal with a Learned Coarse Model for Robotic Manipulation
Wisdom C. Agboh
Oliver Grainger
Daniel Ruprecht
M. Dogar
220
13
0
12 Dec 2019
A Machine Learning Framework for Solving High-Dimensional Mean Field Game and Mean Field Control Problems
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2019
Lars Ruthotto
Stanley Osher
Wuchen Li
L. Nurbekyan
Samy Wu Fung
AI4CE
363
257
0
04 Dec 2019
Predict Globally, Correct Locally: Parallel-in-Time Optimal Control of Neural Networks
P. Parpas
Corey Muir
OOD
156
12
0
07 Feb 2019
1
Page 1 of 1