Demystifying Parallel and Distributed Deep Learning: An In-Depth
Concurrency AnalysisACM Computing Surveys (CSUR), 2018 |
A Stochastic Trust Region Algorithm Based on Careful Step NormalizationINFORMS Journal on Optimization (JIO), 2017 |
Snake: a Stochastic Proximal Gradient Algorithm for Regularized Problems
over Large GraphsIEEE Transactions on Automatic Control (TAC), 2017 |
On the role of synaptic stochasticity in training low-precision neural
networksPhysical Review Letters (PRL), 2017 |
Smart "Predict, then Optimize"Management Sciences (MS), 2017 |
Regularizing and Optimizing LSTM Language ModelsInternational Conference on Learning Representations (ICLR), 2017 |