Understanding deep learning requires rethinking generalization

10 November 2016

Benjamin Recht

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 1,028 papers shown

Title | Authors | Tags | Counts | Date
Linear Stability Hypothesis and Rank Stratification for Nonlinear Models | Yaoyu Zhang, Zhongwang Zhang, Leyang Zhang, Zhiwei Bai, Tao Luo, Z. Xu | - | 27 7 0 | 21 Nov 2022
Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States | Ziqiao Wang, Yongyi Mao | - | 30 10 0 | 19 Nov 2022
Why Deep Learning Generalizes | Benjamin L. Badger | TDI, AI4CE | 20 3 0 | 17 Nov 2022
On the Sample Complexity of Two-Layer Networks: Lipschitz vs. Element-Wise Lipschitz Activation | Amit Daniely, Elad Granot | MLT | 32 1 0 | 17 Nov 2022
REPAIR: REnormalizing Permuted Activations for Interpolation Repair | Keller Jordan, Hanie Sedghi, O. Saukh, R. Entezari, Behnam Neyshabur | MoMe | 46 94 0 | 15 Nov 2022
Robust Training of Graph Neural Networks via Noise Governance | Siyi Qian, Haochao Ying, Renjun Hu, Jingbo Zhou, Jintai Chen, Danny Chen, Jian Wu | NoLa | 33 34 0 | 12 Nov 2022
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction | Xuming Hu, Shiao Meng, Chenwei Zhang, Xiangli Yang, Lijie Wen, Irwin King, Philip S. Yu | - | 52 0 0 | 11 Nov 2022
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators | Aditya Manglik, Minesh Patel, Haiyu Mao, Behzad Salami, Jisung Park, Lois Orosa, O. Mutlu | - | 20 1 0 | 10 Nov 2022
How Does Sharpness-Aware Minimization Minimize Sharpness? | Kaiyue Wen, Tengyu Ma, Zhiyuan Li | AAML | 23 47 0 | 10 Nov 2022
Do highly over-parameterized neural networks generalize since bad solutions are rare? | Julius Martinetz, T. Martinetz | - | 30 1 0 | 07 Nov 2022
Biased Self-supervised learning for ASR | Florian Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdel-rahman Mohamed, P. Woodland | SSL | 30 2 0 | 04 Nov 2022
Private Semi-supervised Knowledge Transfer for Deep Learning from Noisy Labels | Qiuchen Zhang, Jing Ma, Jian Lou, Li Xiong, Xiaoqian Jiang | NoLa | 21 0 0 | 03 Nov 2022
Instance-Dependent Generalization Bounds via Optimal Transport | Songyan Hou, Parnian Kassraie, Anastasis Kratsios, Andreas Krause, Jonas Rothfuss | - | 22 6 0 | 02 Nov 2022
Discriminative Speaker Representation via Contrastive Learning with Class-Aware Attention in Angular Space | Zhe Li, Man-Wai Mak, Helen M. Meng | - | 34 9 0 | 29 Oct 2022
A Functional-Space Mean-Field Theory of Partially-Trained Three-Layer Neural Networks | Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna | MLT | 27 5 0 | 28 Oct 2022
Noise Injection Node Regularization for Robust Learning | N. Levi, I. Bloch, M. Freytsis, T. Volansky | AI4CE | 32 2 0 | 27 Oct 2022
Bridging the visual gap in VLN via semantically richer instructions | Joaquín Ossandón, Benjamín Earle, Alvaro Soto | - | 37 0 0 | 27 Oct 2022
The Curious Case of Benign Memorization | Sotiris Anagnostidis, Gregor Bachmann, Lorenzo Noci, Thomas Hofmann | AAML | 54 8 0 | 25 Oct 2022
Noise Injection as a Probe of Deep Learning Dynamics | Noam Levi, I. Bloch, M. Freytsis, T. Volansky | - | 42 2 0 | 24 Oct 2022
A PAC-Bayesian Generalization Bound for Equivariant Networks | Arash Behboodi, Gabriele Cesa, Taco S. Cohen | - | 58 17 0 | 24 Oct 2022
Revisiting Sparse Convolutional Model for Visual Recognition | Xili Dai, Mingyang Li, Pengyuan Zhai, Shengbang Tong, Xingjian Gao, Shao-Lun Huang, Zhihui Zhu, Chong You, Yi Ma | FAtt | 35 27 0 | 24 Oct 2022
A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models | Lijia Zhou, Frederic Koehler, Pragya Sur, Danica J. Sutherland, Nathan Srebro | - | 83 9 0 | 21 Oct 2022
Optimisation & Generalisation in Networks of Neurons | Jeremy Bernstein | AI4CE | 24 2 0 | 18 Oct 2022
Dimensionality of datasets in object detection networks | Ajay Chawda, A. Vierling, Karsten Berns | 3DPC | 10 0 0 | 13 Oct 2022
SGD with Large Step Sizes Learns Sparse Features | Maksym Andriushchenko, Aditya Varre, Loucas Pillaud-Vivien, Nicolas Flammarion | - | 45 56 0 | 11 Oct 2022
Rediscovery of Numerical Lüscher's Formula from the Neural Network | Yu Lu, Yijia Wang, YingChun Chen, Jia-Jun Wu | - | 23 1 0 | 05 Oct 2022
Block-wise Training of Residual Networks via the Minimizing Movement Scheme | Skander Karkar, Ibrahim Ayed, Emmanuel de Bézenac, Patrick Gallinari | - | 33 1 0 | 03 Oct 2022
The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels | Daniel Shwartz, Uri Stern, D. Weinshall | NoLa | 36 2 0 | 02 Oct 2022
On the Impossible Safety of Large AI Models | El-Mahdi El-Mhamdi, Sadegh Farhadkhani, R. Guerraoui, Nirupam Gupta, L. Hoang, Rafael Pinot, Sébastien Rouault, John Stephan | - | 37 31 0 | 30 Sep 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel | Sungyub Kim, Si-hun Park, Kyungsu Kim, Eunho Yang | BDL | 32 4 0 | 30 Sep 2022
On the Robustness of Random Forest Against Untargeted Data Poisoning: An Ensemble-Based Approach | M. Anisetti, C. Ardagna, Alessandro Balestrucci, Nicola Bena, Ernesto Damiani, C. Yeun | AAML, OOD | 34 10 0 | 28 Sep 2022
Why neural networks find simple solutions: the many regularizers of geometric complexity | Benoit Dherin, Michael Munn, M. Rosca, David Barrett | - | 57 31 0 | 27 Sep 2022
Deep Double Descent via Smooth Interpolation | Matteo Gamba, Erik Englesson, Mårten Björkman, Hossein Azizpour | - | 63 11 0 | 21 Sep 2022
Deep Linear Networks can Benignly Overfit when Shallow Ones Do | Niladri S. Chatterji, Philip M. Long | - | 23 8 0 | 19 Sep 2022
Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold | Can Yaras, Peng Wang, Zhihui Zhu, Laura Balzano, Qing Qu | - | 25 42 0 | 19 Sep 2022
Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty | Thomas George, Guillaume Lajoie, A. Baratin | - | 34 5 0 | 19 Sep 2022
Generalization Bounds for Deep Transfer Learning Using Majority Predictor Accuracy | Cuong N. Nguyen, L. Ho, Vu C. Dinh, Tal Hassner, Cuong V. Nguyen | - | 19 4 0 | 13 Sep 2022
Black-Box Audits for Group Distribution Shifts | Marc Juárez, Samuel Yeom, Matt Fredrikson | MLAU | 27 4 0 | 08 Sep 2022
Data-Driven Target Localization Using Adaptive Radar Processing and Convolutional Neural Networks | Shyam Venkatasubramanian, S. Gogineni, Bosung Kang, Ali Pezeshki, M. Rangaswamy, Vahid Tarokh | - | 44 3 0 | 07 Sep 2022
Generalisation under gradient descent via deterministic PAC-Bayes | Eugenio Clerico, Tyler Farghly, George Deligiannidis, Benjamin Guedj, Arnaud Doucet | - | 33 4 0 | 06 Sep 2022
Data Provenance via Differential Auditing | Xin Mu, Ming Pang, Feida Zhu | - | 19 1 0 | 04 Sep 2022
Instance-Dependent Noisy Label Learning via Graphical Modelling | Arpit Garg, Cuong C. Nguyen, Rafael Felix, Thanh-Toan Do, G. Carneiro | NoLa | 39 27 0 | 02 Sep 2022
PanorAMS: Automatic Annotation for Detecting Objects in Urban Context | Inske Groenen, S. Rudinac, M. Worring | - | 21 4 0 | 30 Aug 2022
Learning from Noisy Labels with Coarse-to-Fine Sample Credibility Modeling | Boshen Zhang, Yuxi Li, Yuanpeng Tu, Jinlong Peng, Yabiao Wang, Cunlin Wu, Yanghua Xiao, Cairong Zhao | NoLa | 38 6 0 | 23 Aug 2022
Intersection of Parallels as an Early Stopping Criterion | Ali Vardasbi, Maarten de Rijke, Mostafa Dehghani | MoMe | 41 5 0 | 19 Aug 2022
Universal Solutions of Feedforward ReLU Networks for Interpolations | Changcun Huang | - | 20 2 0 | 16 Aug 2022
Do Quantum Circuit Born Machines Generalize? | Kaitlin Gili, Mohamed Hibat-Allah, M. Mauri, C. Ballance, A. Perdomo-Ortiz | - | 33 29 0 | 27 Jul 2022
Learning from Data with Noisy Labels Using Temporal Self-Ensemble | Jun Ho Lee, J. Baik, Taebaek Hwang, J. Choi | NoLa | 28 1 0 | 21 Jul 2022
Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting | Neil Rohit Mallinar, James B. Simon, Amirhesam Abedsoltan, Parthe Pandit, M. Belkin, Preetum Nakkiran | - | 26 37 0 | 14 Jul 2022
PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners | Anthony Sicilia, Katherine Atwell, Malihe Alikhani, Seong Jae Hwang | BDL | 56 9 0 | 12 Jul 2022