Understanding deep learning requires rethinking generalization

10 November 2016

Benjamin Recht

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 1,110 papers shown

Title
A comprehensive study on the prediction reliability of graph neural networks for virtual screening Soojung Yang K. Lee Seongok Ryu 26 7 0 17 Mar 2020
What Information Does a ResNet Compress? L. N. Darlow Amos Storkey SSL 32 11 0 13 Mar 2020
Analyzing Visual Representations in Embodied Navigation Tasks Erik Wijmans Julian Straub Dhruv Batra Irfan Essa Judy Hoffman Ari S. Morcos 21 2 0 12 Mar 2020
SASL: Saliency-Adaptive Sparsity Learning for Neural Network Acceleration Jun Shi Jianfeng Xu K. Tasaka Zhibo Chen 8 25 0 12 Mar 2020
A Mean-field Analysis of Deep ResNet and Beyond: Towards Provable Optimization Via Overparameterization From Depth Yiping Lu Chao Ma Yulong Lu Jianfeng Lu Lexing Ying MLT 44 78 0 11 Mar 2020
SuperMix: Supervising the Mixing Data Augmentation Ali Dabouei Sobhan Soleymani Fariborz Taherkhani Nasser M. Nasrabadi 24 98 0 10 Mar 2020
AL2: Progressive Activation Loss for Learning General Representations in Classification Neural Networks Majed El Helou Frederike Dumbgen Sabine Süsstrunk CLL AI4CE 30 2 0 07 Mar 2020
The Variational InfoMax Learning Objective Vincenzo Crescimanna Bruce P. Graham 23 0 0 07 Mar 2020
Combating noisy labels by agreement: A joint training method with co-regularization Hongxin Wei Lei Feng Xiangyu Chen Bo An NoLa 319 501 0 05 Mar 2020
Analyzing Accuracy Loss in Randomized Smoothing Defenses Yue Gao Harrison Rosenberg Kassem Fawaz S. Jha Justin Hsu AAML 24 6 0 03 Mar 2020
Towards Noise-resistant Object Detection with Noisy Annotations Junnan Li Caiming Xiong R. Socher Guosheng Lin ObjD NoLa 70 29 0 03 Mar 2020
Iterative Averaging in the Quest for Best Test Error Diego Granziol Xingchen Wan Samuel Albanie Stephen J. Roberts 29 3 0 02 Mar 2020
Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime Stéphane d'Ascoli Maria Refinetti Giulio Biroli Florent Krzakala 100 152 0 02 Mar 2020
Out-of-Distribution Generalization via Risk Extrapolation (REx) David M. Krueger Ethan Caballero J. Jacobsen Amy Zhang Jonathan Binas Dinghuai Zhang Rémi Le Priol Aaron Courville OOD 215 910 0 02 Mar 2020
Do CNNs Encode Data Augmentations? Eddie Q. Yan Yanping Huang OOD 23 5 0 29 Feb 2020
Overfitting in adversarially robust deep learning Leslie Rice Eric Wong Zico Kolter 47 788 0 26 Feb 2020
Predicting Neural Network Accuracy from Weights Thomas Unterthiner Daniel Keysers Sylvain Gelly Olivier Bousquet Ilya O. Tolstikhin 30 101 0 26 Feb 2020
Understanding Self-Training for Gradual Domain Adaptation Ananya Kumar Tengyu Ma Percy Liang CLL TTA 28 228 0 26 Feb 2020
Generalized Product Quantization Network for Semi-supervised Image Retrieval Young Kyun Jang N. Cho 26 38 0 26 Feb 2020
Convex Geometry and Duality of Over-parameterized Neural Networks Tolga Ergen Mert Pilanci MLT 49 54 0 25 Feb 2020
On Feature Normalization and Data Augmentation Boyi Li Felix Wu Ser-Nam Lim Serge J. Belongie Kilian Q. Weinberger 26 134 0 25 Feb 2020
Understanding and Mitigating the Tradeoff Between Robustness and Accuracy Aditi Raghunathan Sang Michael Xie Fanny Yang John C. Duchi Percy Liang AAML 51 224 0 25 Feb 2020
Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization S. Chatterjee ODL OOD 13 51 0 25 Feb 2020
Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence Nicolas Loizou Sharan Vaswani I. Laradji Simon Lacoste-Julien 29 182 0 24 Feb 2020
The Early Phase of Neural Network Training Jonathan Frankle D. Schwab Ari S. Morcos 23 173 0 24 Feb 2020
An Optimization and Generalization Analysis for Max-Pooling Networks Alon Brutzkus Amir Globerson MLT AI4CE 16 4 0 22 Feb 2020
Generalisation error in learning with random features and the hidden manifold model Federica Gerace Bruno Loureiro Florent Krzakala M. Mézard Lenka Zdeborová 30 166 0 21 Feb 2020
Bayesian Deep Learning and a Probabilistic Perspective of Generalization A. Wilson Pavel Izmailov UQCV BDL OOD 24 642 0 20 Feb 2020
Implicit Regularization of Random Feature Models Arthur Jacot Berfin Simsek Francesco Spadaro Clément Hongler Franck Gabriel 36 82 0 19 Feb 2020
Identifying Critical Neurons in ANN Architectures using Mixed Integer Programming M. Elaraby Guy Wolf Margarida Carvalho 26 5 0 17 Feb 2020
Learning Not to Learn in the Presence of Noisy Labels Liu Ziyin Blair Chen Ru Wang Paul Pu Liang Ruslan Salakhutdinov Louis-Philippe Morency Masahito Ueda NoLa 26 18 0 16 Feb 2020
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks Carlos Aspillaga Andrés Carvallo Vladimir Araujo ELM 47 31 0 14 Feb 2020
Self-Distillation Amplifies Regularization in Hilbert Space H. Mobahi Mehrdad Farajtabar Peter L. Bartlett 43 229 0 13 Feb 2020
The Conditional Entropy Bottleneck Ian S. Fischer OOD 29 117 0 13 Feb 2020
Topologically Densified Distributions Christoph Hofer Florian Graf Marc Niethammer Roland Kwitt 27 15 0 12 Feb 2020
Object Detection as a Positive-Unlabeled Problem Yuewei Yang Kevin J. Liang Lawrence Carin 30 38 0 11 Feb 2020
Distribution Approximation and Statistical Estimation Guarantees of Generative Adversarial Networks Minshuo Chen Wenjing Liao H. Zha Tuo Zhao 28 15 0 10 Feb 2020
A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima Zeke Xie Issei Sato Masashi Sugiyama ODL 30 17 0 10 Feb 2020
Semi-Supervised Class Discovery Jeremy Nixon J. Liu David Berthelot 20 2 0 10 Feb 2020
Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural Networks Blake Bordelon Abdulkadir Canatar Cengiz Pehlevan 149 201 0 07 Feb 2020
A Precise High-Dimensional Asymptotic Theory for Boosting and Minimum-$\ell_1$-Norm Interpolated Classifiers Tengyuan Liang Pragya Sur 50 68 0 05 Feb 2020
TDEFSI: Theory Guided Deep Learning Based Epidemic Forecasting with Synthetic Information Lijing Wang Jiangzhuo Chen Madhav Marathe AI4TS 31 19 0 28 Jan 2020
QActor: On-line Active Learning for Noisy Labeled Stream Data Taraneh Younesian Zilong Zhao Amirmasoud Ghiassi Robert Birke L. Chen 23 5 0 28 Jan 2020
How Much Position Information Do Convolutional Neural Networks Encode? Md. Amirul Islam Sen Jia Neil D. B. Bruce SSL 205 344 0 22 Jan 2020
Optimized Generic Feature Learning for Few-shot Classification across Domains Tonmoy Saikia Thomas Brox Cordelia Schmid VLM 30 48 0 22 Jan 2020
Memory capacity of neural networks with threshold and ReLU activations Roman Vershynin 33 21 0 20 Jan 2020
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network Jungkyu Lee Taeryun Won Tae Kwan Lee Hyemin Lee Geonmo Gu K. Hong 34 57 0 17 Jan 2020
Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study Jinlan Fu Pengfei Liu Qi Zhang Xuanjing Huang AI4CE 35 73 0 12 Jan 2020
Confidence Scores Make Instance-dependent Label-noise Learning Possible Antonin Berthon Bo Han Gang Niu Tongliang Liu Masashi Sugiyama NoLa 49 105 0 11 Jan 2020
Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning Han-Jia Ye Hong-You Chen De-Chuan Zhan Wei-Lun Chao 39 99 0 06 Jan 2020