Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.09913
Cited By
Visualizing the Loss Landscape of Neural Nets
28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Visualizing the Loss Landscape of Neural Nets"
50 / 1,039 papers shown
Title
Friendly Sharpness-Aware Minimization
Tao Li
Pan Zhou
Zhengbao He
Xinwen Cheng
Xiaolin Huang
AAML
46
15
0
19 Mar 2024
Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon
Tianshuo Xu
Peng Mi
Ruilin Wang
Yingcong Chen
DiffM
27
6
0
14 Mar 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
27
1
0
11 Mar 2024
PeerAiD: Improving Adversarial Distillation from a Specialized Peer Tutor
Jaewon Jung
Hongsun Jang
Jaeyong Song
Jinho Lee
OOD
AAML
168
4
0
11 Mar 2024
Improve Generalization Ability of Deep Wide Residual Network with A Suitable Scaling Factor
Songtao Tian
Zixiong Yu
18
1
0
07 Mar 2024
T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers
Mariano V. Ntrougkas
Nikolaos Gkalelis
Vasileios Mezaris
FAtt
ViT
25
5
0
07 Mar 2024
On a Neural Implementation of Brenier's Polar Factorization
Nina Vesseron
Marco Cuturi
39
2
0
05 Mar 2024
Sensitivity Analysis On Loss Landscape
Salman Faroz
16
0
0
02 Mar 2024
Merging Text Transformer Models from Different Initializations
Neha Verma
Maha Elbayad
MoMe
56
7
0
01 Mar 2024
Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms
Toki Tahmid Inan
Mingrui Liu
Amarda Shehu
27
0
0
01 Mar 2024
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Xiaomeng Hu
Pin-Yu Chen
Tsung-Yi Ho
AAML
24
26
0
01 Mar 2024
Fine-tuning with Very Large Dropout
Jianyu Zhang
Léon Bottou
37
1
0
01 Mar 2024
Improving Group Connectivity for Generalization of Federated Deep Learning
Zexi Li
Jie Lin
Zhiqi Li
Didi Zhu
Chao Wu
AI4CE
FedML
38
0
0
29 Feb 2024
Gradient Alignment for Cross-Domain Face Anti-Spoofing
B. Le
Simon S. Woo
CVBM
38
19
0
29 Feb 2024
Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization
Zirui Zhu
Yong Liu
Zangwei Zheng
Huifeng Guo
Yang You
19
0
0
23 Feb 2024
Investigating the Histogram Loss in Regression
Ehsan Imani
Kai Luedemann
Sam Scholnick-Hughes
Esraa Elelimy
Martha White
UQCV
34
5
0
20 Feb 2024
Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local Minima
Shan Zhong
Zhongzhan Huang
Daifeng Li
Wushao Wen
Jinghui Qin
Liang Lin
22
12
0
17 Feb 2024
Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length
N. Lan
Emmanuel Chemla
Roni Katzir
13
2
0
15 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
30
6
0
14 Feb 2024
On Differentially Private Subspace Estimation in a Distribution-Free Setting
Eliad Tsfadia
15
1
0
09 Feb 2024
Tradeoffs of Diagonal Fisher Information Matrix Estimators
Alexander Soen
Ke Sun
14
1
0
08 Feb 2024
Strong convexity-guided hyper-parameter optimization for flatter losses
Rahul Yedida
Snehanshu Saha
16
0
0
07 Feb 2024
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
Sungyoon Kim
Mert Pilanci
35
4
0
06 Feb 2024
Careful with that Scalpel: Improving Gradient Surgery with an EMA
Yu-Guan Hsieh
James Thornton
Eugène Ndiaye
Michal Klein
Marco Cuturi
Pierre Ablin
MedIm
31
0
0
05 Feb 2024
Balanced Resonate-and-Fire Neurons
Saya Higuchi
Sebastian Kairat
S. Bohté
Sebastian Otte
17
6
0
02 Feb 2024
Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion
Zexi Li
Zhiqi Li
Jie Lin
Tao Shen
Tao Lin
Chao Wu
28
4
0
02 Feb 2024
LTAU-FF: Loss Trajectory Analysis for Uncertainty in Atomistic Force Fields
Joshua A. Vita
Amit Samanta
Fei Zhou
Vincenzo Lordi
23
2
0
01 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
29
3
0
31 Jan 2024
Towards Assessing the Synthetic-to-Measured Adversarial Vulnerability of SAR ATR
Bowen Peng
Bo Peng
Jingyuan Xia
Tianpeng Liu
Yongxiang Liu
Li Liu
AAML
26
4
0
30 Jan 2024
Speeding up and reducing memory usage for scientific machine learning via mixed precision
Joel Hayford
Jacob Goldman-Wetzler
Eric Wang
Lu Lu
49
8
0
30 Jan 2024
Accelerating superconductor discovery through tempered deep learning of the electron-phonon spectral function
Jason B. Gibson
A. Hire
P. M. Dee
Oscar Barrera
Benjamin Geisler
P. Hirschfeld
R. G. Hennig
13
4
0
29 Jan 2024
Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators
Yaniv Blumenfeld
Itay Hubara
Daniel Soudry
37
3
0
25 Jan 2024
Towards Effective and General Graph Unlearning via Mutual Evolution
Xunkai Li
Yulin Zhao
Zhengyu Wu
Wentao Zhang
Ronghua Li
Guoren Wang
MU
25
14
0
22 Jan 2024
Momentum-SAM: Sharpness Aware Minimization without Computational Overhead
Marlon Becker
Frederick Altrock
Benjamin Risse
74
5
0
22 Jan 2024
Understanding the Generalization Benefits of Late Learning Rate Decay
Yinuo Ren
Chao Ma
Lexing Ying
AI4CE
24
6
0
21 Jan 2024
Bag of Tricks to Boost Adversarial Transferability
Zeliang Zhang
Rongyi Zhu
Wei Yao
Xiaosen Wang
Chenliang Xu
AAML
47
9
0
16 Jan 2024
A topological description of loss surfaces based on Betti Numbers
Maria Sofia Bucarelli
Giuseppe Alessio D’Inverno
Monica Bianchini
F. Scarselli
Fabrizio Silvestri
25
1
0
08 Jan 2024
Data-Driven Physics-Informed Neural Networks: A Digital Twin Perspective
Sunwoong Yang
Hojin Kim
Y. Hong
K. Yee
R. Maulik
Namwoo Kang
PINN
AI4CE
18
17
0
05 Jan 2024
f
f
f
-Divergence Based Classification: Beyond the Use of Cross-Entropy
Nicola Novello
Andrea M. Tonello
22
7
0
02 Jan 2024
On the Necessity of Metalearning: Learning Suitable Parameterizations for Learning Processes
Massinissa Hamidi
A. Osmani
19
0
0
31 Dec 2023
Universal Pyramid Adversarial Training for Improved ViT Performance
Ping Yeh-Chiang
Yipin Zhou
Omid Poursaeed
S. Narayan
Shukla
Tom Goldstein
Ser-Nam Lim
AAML
ViT
14
0
0
26 Dec 2023
CR-SAM: Curvature Regularized Sharpness-Aware Minimization
Tao Wu
Tie Luo
D. C. Wunsch
14
3
0
21 Dec 2023
Enhancing Neural Training via a Correlated Dynamics Model
Jonathan Brokman
Roy Betser
Rotem Turjeman
Tom Berkov
I. Cohen
Guy Gilboa
24
3
0
20 Dec 2023
Sparse is Enough in Fine-tuning Pre-trained Large Language Models
Weixi Song
Z. Li
Lefei Zhang
Hai Zhao
Bo Du
VLM
19
6
0
19 Dec 2023
NAC-TCN: Temporal Convolutional Networks with Causal Dilated Neighborhood Attention for Emotion Understanding
Alexander Mehta
William Yang
ViT
37
1
0
12 Dec 2023
Continual Learning through Networks Splitting and Merging with Dreaming-Meta-Weighted Model Fusion
Yi Sun
Xin Xu
Jian Li
Guanglei Xie
Yifei Shi
Qiang Fang
CLL
MoMe
24
1
0
12 Dec 2023
Measurement-driven neural-network training for integrated magnetic tunnel junction arrays
W. A. Borders
A. Madhavan
M. Daniels
Vasileia Georgiou
Martin Lueker-Boden
Tiffany S. Santos
Patrick M. Braganca
M. D. Stiles
Jabez J. McClelland
Brian D. Hoskins
25
3
0
11 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
35
0
0
08 Dec 2023
On The Fairness Impacts of Hardware Selection in Machine Learning
Sree Harsha Nelaturu
Nishaanth Kanna Ravichandran
Cuong Tran
Sara Hooker
Ferdinando Fioretto
45
2
0
06 Dec 2023
Generalisable Agents for Neural Network Optimisation
Kale-ab Tessera
C. Tilbury
Sasha Abramowitz
Ruan de Kock
Omayma Mahjoub
Benjamin Rosman
Sara Hooker
Arnu Pretorius
AI4CE
20
0
0
30 Nov 2023
Previous
1
2
3
4
5
6
...
19
20
21
Next