Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.09913
Cited By
Visualizing the Loss Landscape of Neural Nets
28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Visualizing the Loss Landscape of Neural Nets"
50 / 1,039 papers shown
Title
ICE-Pruning: An Iterative Cost-Efficient Pruning Pipeline for Deep Neural Networks
Wenhao Hu
Paul Henderson
José Cano
25
0
0
12 May 2025
Towards the Three-Phase Dynamics of Generalization Power of a DNN
Yuxuan He
Junpeng Zhang
Hongyuan Zhang
Quanshi Zhang
AI4CE
26
0
0
11 May 2025
Low-Loss Space in Neural Networks is Continuous and Fully Connected
Yongding Tian
Zaid Al-Ars
Maksim Kitsak
P. Hofstee
3DPC
26
0
0
05 May 2025
The effect of the number of parameters and the number of local feature patches on loss landscapes in distributed quantum neural networks
Yoshiaki Kawase
71
0
0
27 Apr 2025
Unveiling and Mitigating Adversarial Vulnerabilities in Iterative Optimizers
Elad Sofer
Tomer Shaked
Caroline Chaux
Nir Shlezinger
AAML
33
0
0
26 Apr 2025
DNAD: Differentiable Neural Architecture Distillation
Xuan Rao
Bo Zhao
Derong Liu
34
1
0
25 Apr 2025
The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks
Nikita Gabdullin
16
0
0
24 Apr 2025
Enhancing Variational Autoencoders with Smooth Robust Latent Encoding
Hyomin Lee
Minseon Kim
Sangwon Jang
Jongheon Jeong
S. Hwang
DiffM
AAML
39
0
0
24 Apr 2025
Regularizing Differentiable Architecture Search with Smooth Activation
Yanlin Zhou
Mostafa El-Khamy
Kee-Bong Song
17
0
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
48
0
0
22 Apr 2025
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
Ashkan Shakarami
Yousef Yeganeh
Azade Farshad
Lorenzo Nicolè
Stefano Ghidoni
Nassir Navab
52
0
0
21 Apr 2025
Connecting Parameter Magnitudes and Hessian Eigenspaces at Scale using Sketched Methods
Andres Fernandez
Frank Schneider
Maren Mahsereci
Philipp Hennig
28
0
0
20 Apr 2025
Generative Classifier for Domain Generalization
Shaocong Long
Qianyu Zhou
X. Li
Chenhao Ying
Yunhai Tong
Lizhuang Ma
Yuan Luo
Dacheng Tao
36
0
0
03 Apr 2025
Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions
Tahmid Hasan Prato
Seijoon Kim
Lizhong Chen
Sanghyun Hong
AAML
33
0
0
02 Apr 2025
Plane-Wave Decomposition and Randomised Training; a Novel Path to Generalised PINNs for SHM
Rory Clements
James Ellis
Geoff Hassall
Simon Horsley
Gavin Tabor
50
0
0
31 Mar 2025
Deep Learning for Forensic Identification of Source
Cole Patten
Christopher Saunders
Michael Puthawala
35
0
0
26 Mar 2025
FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors
Changlong Shi
He Zhao
Bingjie Zhang
Mingyuan Zhou
Dandan Guo
Yi Chang
42
0
0
20 Mar 2025
FedLWS: Federated Learning with Adaptive Layer-wise Weight Shrinking
Changlong Shi
Jinmeng Li
He Zhao
D. Guo
Yi Chang
FedML
47
0
0
19 Mar 2025
Temporal Flexibility in Spiking Neural Networks: Towards Generalization Across Time Steps and Deployment Friendliness
Kangrui Du
Yuhang Wu
Shikuang Deng
Shi Gu
38
0
0
18 Mar 2025
Layer-wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning
Sunwoo Lee
98
0
0
18 Mar 2025
Mitigating Spectral Bias in Neural Operators via High-Frequency Scaling for Physical Systems
Siavash Khodakarami
Vivek Oommen
Aniruddha Bora
George Karniadakis
AI4CE
60
1
0
17 Mar 2025
Hamiltonian Neural Networks for Robust Out-of-Time Credit Scoring
Javier Marín
70
0
0
13 Mar 2025
Analyzing the Role of Permutation Invariance in Linear Mode Connectivity
Keyao Zhan
Puheng Li
Lei Wu
MoMe
77
0
0
13 Mar 2025
Architecture-Aware Minimization (A
2
^2
2
M): How to Find Flat Minima in Neural Architecture Search
Matteo Gambella
Fabrizio Pittorino
Manuel Roveri
61
0
0
13 Mar 2025
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild
Damien Teney
Liangze Jiang
Florin Gogianu
Ehsan Abbasnejad
145
0
0
13 Mar 2025
Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching
Nannan Wu
Zhuo Kuang
Zengqiang Yan
Ping Wang
Li Yu
FedML
49
0
0
12 Mar 2025
Extra Clients at No Extra Cost: Overcome Data Heterogeneity in Federated Learning with Filter Decomposition
Wei Chen
Qiang Qiu
FedML
62
0
0
11 Mar 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting
Linqi Yang
Xiongwei Zhao
Qihao Sun
Ke Wang
Ao Chen
Peng Kang
3DGS
76
2
0
07 Mar 2025
When Can You Get Away with Low Memory Adam?
Dayal Singh Kalra
John Kirchenbauer
M. Barkeshli
Tom Goldstein
69
0
0
03 Mar 2025
STAR: Stability-Inducing Weight Perturbation for Continual Learning
Masih Eskandar
Tooba Imtiaz
Davin Hill
Zifeng Wang
Jennifer Dy
CLL
34
0
0
03 Mar 2025
Parameter Expanded Stochastic Gradient Markov Chain Monte Carlo
Hyunsu Kim
G. Nam
Chulhee Yun
Hongseok Yang
Juho Lee
BDL
UQCV
52
0
0
02 Mar 2025
Re-Imagining Multimodal Instruction Tuning: A Representation View
Yiyang Liu
James Liang
Ruixiang Tang
Yugyung Lee
Majid Rabbani
...
Raghuveer M. Rao
Lifu Huang
Dongfang Liu
Qifan Wang
Cheng Han
111
0
0
02 Mar 2025
VRM: Knowledge Distillation via Virtual Relation Matching
W. Zhang
Fei Xie
Weidong Cai
Chao Ma
71
0
0
28 Feb 2025
Large Language Models as Attribution Regularizers for Efficient Model Training
Davor Vukadin
Marin Šilić
Goran Delač
36
0
0
27 Feb 2025
Bayesian Computation in Deep Learning
Wenlong Chen
Bolian Li
Ruqi Zhang
Yingzhen Li
BDL
75
0
0
25 Feb 2025
Reasoning Bias of Next Token Prediction Training
Pengxiao Lin
Zhongwang Zhang
Zhi-Qin John Xu
LRM
80
1
0
21 Feb 2025
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism
Yu Liang
Wenjie Wei
A. Belatreche
Honglin Cao
Zijian Zhou
Shuai Wang
Malu Zhang
Y. Yang
MQ
61
0
0
21 Feb 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Enrico M. Malatesta
46
4
0
20 Feb 2025
UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models
Huawei Lin
Yingjie Lao
Tong Geng
Tan Yu
Weijie Zhao
AAML
SILM
79
2
0
18 Feb 2025
Computational Safety for Generative AI: A Signal Processing Perspective
Pin-Yu Chen
68
1
0
18 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
49
2
0
29 Jan 2025
CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling
Kaiyuan Zhang
Siyuan Cheng
Guangyu Shen
Bruno Ribeiro
Shengwei An
Pin-Yu Chen
X. Zhang
Ninghui Li
90
1
0
28 Jan 2025
Understanding the Functional Roles of Modelling Components in Spiking Neural Networks
Huifeng Yin
Hanle Zheng
Jiayi Mao
Siyuan Ding
Xing Liu
M. Xu
Yifan Hu
Jing Pei
Lei Deng
42
1
0
28 Jan 2025
Field-level simulation-based inference with galaxy catalogs: the impact of systematic effects
Natalí S. M. de Santi
F. Villaescusa-Navarro
L. Abramo
Helen Shao
Lucia A. Perez
...
F. Marinacci
D. Spergel
K. Dolag
L. Hernquist
M. Vogelsberger
59
4
0
28 Jan 2025
Gradient Networks
Shreyas Chaudhari
Srinivasa Pranav
J. M. F. Moura
50
0
0
28 Jan 2025
Persistence of Backdoor-based Watermarks for Neural Networks: A Comprehensive Evaluation
Anh Tu Ngo
Chuan Song Heng
Nandish Chattopadhyay
Anupam Chattopadhyay
AAML
107
0
0
06 Jan 2025
A ghost mechanism: An analytical model of abrupt learning
Fatih Dinc
Ege Cirakman
Yiqi Jiang
Mert Yuksekgonul
Mark J. Schnitzer
Hidenori Tanaka
33
2
0
04 Jan 2025
Time-independent Spiking Neuron via Membrane Potential Estimation for Efficient Spiking Neural Networks
Hanqi Chen
Lixing Yu
Shaojie Zhan
Penghui Yao
Jiankun Shao
42
1
0
31 Dec 2024
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
31
0
0
27 Dec 2024
Non-Uniform Parameter-Wise Model Merging
Albert Manuel Orozco Camacho
Stefan Horoi
Guy Wolf
Eugene Belilovsky
MoMe
FedML
83
0
0
20 Dec 2024
1
2
3
4
...
19
20
21
Next