Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.09913
Cited By
Visualizing the Loss Landscape of Neural Nets
28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Visualizing the Loss Landscape of Neural Nets"
50 / 1,039 papers shown
Title
Flat Posterior Does Matter For Bayesian Model Averaging
Sungjun Lim
Jeyoon Yeom
Sooyon Kim
Hoyoon Byun
Jinho Kang
Yohan Jung
Jiyoung Jung
Kyungwoo Song
AAML
BDL
43
0
0
21 Jun 2024
MEAT: Median-Ensemble Adversarial Training for Improving Robustness and Generalization
Zhaozhe Hu
Jia-Li Yin
Bin Chen
Luojun Lin
Bo-Hao Chen
Ximeng Liu
AAML
28
0
0
20 Jun 2024
Information Guided Regularization for Fine-tuning Language Models
Mandar Sharma
Nikhil Muralidhar
Shengzhe Xu
Raquib Bin Yousuf
Naren Ramakrishnan
33
0
0
20 Jun 2024
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models
Yili Wang
Kaixiong Zhou
Ninghao Liu
Ying Wang
Xin Wang
36
10
0
19 Jun 2024
BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning
Yi Liu
Cong Wang
Xingliang Yuan
AAML
39
2
0
18 Jun 2024
Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training
Akul Malhotra
S. Gupta
11
0
0
15 Jun 2024
Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality Generation
Lincan Cai
Shuang Li
Wenxuan Ma
Jingxuan Kang
Binhui Xie
Zixun Sun
Chengwei Zhu
MoE
MoMe
40
0
0
13 Jun 2024
What is Dataset Distillation Learning?
William Yang
Ye Zhu
Zhiwei Deng
Olga Russakovsky
DD
44
3
0
06 Jun 2024
A comprehensive and FAIR comparison between MLP and KAN representations for differential equations and operator networks
K. Shukla
Juan Diego Toscano
Zhicheng Wang
Zongren Zou
George Karniadakis
24
73
0
05 Jun 2024
Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS
Hao Fu
Tunhou Zhang
Hai Li
Yiran Chen
23
0
0
04 Jun 2024
Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training
Jiancheng Xie
Lou C. Kohler Voinov
Noga Mudrik
Gal Mishne
Adam Charles
GNN
25
0
0
04 Jun 2024
Understanding Token Probability Encoding in Output Embeddings
Hakaze Cho
Yoshihiro Sakai
Kenshiro Tanaka
Mariko Kato
Naoya Inoue
30
2
0
03 Jun 2024
On the Use of Anchoring for Training Vision Models
V. Narayanaswamy
Kowshik Thopalli
Rushil Anirudh
Yamen Mubarka
W. Sakla
Jayaraman J. Thiagarajan
35
0
0
01 Jun 2024
Understanding the Convergence in Balanced Resonate-and-Fire Neurons
Saya Higuchi
S. Bohté
Sebastian Otte
29
1
0
01 Jun 2024
RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection
Zhiyuan He
Pin-Yu Chen
Tsung-Yi Ho
36
12
0
30 May 2024
Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization
Ziqing Fan
Shengchao Hu
Jiangchao Yao
Gang Niu
Ya-Qin Zhang
Masashi Sugiyama
Yanfeng Wang
FedML
44
11
0
29 May 2024
Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Ruipeng Zhang
Ziqing Fan
Jiangchao Yao
Ya-Qin Zhang
Yanfeng Wang
36
7
0
29 May 2024
To FP8 and Back Again: Quantifying Reduced Precision Effects on LLM Training Stability
Joonhyung Lee
Jeongin Bae
Byeongwook Kim
S. Kwon
Dongsoo Lee
MQ
41
1
0
29 May 2024
Visualizing the loss landscape of Self-supervised Vision Transformer
Youngwan Lee
Jeffrey Willette
Jonghee Kim
Sung Ju Hwang
ViT
33
1
0
28 May 2024
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Yake Wei
Di Hu
27
13
0
28 May 2024
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models
Sheng-Hsuan Peng
Pin-Yu Chen
Matthew Hull
Duen Horng Chau
50
20
0
27 May 2024
Pretraining with Random Noise for Fast and Robust Learning without Weight Transport
Jeonghwan Cheon
Sang Wan Lee
Se-Bum Paik
OOD
120
1
0
27 May 2024
Does SGD really happen in tiny subspaces?
Minhak Song
Kwangjun Ahn
Chulhee Yun
61
4
1
25 May 2024
Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Pranshu Malviya
Jerry Huang
Quentin Fournier
Sarath Chandar
54
0
0
24 May 2024
Leakage-Resilient and Carbon-Neutral Aggregation Featuring the Federated AI-enabled Critical Infrastructure
Zehang Deng
Ruoxi Sun
Minhui Xue
Sheng Wen
S. Çamtepe
Surya Nepal
Yang Xiang
35
1
0
24 May 2024
Efficiency for Free: Ideal Data Are Transportable Representations
Peng Sun
Yi Jiang
Tao Lin
DD
36
0
0
23 May 2024
RoPINN: Region Optimized Physics-Informed Neural Networks
Haixu Wu
Huakun Luo
Yuezhou Ma
Jianmin Wang
Mingsheng Long
AI4CE
32
6
0
23 May 2024
Improving Generalization of Deep Neural Networks by Optimum Shifting
Yuyan Zhou
Ye Li
Lei Feng
Sheng-Jun Huang
OOD
ODL
25
0
0
23 May 2024
SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data
Sakshi Choudhary
Sai Aparna Aketi
Kaushik Roy
FedML
37
0
0
22 May 2024
Visualizing, Rethinking, and Mining the Loss Landscape of Deep Neural Networks
Xin-Chun Li
Lan Li
De-Chuan Zhan
33
2
0
21 May 2024
Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks
Xin-Chun Li
Jinli Tang
Bo Zhang
Lan Li
De-Chuan Zhan
41
2
0
21 May 2024
Using Degeneracy in the Loss Landscape for Mechanistic Interpretability
Lucius Bushnaq
Jake Mendel
Stefan Heimersheim
Dan Braun
Nicholas Goldowsky-Dill
Kaarel Hänni
Cindy Wu
Marius Hobbhahn
27
7
0
17 May 2024
Sharpness-Aware Minimization in Genetic Programming
I. Bakurov
N. Haut
Wolfgang Banzhaf
23
0
0
16 May 2024
Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Johann Schmidt
Sebastian Stober
43
1
0
06 May 2024
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Guoping Xu
Xiaxia Wang
Xinglong Wu
Xuesong Leng
Yongchao Xu
3DPC
32
8
0
02 May 2024
Generalization Measures for Zero-Shot Cross-Lingual Transfer
Saksham Bassi
Duygu Ataman
Kyunghyun Cho
24
0
0
24 Apr 2024
FedTrans: Efficient Federated Learning via Multi-Model Transformation
Yuxuan Zhu
Jiachen Liu
Mosharaf Chowdhury
Fan Lai
36
0
0
21 Apr 2024
A Hybrid Generative and Discriminative PointNet on Unordered Point Sets
Yang Ye
Shihao Ji
PINN
3DPC
33
0
0
19 Apr 2024
QGen: On the Ability to Generalize in Quantization Aware Training
Mohammadhossein Askarihemmat
Ahmadreza Jeddi
Reyhane Askari Hemmat
Ivan Lazarevich
Alexander Hoffman
Sudhakar Sah
Ehsan Saboori
Yvon Savaria
Jean-Pierre David
MQ
21
0
0
17 Apr 2024
Eliminating Catastrophic Overfitting Via Abnormal Adversarial Examples Regularization
Runqi Lin
Chaojian Yu
Tongliang Liu
AAML
30
9
0
11 Apr 2024
Adapting LLaMA Decoder to Vision Transformer
Jiahao Wang
Wenqi Shao
Mengzhao Chen
Chengyue Wu
Yong Liu
Taiqiang Wu
Kaipeng Zhang
Songyang Zhang
Kai-xiang Chen
Ping Luo
MLLM
38
4
0
10 Apr 2024
Slax: A Composable JAX Library for Rapid and Flexible Prototyping of Spiking Neural Networks
Thomas M. Summe
Siddharth Joshi
36
2
0
08 Apr 2024
Statistical Mechanics and Artificial Neural Networks: Principles, Models, and Applications
Lucas Böttcher
Gregory R. Wheeler
32
0
0
05 Apr 2024
Continual Learning with Weight Interpolation
Jkedrzej Kozal
Jan Wasilewski
Bartosz Krawczyk
Michal Wo'zniak
CLL
MoMe
34
6
0
05 Apr 2024
Revisiting Random Weight Perturbation for Efficiently Improving Generalization
Tao Li
Qinghua Tao
Weihao Yan
Zehao Lei
Yingwen Wu
Kun Fang
M. He
Xiaolin Huang
AAML
28
5
0
30 Mar 2024
Model Stock: All we need is just a few fine-tuned models
Dong-Hwan Jang
Sangdoo Yun
Dongyoon Han
OODD
MoMe
27
38
0
28 Mar 2024
Insights into the Lottery Ticket Hypothesis and Iterative Magnitude Pruning
Tausifa Jan Saleem
Ramanjit Ahuja
Surendra Prasad
Brejesh Lall
23
0
0
22 Mar 2024
Progressive trajectory matching for medical dataset distillation
Zhennaan Yu
Yang Liu
Qingchao Chen
DD
40
4
0
20 Mar 2024
Diversity-Aware Agnostic Ensemble of Sharpness Minimizers
Anh-Vu Bui
Vy Vo
Tung Pham
Dinh Q. Phung
Trung Le
FedML
UQCV
21
1
0
19 Mar 2024
Simple Ingredients for Offline Reinforcement Learning
Edoardo Cetin
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
Yann Ollivier
Ahmed Touati
OffRL
32
2
0
19 Mar 2024
Previous
1
2
3
4
5
...
19
20
21
Next