Visualizing the Loss Landscape of Neural Nets

28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein

Papers citing "Visualizing the Loss Landscape of Neural Nets"

50 / 1,039 papers shown
Title
SSE-SAM: Balancing Head and Tail Classes Gradually through Stage-Wise SAM
Xingyu Lyu
Qianqian Xu
Zhiyong Yang
Shaojie Lyu
Qingming Huang
74
0
0
18 Dec 2024
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Aodi Li
Liansheng Zhuang
Xiao Long
Minghong Yao
Shafei Wang
175
0
0
18 Dec 2024
LossLens: Diagnostics for Machine Learning through Loss Landscape Visual Analytics
Tiankai Xie
Jiaqing Chen
Yaoqing Yang
Caleb Geniesse
Ge Shi
...
J. Cava
Michael W. Mahoney
Talita Perciano
Gunther H. Weber
Ross Maciejewski
72
0
0
17 Dec 2024
Meta Curvature-Aware Minimization for Domain Generalization
Z. Chen
Yiwen Ye
Feilong Tang
Yongsheng Pan
Yong-quan Xia
BDL
179
1
0
16 Dec 2024
Set-Valued Sensitivity Analysis of Deep Neural Networks
Xin Wang
Feiling Wang
X. Ban
70
0
0
15 Dec 2024
Path-Guided Particle-based Sampling
Mingzhou Fan
Ruida Zhou
C. Tian
Xiaoning Qian
79
4
0
04 Dec 2024
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Zhixu Tao
I. Mason
Sanjeev R. Kulkarni
Xavier Boix
MoMe
FedML
82
3
0
27 Nov 2024
FREE-Merging: Fourier Transform for Efficient Model Merging
Shenghe Zheng
Hongzhi Wang
MoMe
77
0
0
25 Nov 2024
Towards Accurate and Efficient Sub-8-Bit Integer Training
Wenjin Guo
Donglai Liu
Weiying Xie
Yunsong Li
Xuefei Ning
Zihan Meng
Shulin Zeng
Jie Lei
Zhenman Fang
Yu Wang
MQ
34
1
0
17 Nov 2024
Deep Loss Convexification for Learning Iterative Models
Ziming Zhang
Yuping Shao
Yiqing Zhang
Fangzhou Lin
Haichong K. Zhang
Elke Rundensteiner
3DPC
36
0
0
16 Nov 2024
Evaluating Loss Landscapes from a Topology Perspective
Tiankai Xie
Caleb Geniesse
Jiaqing Chen
Yaoqing Yang
Dmitriy Morozov
Michael W. Mahoney
Ross Maciejewski
Gunther H. Weber
23
1
0
14 Nov 2024
Enhancing generalization in high energy physics using white-box adversarial attacks
Franck Rothen
Samuel Klein
Matthew Leigh
T. Golling
AAML
31
1
0
14 Nov 2024
Unraveling the Gradient Descent Dynamics of Transformers
Bingqing Song
Boran Han
Shuai Zhang
Jie Ding
Mingyi Hong
AI4CE
36
1
0
12 Nov 2024
Stepping Forward on the Last Mile
Chen Feng
Shaojie Zhuo
Xiaopeng Zhang
R. Ramakrishnan
Zhaocong Yuan
Andrew Zou Li
33
0
0
06 Nov 2024
Stein Variational Newton Neural Network Ensembles
Klemens Flöge
Mohammed Abdul Moeed
Vincent Fortuin
BDL
UQCV
37
0
0
04 Nov 2024
Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks
Jim Zhao
Sidak Pal Singh
Aurélien Lucchi
AI4CE
39
0
0
04 Nov 2024
Visual Fourier Prompt Tuning
Runjia Zeng
Cheng Han
Qifan Wang
Chunshu Wu
Tong Geng
Lifu Huang
Ying Nian Wu
Dongfang Liu
VPVLM
VLM
48
6
0
02 Nov 2024
Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame
Evan Markou
Thalaiyasingam Ajanthan
Stephen Gould
26
0
0
02 Nov 2024
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Simon Rampp
M. Milling
Andreas Triantafyllopoulos
Björn Schuller
26
1
0
01 Nov 2024
Mitigating Gradient Overlap in Deep Residual Networks with Gradient Normalization for Improved Non-Convex Optimization
Juyoung Yun
19
2
0
28 Oct 2024
Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition
Mengke Li
Y. Liu
Yang Lu
Yiqun Zhang
Yiu-ming Cheung
Hui Huang
VLM
33
2
0
28 Oct 2024
Relaxed Equivariance via Multitask Learning
Ahmed A. A. Elhag
T. Konstantin Rusch
Francesco Di Giovanni
Michael Bronstein
42
2
0
23 Oct 2024
Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Yihong Luo
Yuhan Chen
Siya Qiu
Yiwei Wang
Chen Zhang
Yan Zhou
Xiaochun Cao
Jing Tang
AAML
27
2
0
22 Oct 2024
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs
Zifei Xu
Sayeh Sharify
W. Yazar
T. Webb
Xin Eric Wang
MQ
38
0
0
18 Oct 2024
Transformer-Based Approaches for Sensor-Based Human Activity Recognition: Opportunities and Challenges
Clayton Frederick Souza Leite
Henry Mauranen
Aziza Zhanabatyrova
Yu Xiao
24
1
0
17 Oct 2024
Loss Landscape Characterization of Neural Networks without Over-Parametrization
Rustem Islamov
Niccolò Ajroldi
Antonio Orvieto
Aurélien Lucchi
33
4
0
16 Oct 2024
Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP)
Mohammad Asif Ibna Mustafa
Ferdinand Heinrich
AI4TS
22
0
0
14 Oct 2024
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
Minghao Zhu
Zhengpu Wang
Mengxian Hu
Ronghao Dang
Xiao Lin
Xun Zhou
Chengju Liu
Qijun Chen
30
1
0
14 Oct 2024
Stein Variational Evolution Strategies
Cornelius V. Braun
Robert T. Lange
Marc Toussaint
26
0
0
14 Oct 2024
Lambda-Skip Connections: the architectural component that prevents Rank Collapse
Federico Arangath Joseph
Jerome Sieber
M. Zeilinger
Carmen Amo Alonso
33
0
0
14 Oct 2024
Growing Efficient Accurate and Robust Neural Networks on the Edge
Vignesh Sundaresha
Naresh Shanbhag
15
0
0
10 Oct 2024
Adversarial Robustness Overestimation and Instability in TRADES
Jonathan Weiping Li
Ren-Wei Liang
Cheng-Han Yeh
Cheng-Chang Tsai
Kuanchun Yu
Chun-Shien Lu
Shang-Tse Chen
AAML
41
0
0
10 Oct 2024
Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Ann Huang
Satpreet H. Singh
Kanaka Rajan
19
0
0
04 Oct 2024
PRF: Parallel Resonate and Fire Neuron for Long Sequence Learning in Spiking Neural Networks
Yulong Huang
Zunchang Liu
Changchun Feng
Xiaopeng Lin
Hongwei Ren
Haotian Fu
Yue Zhou
Hong Xing
Bojun Cheng
36
1
0
04 Oct 2024
Mitigating Memorization In Language Models
Mansi Sakarvadia
Aswathy Ajith
Arham Khan
Nathaniel Hudson
Caleb Geniesse
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
KELM
MU
50
0
0
03 Oct 2024
Towards Model Discovery Using Domain Decomposition and PINNs
Tirtho S. Saha
Alexander Heinlein
Cordula Reisch
PINN
11
0
0
02 Oct 2024
Basis-to-Basis Operator Learning Using Function Encoders
Tyler Ingebrand
Adam J. Thorpe
Somdatta Goswami
Krishna Kumar
Ufuk Topcu
11
3
0
30 Sep 2024
Do Influence Functions Work on Large Language Models?
Zhe Li
Wei Zhao
Yige Li
Jun Sun
TDI
28
1
0
30 Sep 2024
CycleBNN: Cyclic Precision Training in Binary Neural Networks
Federico Fontana
Romeo Lanzino
Anxhelo Diko
G. Foresti
Luigi Cinque
MQ
34
0
0
28 Sep 2024
Kendall's $τ$ Coefficient for Logits Distillation
Yuchen Guan
Runxi Cheng
Kang Liu
Chun Yuan
26
0
0
26 Sep 2024
Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement
Haodong Li
Hao Lu
Ying-Cong Chen
28
1
0
25 Sep 2024
Super Level Sets and Exponential Decay: A Synergistic Approach to Stable Neural Network Training
J. Chaudhary
Dipak Nidhi
J. Heikkonen
H. Merisaari
R. Kanth
21
0
0
25 Sep 2024
Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
Suayb S. Arslan
23
2
0
24 Sep 2024
Revisiting Video Quality Assessment from the Perspective of Generalization
Xinli Yue
Jianhui Sun
Liangchao Yao
Fan Xia
Yuetang Deng
...
Lei Li
Fengyun Rao
Jing Lv
Qian Wang
Lingchen Zhao
MoMe
23
0
0
23 Sep 2024
Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape
Tao Li
Zhengbao He
Yujun Li
Yasheng Wang
Lifeng Shang
X. Huang
51
0
0
22 Sep 2024
UU-Mamba: Uncertainty-aware U-Mamba for Cardiovascular Segmentation
Ting Yu Tsai
Li Lin
Shu Hu
Connie W. Tsao
Xin Li
Ming-Ching Chang
Hongtu Zhu
Xin Wang
Mamba
43
1
0
22 Sep 2024
Bilateral Sharpness-Aware Minimization for Flatter Minima
Jiaxin Deng
Junbiao Pang
Baochang Zhang
Qingming Huang
AAML
104
0
0
20 Sep 2024
Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes
Nikita Kiselev
Andrey Grabovoy
41
1
0
18 Sep 2024
Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement
Zizhen Lin
Yuanle Li
Junyu Wang
Ruili Li
34
0
0
18 Sep 2024
Flash STU: Fast Spectral Transform Units
Y. Isabel Liu
Windsor Nguyen
Yagiz Devre
Evan Dogariu
Anirudha Majumdar
Elad Hazan
AI4TS
70
1
0
16 Sep 2024