ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.09913
  4. Cited By
Visualizing the Loss Landscape of Neural Nets

Visualizing the Loss Landscape of Neural Nets

28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
ArXivPDFHTML

Papers citing "Visualizing the Loss Landscape of Neural Nets"

50 / 1,039 papers shown
Title
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and
  Strong Baselines
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
Marius Mosbach
Maksym Andriushchenko
Dietrich Klakow
12
352
0
08 Jun 2020
Efficient AutoML Pipeline Search with Matrix and Tensor Factorization
Efficient AutoML Pipeline Search with Matrix and Tensor Factorization
Chengrun Yang
Jicong Fan
Ziyang Wu
Madeleine Udell
8
8
0
07 Jun 2020
An Overview of Neural Network Compression
An Overview of Neural Network Compression
James OÑeill
AI4CE
45
98
0
05 Jun 2020
Noisy Differentiable Architecture Search
Noisy Differentiable Architecture Search
Xiangxiang Chu
Bo-Wen Zhang
10
41
0
07 May 2020
An Information-theoretic Visual Analysis Framework for Convolutional
  Neural Networks
An Information-theoretic Visual Analysis Framework for Convolutional Neural Networks
Jingyi Shen
Han-Wei Shen
FAtt
HAI
6
1
0
02 May 2020
Deeply Cascaded U-Net for Multi-Task Image Processing
Deeply Cascaded U-Net for Multi-Task Image Processing
I. Gubins
R. Veltkamp
SSeg
9
6
0
01 May 2020
Masking as an Efficient Alternative to Finetuning for Pretrained
  Language Models
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Mengjie Zhao
Tao R. Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
17
119
0
26 Apr 2020
Climate Adaptation: Reliably Predicting from Imbalanced Satellite Data
Climate Adaptation: Reliably Predicting from Imbalanced Satellite Data
Ruchit Rawal
Prabhu Pradhan
20
1
0
26 Apr 2020
Understanding the Difficulty of Training Transformers
Understanding the Difficulty of Training Transformers
Liyuan Liu
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
Jiawei Han
AI4CE
14
244
0
17 Apr 2020
Adversarial Weight Perturbation Helps Robust Generalization
Adversarial Weight Perturbation Helps Robust Generalization
Dongxian Wu
Shutao Xia
Yisen Wang
OOD
AAML
6
17
0
13 Apr 2020
Orthogonal Over-Parameterized Training
Orthogonal Over-Parameterized Training
Weiyang Liu
Rongmei Lin
Zhen Liu
James M. Rehg
Liam Paull
Li Xiong
Le Song
Adrian Weller
19
40
0
09 Apr 2020
TSInsight: A local-global attribution framework for interpretability in
  time-series data
TSInsight: A local-global attribution framework for interpretability in time-series data
Shoaib Ahmed Siddiqui
Dominique Mercier
Andreas Dengel
Sheraz Ahmed
FAtt
AI4TS
6
12
0
06 Apr 2020
Applying Cyclical Learning Rate to Neural Machine Translation
Applying Cyclical Learning Rate to Neural Machine Translation
Choon Meng Lee
Jianfeng Liu
Wei Peng
ODL
6
2
0
06 Apr 2020
Understanding Learning Dynamics for Neural Machine Translation
Understanding Learning Dynamics for Neural Machine Translation
Conghui Zhu
Guanlin Li
Lemao Liu
T. Zhao
Shuming Shi
15
3
0
05 Apr 2020
Towards Deep Learning Models Resistant to Large Perturbations
Towards Deep Learning Models Resistant to Large Perturbations
Amirreza Shaeiri
Rozhin Nobahari
M. Rohban
OOD
AAML
18
12
0
30 Mar 2020
SuperNet -- An efficient method of neural networks ensembling
SuperNet -- An efficient method of neural networks ensembling
Ludwik Bukowski
W. Dzwinel
6
2
0
29 Mar 2020
Pipelined Backpropagation at Scale: Training Large Models without
  Batches
Pipelined Backpropagation at Scale: Training Large Models without Batches
Atli Kosson
Vitaliy Chiley
Abhinav Venigalla
Joel Hestness
Urs Koster
25
33
0
25 Mar 2020
Auto-Ensemble: An Adaptive Learning Rate Scheduling based Deep Learning
  Model Ensembling
Auto-Ensemble: An Adaptive Learning Rate Scheduling based Deep Learning Model Ensembling
Jun Yang
Fei Wang
12
32
0
25 Mar 2020
Illumination-based Transformations Improve Skin Lesion Segmentation in
  Dermoscopic Images
Illumination-based Transformations Improve Skin Lesion Segmentation in Dermoscopic Images
Kumar Abhishek
Ghassan Hamarneh
M. S. Drew
25
36
0
23 Mar 2020
SAT: Improving Adversarial Training via Curriculum-Based Loss Smoothing
SAT: Improving Adversarial Training via Curriculum-Based Loss Smoothing
Chawin Sitawarin
S. Chakraborty
David A. Wagner
AAML
9
37
0
18 Mar 2020
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
OffRL
22
46
0
11 Mar 2020
Some Geometrical and Topological Properties of DNNs' Decision Boundaries
Some Geometrical and Topological Properties of DNNs' Decision Boundaries
Bo Liu
Mengya Shen
AAML
12
3
0
07 Mar 2020
Iterative Averaging in the Quest for Best Test Error
Iterative Averaging in the Quest for Best Test Error
Diego Granziol
Xingchen Wan
Samuel Albanie
Stephen J. Roberts
6
3
0
02 Mar 2020
WaveQ: Gradient-Based Deep Quantization of Neural Networks through
  Sinusoidal Adaptive Regularization
WaveQ: Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
T. Elgindi
Charles-Alban Deledalle
H. Esmaeilzadeh
MQ
17
10
0
29 Feb 2020
Layer-wise Conditioning Analysis in Exploring the Learning Dynamics of
  DNNs
Layer-wise Conditioning Analysis in Exploring the Learning Dynamics of DNNs
Lei Huang
Jie Qin
Li Liu
Fan Zhu
Ling Shao
AI4CE
15
11
0
25 Feb 2020
Investigating the interaction between gradient-only line searches and
  different activation functions
Investigating the interaction between gradient-only line searches and different activation functions
D. Kafka
D. Wilke
17
0
0
23 Feb 2020
Do We Need Zero Training Loss After Achieving Zero Training Error?
Do We Need Zero Training Loss After Achieving Zero Training Error?
Takashi Ishida
Ikko Yamane
Tomoya Sakai
Gang Niu
Masashi Sugiyama
AI4CE
41
134
0
20 Feb 2020
Unraveling Meta-Learning: Understanding Feature Representations for
  Few-Shot Tasks
Unraveling Meta-Learning: Understanding Feature Representations for Few-Shot Tasks
Micah Goldblum
Steven Reich
Liam H. Fowl
Renkun Ni
Valeriia Cherepanova
Tom Goldstein
SSL
OffRL
26
75
0
17 Feb 2020
Why Do Deep Residual Networks Generalize Better than Deep Feedforward
  Networks? -- A Neural Tangent Kernel Perspective
Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? -- A Neural Tangent Kernel Perspective
Kaixuan Huang
Yuqing Wang
Molei Tao
T. Zhao
MLT
4
96
0
14 Feb 2020
Understanding Global Loss Landscape of One-hidden-layer ReLU Networks,
  Part 1: Theory
Understanding Global Loss Landscape of One-hidden-layer ReLU Networks, Part 1: Theory
Bo Liu
FAtt
MLT
16
1
0
12 Feb 2020
Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale
  Chest Computed Tomography Volumes
Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale Chest Computed Tomography Volumes
R. Draelos
D. Dov
Maciej Mazurowski
J. Lo
Ricardo Henao
Geoffrey D. Rubin
Lawrence Carin
9
68
0
12 Feb 2020
Unique Properties of Flat Minima in Deep Networks
Unique Properties of Flat Minima in Deep Networks
Rotem Mulayoff
T. Michaeli
ODL
11
4
0
11 Feb 2020
Machine-Learning-Based Diagnostics of EEG Pathology
Machine-Learning-Based Diagnostics of EEG Pathology
Lukas A. W. Gemein
R. Schirrmeister
P. Chrabaszcz
Daniel Wilson
Joschka Boedecker
A. Schulze-Bonhage
Frank Hutter
T. Ball
14
153
0
11 Feb 2020
Translating Diffusion, Wavelets, and Regularisation into Residual
  Networks
Translating Diffusion, Wavelets, and Regularisation into Residual Networks
Tobias Alt
Joachim Weickert
Pascal Peter
DiffM
11
8
0
07 Feb 2020
Optimizing Loss Functions Through Multivariate Taylor Polynomial
  Parameterization
Optimizing Loss Functions Through Multivariate Taylor Polynomial Parameterization
Santiago Gonzalez
Risto Miikkulainen
16
9
0
31 Jan 2020
Learning Deep Analysis Dictionaries -- Part II: Convolutional
  Dictionaries
Learning Deep Analysis Dictionaries -- Part II: Convolutional Dictionaries
Jun-Jie Huang
P. Dragotti
23
1
0
31 Jan 2020
RatLesNetv2: A Fully Convolutional Network for Rodent Brain Lesion
  Segmentation
RatLesNetv2: A Fully Convolutional Network for Rodent Brain Lesion Segmentation
Juan Miguel Valverde
Artem Shatillo
Riccardo De Feo
O. Gröhn
Alejandra Sierra
Jussi Tohka
MedIm
18
21
0
24 Jan 2020
Deep Residual Dense U-Net for Resolution Enhancement in Accelerated MRI
  Acquisition
Deep Residual Dense U-Net for Resolution Enhancement in Accelerated MRI Acquisition
Pak Lun Kevin Ding
Zhiqiang Li
Yuxiang Zhou
Baoxin Li
21
34
0
13 Jan 2020
Stochastic Weight Averaging in Parallel: Large-Batch Training that
  Generalizes Well
Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well
Vipul Gupta
S. Serrano
D. DeCoste
MoMe
30
55
0
07 Jan 2020
CProp: Adaptive Learning Rate Scaling from Past Gradient Conformity
CProp: Adaptive Learning Rate Scaling from Past Gradient Conformity
Konpat Preechakul
B. Kijsirikul
ODL
20
3
0
24 Dec 2019
TRADI: Tracking deep neural network weight distributions for uncertainty
  estimation
TRADI: Tracking deep neural network weight distributions for uncertainty estimation
Gianni Franchi
Andrei Bursuc
Emanuel Aldea
Séverine Dubuisson
Isabelle Bloch
UQCV
20
51
0
24 Dec 2019
Deep Curvature Suite
Deep Curvature Suite
Diego Granziol
Xingchen Wan
T. Garipov
3DV
15
12
0
20 Dec 2019
Optimization for deep learning: theory and algorithms
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
14
168
0
19 Dec 2019
A Deep Neural Network's Loss Surface Contains Every Low-dimensional
  Pattern
A Deep Neural Network's Loss Surface Contains Every Low-dimensional Pattern
Wojciech M. Czarnecki
Simon Osindero
Razvan Pascanu
Max Jaderberg
3DPC
6
11
0
16 Dec 2019
PyHessian: Neural Networks Through the Lens of the Hessian
PyHessian: Neural Networks Through the Lens of the Hessian
Z. Yao
A. Gholami
Kurt Keutzer
Michael W. Mahoney
ODL
17
289
0
16 Dec 2019
Meta-Learning Initializations for Image Segmentation
Meta-Learning Initializations for Image Segmentation
S. Hendryx
Andrew B. Leach
P. Hein
Clayton T. Morrison
VLM
8
25
0
13 Dec 2019
A Machine Learning Framework for Solving High-Dimensional Mean Field
  Game and Mean Field Control Problems
A Machine Learning Framework for Solving High-Dimensional Mean Field Game and Mean Field Control Problems
Lars Ruthotto
Stanley Osher
Wuchen Li
L. Nurbekyan
Samy Wu Fung
AI4CE
20
212
0
04 Dec 2019
Constrained Linear Data-feature Mapping for Image Classification
Constrained Linear Data-feature Mapping for Image Classification
Juncai He
Yuyan Chen
Lian Zhang
Jinchao Xu
14
2
0
23 Nov 2019
DC-S3GD: Delay-Compensated Stale-Synchronous SGD for Large-Scale
  Decentralized Neural Network Training
DC-S3GD: Delay-Compensated Stale-Synchronous SGD for Large-Scale Decentralized Neural Network Training
Alessandro Rigazzi
11
4
0
06 Nov 2019
GraphAIR: Graph Representation Learning with Neighborhood Aggregation
  and Interaction
GraphAIR: Graph Representation Learning with Neighborhood Aggregation and Interaction
Fenyu Hu
Yanqiao Zhu
Shu Wu
Weiran Huang
Liang Wang
T. Tan
GNN
11
47
0
05 Nov 2019
Previous
123...1718192021
Next