ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08709
  4. Cited By
Do We Need Zero Training Loss After Achieving Zero Training Error?
v1v2 (latest)

Do We Need Zero Training Loss After Achieving Zero Training Error?

International Conference on Machine Learning (ICML), 2020
20 February 2020
Takashi Ishida
Ikko Yamane
Tomoya Sakai
Gang Niu
Masashi Sugiyama
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Do We Need Zero Training Loss After Achieving Zero Training Error?"

50 / 64 papers shown
A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees
A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees
Miao Zhang
Junpeng Li
Changchun Hua
Yana Yang
157
0
0
28 Nov 2025
A Closer Look at Personalized Fine-Tuning in Heterogeneous Federated Learning
A Closer Look at Personalized Fine-Tuning in Heterogeneous Federated Learning
Minghui Chen
Hrad Ghoukasian
Ruinan Jin
Zehua Wang
Sai Praneeth Karimireddy
Xiaoxiao Li
193
0
0
16 Nov 2025
When Data Falls Short: Grokking Below the Critical Threshold
When Data Falls Short: Grokking Below the Critical Threshold
Vaibhav Singh
Eugene Belilovsky
Rahaf Aljundi
134
0
0
06 Nov 2025
Rethinking Consistent Multi-Label Classification Under Inexact Supervision
Rethinking Consistent Multi-Label Classification Under Inexact Supervision
Wei Wang
Tianhao Ma
Ming-Kun Xie
Gang Niu
Masashi Sugiyama
197
1
0
05 Oct 2025
Class-wise Flooding Regularization for Imbalanced Image Classification
Class-wise Flooding Regularization for Imbalanced Image Classification
Hiroaki Aizawa
Yuta Naito
Kohei Fukuda
AI4CE
103
0
0
26 Aug 2025
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Tianxiao Cao
Kyohei Atarashi
H. Kashima
268
0
0
14 Aug 2025
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Yang Wang
Chenghao Xiao
Yi Zhou
Stuart E. Middleton
Noura Al Moubayed
C. D. Lin
AAML
356
1
0
29 Jul 2025
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence BenchmarksInternational Conference on Computational Linguistics (COLING), 2025
Yang Wang
Chenghua Lin
ELM
451
4
0
05 Jan 2025
Counter-Current Learning: A Biologically Plausible Dual Network Approach
  for Deep Learning
Counter-Current Learning: A Biologically Plausible Dual Network Approach for Deep LearningNeural Information Processing Systems (NeurIPS), 2024
Chia-Hsiang Kao
Bharath Hariharan
360
3
0
30 Sep 2024
Making Robust Generalizers Less Rigid with Loss Concentration
Making Robust Generalizers Less Rigid with Loss Concentration
Matthew J. Holland
Toma Hamada
OOD
402
0
0
07 Aug 2024
ClassiFIM: An Unsupervised Method To Detect Phase Transitions
ClassiFIM: An Unsupervised Method To Detect Phase Transitions
Victor Kasatkin
E. Mozgunov
Nicholas Ezzell
Utkarsh Mishra
Itay Hen
Daniel Lidar
192
3
0
06 Aug 2024
Risks, Causes, and Mitigations of Widespread Deployments of Large
  Language Models (LLMs): A Survey
Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey
Md. Nazmus Sakib
Md Athikul Islam
Royal Pathak
Md Mashrur Arifin
ALMPILM
319
13
0
01 Aug 2024
Decoupling the Class Label and the Target Concept in Machine Unlearning
Decoupling the Class Label and the Target Concept in Machine Unlearning
Jianing Zhu
Bo Han
Jiangchao Yao
Jianliang Xu
Gang Niu
Masashi Sugiyama
CLLMU
207
9
0
12 Jun 2024
Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
Guanhua Huang
Yuchen Zhang
Zhe Li
Yongjian You
Mingze Wang
Zhouwang Yang
DeLMO
533
23
0
03 Jun 2024
GenFighter: A Generative and Evolutive Textual Attack Removal
GenFighter: A Generative and Evolutive Textual Attack Removal
Md Athikul Islam
Edoardo Serra
Sushil Jajodia
AAML
203
0
0
17 Apr 2024
Layer-wise Regularized Dropout for Neural Language Models
Layer-wise Regularized Dropout for Neural Language Models
Shiwen Ni
Min Yang
Ruifeng Xu
Chengming Li
Xiping Hu
144
1
0
26 Feb 2024
Criterion Collapse and Loss Distribution Control
Criterion Collapse and Loss Distribution Control
Matthew J. Holland
295
2
0
15 Feb 2024
Coupled Confusion Correction: Learning from Crowds with Sparse
  Annotations
Coupled Confusion Correction: Learning from Crowds with Sparse AnnotationsAAAI Conference on Artificial Intelligence (AAAI), 2023
Hansong Zhang
Shikun Li
Dan Zeng
Chenggang Yan
Shiming Ge
379
23
0
12 Dec 2023
A Generalizable Deep Learning System for Cardiac MRI
A Generalizable Deep Learning System for Cardiac MRI
R. Shad
C. Zakka
Dhamanpreet Kaur
R. Fong
R. Filice
...
Michael A. Acker
Curt P. Langlotz
W. Hiesinger
Curtis Langlotz
William Hiesinger
MedIm
253
3
0
01 Dec 2023
On the Hyperparameter Loss Landscapes of Machine Learning Models: An
  Exploratory Study
On the Hyperparameter Loss Landscapes of Machine Learning Models: An Exploratory StudyKnowledge Discovery and Data Mining (KDD), 2023
Mingyu Huang
Ke Li
303
5
0
23 Nov 2023
AdaFlood: Adaptive Flood Regularization
AdaFlood: Adaptive Flood Regularization
Wonho Bae
Yi Ren
Mohamad Osama Ahmed
Frederick Tung
Danica J. Sutherland
Gabriel L. Oliveira
AI4CE
268
3
0
06 Nov 2023
Flooding Regularization for Stable Training of Generative Adversarial
  Networks
Flooding Regularization for Stable Training of Generative Adversarial Networks
Iu Yahiro
Takashi Ishida
Xiangwei Zhu
GANAI4CE
328
0
0
01 Nov 2023
NECO: NEural Collapse Based Out-of-distribution detection
NECO: NEural Collapse Based Out-of-distribution detectionInternational Conference on Learning Representations (ICLR), 2023
Mouin Ben Ammar
Nacim Belkhir
Sebastian Popescu
Antoine Manzanera
Gianni Franchi
OODD
381
37
0
10 Oct 2023
Quality-Agnostic Deepfake Detection with Intra-model Collaborative
  Learning
Quality-Agnostic Deepfake Detection with Intra-model Collaborative LearningIEEE International Conference on Computer Vision (ICCV), 2023
B. Le
Simon S. Woo
AAML
281
46
0
12 Sep 2023
Label Noise: Correcting a Correction
Label Noise: Correcting a Correction
William Toner
Amos Storkey
NoLa
270
1
0
24 Jul 2023
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk
  Minimization
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk MinimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Songyang Gao
Jiajun Sun
Yan Liu
Xiao Wang
Qi Zhang
Zhongyu Wei
Jin Ma
Yingchun Shan
OOD
208
9
0
27 Jun 2023
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection
  Capability
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection CapabilityInternational Conference on Machine Learning (ICML), 2023
Jianing Zhu
Hengzhuang Li
Jiangchao Yao
Tongliang Liu
Jianliang Xu
Bo Han
OODD
233
20
0
06 Jun 2023
HUB: Guiding Learned Optimizers with Continuous Prompt Tuning
Gaole Dai
Wei Wu
Ziyu Wang
Jie Fu
Shanghang Zhang
Tiejun Huang
AIFin
244
0
0
26 May 2023
Temporal Aware Mixed Attention-based Convolution and Transformer Network
  (MACTN) for EEG Emotion Recognition
Temporal Aware Mixed Attention-based Convolution and Transformer Network (MACTN) for EEG Emotion Recognition
Xiaopeng Si
Dong Huang
Yulin Sun
Dong Ming
214
7
0
18 May 2023
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge
  Distillation
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation
Rongzhi Zhang
Jiaming Shen
Tianqi Liu
Jia-Ling Liu
Michael Bendersky
Marc Najork
Chao Zhang
338
34
0
08 May 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware
  Minimization
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Woojin Lee
Jaewook Lee
206
11
0
27 Jan 2023
A Stability Analysis of Fine-Tuning a Pre-Trained Model
A Stability Analysis of Fine-Tuning a Pre-Trained Model
Z. Fu
Anthony Man-Cho So
Nigel Collier
224
4
0
24 Jan 2023
$β$-DARTS++: Bi-level Regularization for Proxy-robust Differentiable
  Architecture Search
βββ-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture SearchIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Peng Ye
Tong He
Baopu Li
Tao Chen
Mengwei He
Wanli Ouyang
OOD
251
9
0
16 Jan 2023
Stability Analysis of Sharpness-Aware Minimization
Stability Analysis of Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Jaewook Lee
288
17
0
16 Jan 2023
Efficient Generalization Improvement Guided by Random Weight
  Perturbation
Efficient Generalization Improvement Guided by Random Weight Perturbation
Tao Li
Wei Yan
Zehao Lei
Yingwen Wu
Kun Fang
Ming-Hsuan Yang
Xiaolin Huang
AAML
191
8
0
21 Nov 2022
Noisy Pairing and Partial Supervision for Opinion Summarization
Noisy Pairing and Partial Supervision for Opinion SummarizationInternational Conference on Natural Language Generation (INLG), 2022
Hayate Iso
Xiaolan Wang
Yoshihiko Suhara
AI4TS
186
0
0
16 Nov 2022
One-Class Risk Estimation for One-Class Hyperspectral Image
  Classification
One-Class Risk Estimation for One-Class Hyperspectral Image ClassificationIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Hengwei Zhao
Yanfei Zhong
Xinyu Wang
H. Shu
164
12
0
27 Oct 2022
WaveBound: Dynamic Error Bounds for Stable Time Series Forecasting
WaveBound: Dynamic Error Bounds for Stable Time Series ForecastingNeural Information Processing Systems (NeurIPS), 2022
Youngin Cho
Daejin Kim
Dongmin Kim
Mohammad Azam Khan
Jaegul Choo
AI4TS
265
3
0
25 Oct 2022
Stable and Efficient Adversarial Training through Local Linearization
Stable and Efficient Adversarial Training through Local Linearization
Zhuorong Li
Daiwei Yu
AAML
131
0
0
11 Oct 2022
Understanding Gradient Regularization in Deep Learning: Efficient
  Finite-Difference Computation and Implicit Bias
Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit BiasInternational Conference on Machine Learning (ICML), 2022
Ryo Karakida
Tomoumi Takase
Tomohiro Hayase
Kazuki Osawa
166
21
0
06 Oct 2022
Pre-training General Trajectory Embeddings with Maximum Multi-view
  Entropy Coding
Pre-training General Trajectory Embeddings with Maximum Multi-view Entropy CodingIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Yan Lin
Huaiyu Wan
Shengnan Guo
Jilin Hu
Christian S. Jensen
Youfang Lin
AI4TS
216
34
0
29 Jul 2022
Multi-class Classification from Multiple Unlabeled Datasets with Partial
  Risk Regularization
Multi-class Classification from Multiple Unlabeled Datasets with Partial Risk RegularizationAsian Conference on Machine Learning (ACML), 2022
Yuting Tang
Nan Lu
Tianyi Zhang
Masashi Sugiyama
230
5
0
04 Jul 2022
Fairness via In-Processing in the Over-parameterized Regime: A
  Cautionary Tale
Fairness via In-Processing in the Over-parameterized Regime: A Cautionary Tale
A. Veldanda
Shubham Sharma
Jiahao Chen
Sanghamitra Dutta
Alan Mishler
S. Garg
206
7
0
29 Jun 2022
Improving robustness of language models from a geometry-aware
  perspective
Improving robustness of language models from a geometry-aware perspectiveFindings (Findings), 2022
Bin Zhu
Zhaoquan Gu
Le Wang
Jinyin Chen
Qi Xuan
AAML
196
10
0
28 Apr 2022
Improving Camouflaged Object Detection with the Uncertainty of
  Pseudo-edge Labels
Improving Camouflaged Object Detection with the Uncertainty of Pseudo-edge LabelsACM Multimedia Asia (MA), 2021
Nobukatsu Kajiura
Hong Liu
Shiníchi Satoh
181
32
0
29 Oct 2021
Anti-Backdoor Learning: Training Clean Models on Poisoned Data
Anti-Backdoor Learning: Training Clean Models on Poisoned DataNeural Information Processing Systems (NeurIPS), 2021
Yige Li
X. Lyu
Nodens Koren
Lingjuan Lyu
Yue Liu
Jiabo He
OnRL
368
430
0
22 Oct 2021
On the Generalization of Models Trained with SGD: Information-Theoretic
  Bounds and Implications
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Ziqiao Wang
Yongyi Mao
FedMLMLT
331
32
0
07 Oct 2021
MutualGraphNet: A novel model for motor imagery classification
MutualGraphNet: A novel model for motor imagery classification
Yan Li
Ning Zhong
D. Taniar
Haolan Zhang
127
8
0
02 Sep 2021
HAT4RD: Hierarchical Adversarial Training for Rumor Detection on Social
  Media
HAT4RD: Hierarchical Adversarial Training for Rumor Detection on Social MediaItalian National Conference on Sensors (INS), 2021
Shiwen Ni
Jiawen Li
Hung-Yu kao
232
8
0
29 Aug 2021
DropAttack: A Masked Weight Adversarial Training Method to Improve
  Generalization of Neural Networks
DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks
Shiwen Ni
Jiawen Li
Hung-Yu kao
AAML
158
4
0
29 Aug 2021
12
Next
Page 1 of 2