Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.08709
Cited By
v1
v2 (latest)
Do We Need Zero Training Loss After Achieving Zero Training Error?
International Conference on Machine Learning (ICML), 2020
20 February 2020
Takashi Ishida
Ikko Yamane
Tomoya Sakai
Gang Niu
Masashi Sugiyama
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Do We Need Zero Training Loss After Achieving Zero Training Error?"
50 / 64 papers shown
A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees
Miao Zhang
Junpeng Li
Changchun Hua
Yana Yang
157
0
0
28 Nov 2025
A Closer Look at Personalized Fine-Tuning in Heterogeneous Federated Learning
Minghui Chen
Hrad Ghoukasian
Ruinan Jin
Zehua Wang
Sai Praneeth Karimireddy
Xiaoxiao Li
193
0
0
16 Nov 2025
When Data Falls Short: Grokking Below the Critical Threshold
Vaibhav Singh
Eugene Belilovsky
Rahaf Aljundi
134
0
0
06 Nov 2025
Rethinking Consistent Multi-Label Classification Under Inexact Supervision
Wei Wang
Tianhao Ma
Ming-Kun Xie
Gang Niu
Masashi Sugiyama
197
1
0
05 Oct 2025
Class-wise Flooding Regularization for Imbalanced Image Classification
Hiroaki Aizawa
Yuta Naito
Kohei Fukuda
AI4CE
103
0
0
26 Aug 2025
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Tianxiao Cao
Kyohei Atarashi
H. Kashima
268
0
0
14 Aug 2025
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Yang Wang
Chenghao Xiao
Yi Zhou
Stuart E. Middleton
Noura Al Moubayed
C. D. Lin
AAML
356
1
0
29 Jul 2025
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
International Conference on Computational Linguistics (COLING), 2025
Yang Wang
Chenghua Lin
ELM
451
4
0
05 Jan 2025
Counter-Current Learning: A Biologically Plausible Dual Network Approach for Deep Learning
Neural Information Processing Systems (NeurIPS), 2024
Chia-Hsiang Kao
Bharath Hariharan
360
3
0
30 Sep 2024
Making Robust Generalizers Less Rigid with Loss Concentration
Matthew J. Holland
Toma Hamada
OOD
402
0
0
07 Aug 2024
ClassiFIM: An Unsupervised Method To Detect Phase Transitions
Victor Kasatkin
E. Mozgunov
Nicholas Ezzell
Utkarsh Mishra
Itay Hen
Daniel Lidar
192
3
0
06 Aug 2024
Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey
Md. Nazmus Sakib
Md Athikul Islam
Royal Pathak
Md Mashrur Arifin
ALM
PILM
319
13
0
01 Aug 2024
Decoupling the Class Label and the Target Concept in Machine Unlearning
Jianing Zhu
Bo Han
Jiangchao Yao
Jianliang Xu
Gang Niu
Masashi Sugiyama
CLL
MU
207
9
0
12 Jun 2024
Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
Guanhua Huang
Yuchen Zhang
Zhe Li
Yongjian You
Mingze Wang
Zhouwang Yang
DeLMO
533
23
0
03 Jun 2024
GenFighter: A Generative and Evolutive Textual Attack Removal
Md Athikul Islam
Edoardo Serra
Sushil Jajodia
AAML
203
0
0
17 Apr 2024
Layer-wise Regularized Dropout for Neural Language Models
Shiwen Ni
Min Yang
Ruifeng Xu
Chengming Li
Xiping Hu
144
1
0
26 Feb 2024
Criterion Collapse and Loss Distribution Control
Matthew J. Holland
295
2
0
15 Feb 2024
Coupled Confusion Correction: Learning from Crowds with Sparse Annotations
AAAI Conference on Artificial Intelligence (AAAI), 2023
Hansong Zhang
Shikun Li
Dan Zeng
Chenggang Yan
Shiming Ge
379
23
0
12 Dec 2023
A Generalizable Deep Learning System for Cardiac MRI
R. Shad
C. Zakka
Dhamanpreet Kaur
R. Fong
R. Filice
...
Michael A. Acker
Curt P. Langlotz
W. Hiesinger
Curtis Langlotz
William Hiesinger
MedIm
253
3
0
01 Dec 2023
On the Hyperparameter Loss Landscapes of Machine Learning Models: An Exploratory Study
Knowledge Discovery and Data Mining (KDD), 2023
Mingyu Huang
Ke Li
303
5
0
23 Nov 2023
AdaFlood: Adaptive Flood Regularization
Wonho Bae
Yi Ren
Mohamad Osama Ahmed
Frederick Tung
Danica J. Sutherland
Gabriel L. Oliveira
AI4CE
268
3
0
06 Nov 2023
Flooding Regularization for Stable Training of Generative Adversarial Networks
Iu Yahiro
Takashi Ishida
Xiangwei Zhu
GAN
AI4CE
328
0
0
01 Nov 2023
NECO: NEural Collapse Based Out-of-distribution detection
International Conference on Learning Representations (ICLR), 2023
Mouin Ben Ammar
Nacim Belkhir
Sebastian Popescu
Antoine Manzanera
Gianni Franchi
OODD
381
37
0
10 Oct 2023
Quality-Agnostic Deepfake Detection with Intra-model Collaborative Learning
IEEE International Conference on Computer Vision (ICCV), 2023
B. Le
Simon S. Woo
AAML
281
46
0
12 Sep 2023
Label Noise: Correcting a Correction
William Toner
Amos Storkey
NoLa
270
1
0
24 Jul 2023
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Songyang Gao
Jiajun Sun
Yan Liu
Xiao Wang
Qi Zhang
Zhongyu Wei
Jin Ma
Yingchun Shan
OOD
208
9
0
27 Jun 2023
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability
International Conference on Machine Learning (ICML), 2023
Jianing Zhu
Hengzhuang Li
Jiangchao Yao
Tongliang Liu
Jianliang Xu
Bo Han
OODD
233
20
0
06 Jun 2023
HUB: Guiding Learned Optimizers with Continuous Prompt Tuning
Gaole Dai
Wei Wu
Ziyu Wang
Jie Fu
Shanghang Zhang
Tiejun Huang
AIFin
244
0
0
26 May 2023
Temporal Aware Mixed Attention-based Convolution and Transformer Network (MACTN) for EEG Emotion Recognition
Xiaopeng Si
Dong Huang
Yulin Sun
Dong Ming
214
7
0
18 May 2023
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation
Rongzhi Zhang
Jiaming Shen
Tianqi Liu
Jia-Ling Liu
Michael Bendersky
Marc Najork
Chao Zhang
338
34
0
08 May 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Woojin Lee
Jaewook Lee
206
11
0
27 Jan 2023
A Stability Analysis of Fine-Tuning a Pre-Trained Model
Z. Fu
Anthony Man-Cho So
Nigel Collier
224
4
0
24 Jan 2023
β
β
β
-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Peng Ye
Tong He
Baopu Li
Tao Chen
Mengwei He
Wanli Ouyang
OOD
251
9
0
16 Jan 2023
Stability Analysis of Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Jaewook Lee
288
17
0
16 Jan 2023
Efficient Generalization Improvement Guided by Random Weight Perturbation
Tao Li
Wei Yan
Zehao Lei
Yingwen Wu
Kun Fang
Ming-Hsuan Yang
Xiaolin Huang
AAML
191
8
0
21 Nov 2022
Noisy Pairing and Partial Supervision for Opinion Summarization
International Conference on Natural Language Generation (INLG), 2022
Hayate Iso
Xiaolan Wang
Yoshihiko Suhara
AI4TS
186
0
0
16 Nov 2022
One-Class Risk Estimation for One-Class Hyperspectral Image Classification
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Hengwei Zhao
Yanfei Zhong
Xinyu Wang
H. Shu
164
12
0
27 Oct 2022
WaveBound: Dynamic Error Bounds for Stable Time Series Forecasting
Neural Information Processing Systems (NeurIPS), 2022
Youngin Cho
Daejin Kim
Dongmin Kim
Mohammad Azam Khan
Jaegul Choo
AI4TS
265
3
0
25 Oct 2022
Stable and Efficient Adversarial Training through Local Linearization
Zhuorong Li
Daiwei Yu
AAML
131
0
0
11 Oct 2022
Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias
International Conference on Machine Learning (ICML), 2022
Ryo Karakida
Tomoumi Takase
Tomohiro Hayase
Kazuki Osawa
166
21
0
06 Oct 2022
Pre-training General Trajectory Embeddings with Maximum Multi-view Entropy Coding
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Yan Lin
Huaiyu Wan
Shengnan Guo
Jilin Hu
Christian S. Jensen
Youfang Lin
AI4TS
216
34
0
29 Jul 2022
Multi-class Classification from Multiple Unlabeled Datasets with Partial Risk Regularization
Asian Conference on Machine Learning (ACML), 2022
Yuting Tang
Nan Lu
Tianyi Zhang
Masashi Sugiyama
230
5
0
04 Jul 2022
Fairness via In-Processing in the Over-parameterized Regime: A Cautionary Tale
A. Veldanda
Shubham Sharma
Jiahao Chen
Sanghamitra Dutta
Alan Mishler
S. Garg
206
7
0
29 Jun 2022
Improving robustness of language models from a geometry-aware perspective
Findings (Findings), 2022
Bin Zhu
Zhaoquan Gu
Le Wang
Jinyin Chen
Qi Xuan
AAML
196
10
0
28 Apr 2022
Improving Camouflaged Object Detection with the Uncertainty of Pseudo-edge Labels
ACM Multimedia Asia (MA), 2021
Nobukatsu Kajiura
Hong Liu
Shiníchi Satoh
181
32
0
29 Oct 2021
Anti-Backdoor Learning: Training Clean Models on Poisoned Data
Neural Information Processing Systems (NeurIPS), 2021
Yige Li
X. Lyu
Nodens Koren
Lingjuan Lyu
Yue Liu
Jiabo He
OnRL
368
430
0
22 Oct 2021
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Ziqiao Wang
Yongyi Mao
FedML
MLT
331
32
0
07 Oct 2021
MutualGraphNet: A novel model for motor imagery classification
Yan Li
Ning Zhong
D. Taniar
Haolan Zhang
127
8
0
02 Sep 2021
HAT4RD: Hierarchical Adversarial Training for Rumor Detection on Social Media
Italian National Conference on Sensors (INS), 2021
Shiwen Ni
Jiawen Li
Hung-Yu kao
232
8
0
29 Aug 2021
DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks
Shiwen Ni
Jiawen Li
Hung-Yu kao
AAML
158
4
0
29 Aug 2021
1
2
Next
Page 1 of 2