1906.02629
When Does Label Smoothing Help?
6 June 2019
Rafael Müller
Simon Kornblith
Geoffrey E. Hinton
UQCV
Papers citing
"When Does Label Smoothing Help?"
50 / 282 papers shown
Rethinking Label Smoothing on Multi-hop Question Answering
Zhangyue Yin
Yuxin Wang
Xiannian Hu
Yiguang Wu
Hang Yan
Xinyu Zhang
Zhao Cao
Xuanjing Huang
Xipeng Qiu
19
9
0
19 Dec 2022
Inductive Attention for Video Action Anticipation
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
Simon See
O. Lanz
31
1
0
17 Dec 2022
Context Label Learning: Improving Background Class Representations in Semantic Segmentation
Zeju Li
Konstantinos Kamnitsas
C. Ouyang
Chen Chen
Ben Glocker
VLM
25
6
0
16 Dec 2022
Improving group robustness under noisy labels using predictive uncertainty
Dongpin Oh
Dae Lee
Jeunghyun Byun
Bonggun Shin
UQCV
18
3
0
14 Dec 2022
Universe Points Representation Learning for Partial Multi-Graph Matching
Zhakshylyk Nurlanov
Frank R. Schmidt
Florian Bernard
26
5
0
01 Dec 2022
FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
Yulei Qin
Xingyu Chen
Chao Chen
Yunhang Shen
Bohan Ren
Yun Gu
Jie-jin Yang
Chunhua Shen
36
4
0
01 Dec 2022
LUMix: Improving Mixup by Better Modelling Label Uncertainty
Shuyang Sun
Jieneng Chen
Ruifei He
Alan Yuille
Philip H. S. Torr
Song Bai
UQCV
NoLa
13
5
0
29 Nov 2022
Progressive Learning without Forgetting
Tao Feng
Hangjie Yuan
Mang Wang
Ziyuan Huang
Ang Bian
Jianzhou Zhang
CLL
KELM
42
4
0
28 Nov 2022
Uncertainty-aware Vision-based Metric Cross-view Geolocalization
F. Fervers
Sebastian Bullinger
C. Bodensteiner
Michael Arens
Rainer Stiefelhagen
22
39
0
22 Nov 2022
Compiler Provenance Recovery for Multi-CPU Architectures Using a Centrifuge Mechanism
Yuhei Otsubo
Akira Otsuka
M. Mimura
21
3
0
22 Nov 2022
AdaFocal: Calibration-aware Adaptive Focal Loss
Arindam Ghosh
Thomas Schaaf
Matthew R. Gormley
FedML
UQCV
21
25
0
21 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
25
6
0
14 Nov 2022
Estimating Soft Labels for Out-of-Domain Intent Detection
Hao Lang
Yinhe Zheng
Jian Sun
Feiling Huang
Luo Si
Yongbin Li
28
15
0
10 Nov 2022
Soft Augmentation for Image Classification
Yang Liu
Shen Yan
Laura Leal-Taixé
James Hays
Deva Ramanan
29
11
0
09 Nov 2022
Harmonizing the object recognition strategies of deep neural networks with humans
Thomas Fel
Ivan Felipe
Drew Linsley
Thomas Serre
30
71
0
08 Nov 2022
Respecting Transfer Gap in Knowledge Distillation
Yulei Niu
Long Chen
Chan Zhou
Hanwang Zhang
21
23
0
23 Oct 2022
Salience Allocation as Guidance for Abstractive Summarization
Fei Wang
Kaiqiang Song
Hongming Zhang
Lifeng Jin
Sangwoo Cho
Wenlin Yao
Xiaoyang Wang
Muhao Chen
Dong Yu
55
31
0
22 Oct 2022
A Continuum of Generation Tasks for Investigating Length Bias and Degenerate Repetition
Darcey Riley
David Chiang
19
5
0
19 Oct 2022
Rethinking Sharpness-Aware Minimization as Variational Inference
Szilvia Ujváry
Zsigmond Telek
A. Kerekes
Anna Mészáros
Ferenc Huszár
25
8
0
19 Oct 2022
Bidirectional Semi-supervised Dual-branch CNN for Robust 3D Reconstruction of Stereo Endoscopic Images via Adaptive Cross and Parallel Supervisions
Hongkuan Shi
Zhiwei Wang
Ying Zhou
Dun Li
Xin Yang
Qiang Li
3DV
22
7
0
15 Oct 2022
Robust Models are less Over-Confident
Julia Grabinski
Paul Gavrikov
J. Keuper
M. Keuper
AAML
28
24
0
12 Oct 2022
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Yuxin Xiao
Paul Pu Liang
Umang Bhatt
W. Neiswanger
Ruslan Salakhutdinov
Louis-Philippe Morency
175
86
0
10 Oct 2022
Are All Losses Created Equal: A Neural Collapse Perspective
Jinxin Zhou
Chong You
Xiao Li
Kangning Liu
Sheng Liu
Qing Qu
Zhihui Zhu
25
58
0
04 Oct 2022
Using Knowledge Distillation to improve interpretable models in a retail banking context
Maxime Biehler
Mohamed Guermazi
Célim Starck
49
2
0
30 Sep 2022
Neural Clamping: Joint Input Perturbation and Temperature Scaling for Neural Network Calibration
Yu Tang
Pin-Yu Chen
Tsung-Yi Ho
18
5
0
23 Sep 2022
Relaxed Attention for Transformer Models
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
24
11
0
20 Sep 2022
Learning Symbolic Model-Agnostic Loss Functions via Meta-Learning
Christian Raymond
Qi Chen
Bing Xue
Mengjie Zhang
FedML
27
11
0
19 Sep 2022
Weakly Supervised Medical Image Segmentation With Soft Labels and Noise Robust Loss
B. Felfeliyan
A. Hareendranathan
G. Kuntze
S. Wichuk
Nils D. Forkert
Jacob L. Jaremko
J. Ronsky
NoLa
31
2
0
16 Sep 2022
Towards Improving Calibration in Object Detection Under Domain Shift
Muhammad Akhtar Munir
M. H. Khan
M. Sarfraz
Mohsen Ali
16
22
0
15 Sep 2022
Membership-Doctor: Comprehensive Assessment of Membership Inference Against Machine Learning Models
Xinlei He
Zheng Li
Weilin Xu
Cory Cornelius
Yang Zhang
MIACV
21
24
0
22 Aug 2022
Rethinking Knowledge Distillation via Cross-Entropy
Zhendong Yang
Zhe Li
Yuan Gong
Tianke Zhang
Shanshan Lao
Chun Yuan
Yu Li
25
14
0
22 Aug 2022
Effectiveness of Function Matching in Driving Scene Recognition
Shingo Yashima
16
1
0
20 Aug 2022
Multi-View Correlation Consistency for Semi-Supervised Semantic Segmentation
Yunzhong Hou
Stephen Gould
Liang Zheng
25
0
0
17 Aug 2022
Reduced Implication-bias Logic Loss for Neuro-Symbolic Learning
Haoyuan He
Wang-Zhou Dai
Ming Li
AI4CE
29
2
0
14 Aug 2022
Two-Pass Low Latency End-to-End Spoken Language Understanding
Siddhant Arora
Siddharth Dalmia
Xuankai Chang
Brian Yan
A. Black
Shinji Watanabe
VLM
22
19
0
14 Jul 2022
Rich Feature Distillation with Feature Affinity Module for Efficient Image Dehazing
S. J.
Anushri Suresh
Nisha J.S.
V. Gopi
VLM
21
6
0
13 Jul 2022
Masked Autoencoders that Listen
Po-Yao (Bernie) Huang
Hu Xu
Juncheng Billy Li
Alexei Baevski
Michael Auli
Wojciech Galuba
Florian Metze
Christoph Feichtenhofer
13
268
0
13 Jul 2022
Sample-dependent Adaptive Temperature Scaling for Improved Calibration
Thomas Joy
Francesco Pinto
Ser-Nam Lim
Philip H. S. Torr
P. Dokania
UQCV
19
30
0
13 Jul 2022
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding
Yifan Peng
Siddharth Dalmia
Ian Lane
Shinji Watanabe
19
142
0
06 Jul 2022
PrUE: Distilling Knowledge from Sparse Teacher Networks
Shaopu Wang
Xiaojun Chen
Mengzhen Kou
Jinqiao Shi
8
2
0
03 Jul 2022
Eliciting and Learning with Soft Labels from Every Annotator
K. M. Collins
Umang Bhatt
Adrian Weller
11
44
0
02 Jul 2022
ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State
Xinshao Wang
Yang Hua
Elyor Kodirov
S. Mukherjee
David A. Clifton
N. Robertson
15
6
0
30 Jun 2022
Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing?
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Yunqing Zhao
Ngai-man Cheung
80
41
0
29 Jun 2022
RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness
Francesco Pinto
Harry Yang
Ser-Nam Lim
Philip H. S. Torr
P. Dokania
UQCV
27
34
0
29 Jun 2022
Knowledge Distillation of Transformer-based Language Models Revisited
Chengqiang Lu
Jianwei Zhang
Yunfei Chu
Zhengyu Chen
Jingren Zhou
Fei Wu
Haiqing Chen
Hongxia Yang
VLM
25
10
0
29 Jun 2022
Robust SAR ATR on MSTAR with Deep Learning Models trained on Full Synthetic MOCEM data
Benjamin Camus
C. Barbu
Eric Monteux
9
4
0
15 Jun 2022
Confidence Score for Source-Free Unsupervised Domain Adaptation
Jonghyun Lee
Dahuin Jung
Junho Yim
Sung-Hoon Yoon
TTA
13
70
0
14 Jun 2022
NeuGuard: Lightweight Neuron-Guided Defense against Membership Inference Attacks
Nuo Xu
Binghui Wang
Ran Ran
Wujie Wen
Parv Venkitasubramaniam
AAML
13
5
0
11 Jun 2022
On Calibration of Graph Neural Networks for Node Classification
Tong Liu
Yushan Liu
Marcel Hildebrandt
Mitchell Joblin
Hang Li
Volker Tresp
22
11
0
03 Jun 2022
DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks
Y. Fu
Haichuan Yang
Jiayi Yuan
Meng Li
Cheng Wan
Raghuraman Krishnamoorthi
Vikas Chandra
Yingyan Lin
28
18
0
02 Jun 2022