Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.09412
Cited By
mixup: Beyond Empirical Risk Minimization
25 October 2017
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"mixup: Beyond Empirical Risk Minimization"
50 / 4,964 papers shown
Title
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
31
13
0
15 Oct 2024
Dual-Teacher Ensemble Models with Double-Copy-Paste for 3D Semi-Supervised Medical Image Segmentation
Zhan Fa
Shumeng Li
Jian Zhang
Lei Qi
Qian Yu
Yinghuan Shi
28
0
0
15 Oct 2024
FedCCRL: Federated Domain Generalization with Cross-Client Representation Learning
Xinpeng Wang
Xiaoying Tang
Xiaoying Tang
FedML
28
2
0
15 Oct 2024
RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure
X. Zhang
Sin Chee Chin
Tingxuan Gao
Wenming Yang
33
0
0
14 Oct 2024
V2M: Visual 2-Dimensional Mamba for Image Representation Learning
Chengkun Wang
Wenzhao Zheng
Yuanhui Huang
Jie Zhou
Jiwen Lu
Mamba
30
0
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Y. Zou
Tatsunori Hashimoto
VLM
67
3
0
14 Oct 2024
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
Tianshi Xu
Shuzhang Zhong
Wenxuan Zeng
Runsheng Wang
Meng Li
MQ
29
0
0
12 Oct 2024
ALVIN: Active Learning Via INterpolation
Michalis Korakakis
Andreas Vlachos
Adrian Weller
28
0
0
11 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
31
2
0
11 Oct 2024
The Effects of Hallucinations in Synthetic Training Data for Relation Extraction
Steven Rogulsky
Nicholas Popovic
Michael Färber
HILM
30
1
0
10 Oct 2024
AHA: Human-Assisted Out-of-Distribution Generalization and Detection
Haoyue Bai
Jifan Zhang
Robert Nowak
30
6
0
10 Oct 2024
BA-Net: Bridge Attention in Deep Neural Networks
Ronghui Zhang
Runzong Zou
Yue Zhao
Zirui Zhang
Junzhou Chen
Yue Cao
Chuan Hu
Houbing Song
33
0
0
10 Oct 2024
Understanding Adversarially Robust Generalization via Weight-Curvature Index
Yuelin Xu
Xiao Zhang
AAML
29
0
0
10 Oct 2024
3D Vision-Language Gaussian Splatting
Qucheng Peng
Benjamin Planche
Zhongpai Gao
Meng Zheng
Anwesa Choudhuri
Terrence Chen
C. L. P. Chen
Ziyan Wu
3DGS
39
4
0
10 Oct 2024
Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD Generalization
Chengtao Jian
Kai Yang
Yang Jiao
AI4TS
29
3
0
09 Oct 2024
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
Ahmed Abdullah
Nikolas Ebert
Oliver Wasenmüller
ObjD
30
1
0
09 Oct 2024
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model
Fei Xie
Weijia Zhang
Zhongdao Wang
Chao Ma
Mamba
24
3
0
09 Oct 2024
MatMamba: A Matryoshka State Space Model
Abhinav Shukla
Sai H. Vemprala
Aditya Kusupati
Ashish Kapoor
Mamba
28
0
0
09 Oct 2024
MaskBlur: Spatial and Angular Data Augmentation for Light Field Image Super-Resolution
Wentao Chao
Fuqing Duan
Yulan Guo
Guanghui Wang
32
1
0
09 Oct 2024
JPEG Inspired Deep Learning
Ahmed H. Salamah
Kaixiang Zheng
Yiwen Liu
E. Yang
27
0
0
09 Oct 2024
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Siyuan Li
Juanxi Tian
Zedong Wang
Luyuan Zhang
Zicheng Liu
Weiyang Jin
Yang Liu
Baigui Sun
Stan Z. Li
34
0
0
08 Oct 2024
Robust Domain Generalisation with Causal Invariant Bayesian Neural Networks
Gael Gendron
Michael Witbrock
Gillian Dobbie
CML
BDL
OOD
36
0
0
08 Oct 2024
Harnessing the Power of Noise: A Survey of Techniques and Applications
Reyhaneh Abdolazimi
Shengmin Jin
Pramod K. Varshney
Reza Zafarani
23
0
0
08 Oct 2024
Generalizing to any diverse distribution: uniformity, gentle finetuning and rebalancing
Andreas Loukas
Karolis Martinkus
Ed Wagstaff
Kyunghyun Cho
OOD
28
1
0
08 Oct 2024
Fill In The Gaps: Model Calibration and Generalization with Synthetic Data
Yang Ba
M. Mancenido
Rong Pan
SyDa
21
1
0
07 Oct 2024
Residual Kolmogorov-Arnold Network for Enhanced Deep Learning
Ray Congrui Yu
Sherry Wu
Jiang Gui
44
1
0
07 Oct 2024
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Yuxin Xiao
Shujian Zhang
Wenxuan Zhou
Marzyeh Ghassemi
Sanqiang Zhao
100
0
0
07 Oct 2024
Impact of Regularization on Calibration and Robustness: from the Representation Space Perspective
Jonghyun Park
Juyeop Kim
Jong-Seok Lee
23
1
0
05 Oct 2024
Training Over a Distribution of Hyperparameters for Enhanced Performance and Adaptability on Imbalanced Classification
Kelsey Lieberman
Swarna Kamlam Ravindran
Shuai Yuan
Carlo Tomasi
OOD
35
0
0
04 Oct 2024
Classification-Denoising Networks
Louis Thiry
Florentin Guth
34
0
0
04 Oct 2024
Generalizable Prompt Tuning for Vision-Language Models
Qian Zhang
VLM
VPVLM
50
0
0
04 Oct 2024
SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation
Mucong Ding
Bang An
Yuancheng Xu
Anirudh Satheesh
Furong Huang
24
1
0
03 Oct 2024
Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness
Boqian Wu
Q. Xiao
Shunxin Wang
N. Strisciuglio
Mykola Pechenizkiy
M. V. Keulen
D. Mocanu
Elena Mocanu
OOD
3DH
52
0
0
03 Oct 2024
MONICA: Benchmarking on Long-tailed Medical Image Classification
Lie Ju
Siyuan Yan
Yukun Zhou
Yang Nan
Xiaodan Xing
Peibo Duan
Zongyuan Ge
57
0
0
02 Oct 2024
TAEGAN: Generating Synthetic Tabular Data For Data Augmentation
Jiayu Li
Zilong Zhao
Kevin Yee
Uzair Javaid
Biplab Sikdar
LMTD
37
1
0
02 Oct 2024
Data Extrapolation for Text-to-image Generation on Small Datasets
Senmao Ye
Fei Liu
33
0
0
02 Oct 2024
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks
Edan Kinderman
Itay Hubara
Haggai Maron
Daniel Soudry
MoMe
47
0
0
02 Oct 2024
DyMix: Dynamic Frequency Mixup Scheduler based Unsupervised Domain Adaptation for Enhancing Alzheimer's Disease Identification
Yooseung Shin
Kwanseok Oh
Heung-Il Suk
32
0
0
02 Oct 2024
ProxiMix: Enhancing Fairness with Proximity Samples in Subgroups
Jingyu Hu
Jun Hong
Mengnan Du
Weiru Liu
26
0
0
02 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
52
2
0
02 Oct 2024
WiGNet: Windowed Vision Graph Neural Network
Gabriele Spadaro
Marco Grangetto
A. Fiandrotti
Enzo Tartaglione
Jhony H. Giraldo
18
0
0
01 Oct 2024
Exploring Empty Spaces: Human-in-the-Loop Data Augmentation
Catherine Yeh
Donghao Ren
Yannick Assogba
Dominik Moritz
Fred Hohman
36
0
0
01 Oct 2024
Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation
Vlad-Cristian Matei
Iulian-Marius Taiatu
Razvan-Alexandru Smadu
Dumitru-Clementin Cercel
19
1
0
30 Sep 2024
Characterizing Model Robustness via Natural Input Gradients
Adrian Rodriguez-Munoz
Tongzhou Wang
Antonio Torralba
AAML
38
1
0
30 Sep 2024
Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization
Nikos Efthymiadis
Giorgos Tolias
Ondřej Chum
OOD
20
0
0
29 Sep 2024
DropEdge not Foolproof: Effective Augmentation Method for Signed Graph Neural Networks
Zeyu Zhang
Lu Li
Shuyan Wan
Sijie Wang
Zhiyi Wang
Zhiyuan Lu
Dong Hao
Wanli Li
33
2
0
29 Sep 2024
UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception
Chuang Chen
X. Sun
Zhi Liu
31
0
0
27 Sep 2024
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation
Chaomin Shen
Yaomin Huang
Haokun Zhu
Jinsong Fan
Guixu Zhang
21
0
0
27 Sep 2024
Bridging OOD Detection and Generalization: A Graph-Theoretic View
Han Wang
Yixuan Li
CML
34
0
0
26 Sep 2024
ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning
Song Wang
Zhongdao Wang
Jiawei Yu
Wentong Li
Bailan Feng
Junbo Chen
Jianke Zhu
UQCV
31
3
0
26 Sep 2024
Previous
1
2
3
...
6
7
8
...
98
99
100
Next