v1v2v3 (latest)

Decoupled Weight Decay Regularization

14 November 2017

I. Loshchilov

Katharina Eggensperger

OffRL

ArXiv (abs)PDF HTML Github (275★)

Papers citing "Decoupled Weight Decay Regularization"

50 / 1,216 papers shown

Randomized Geometric Algebra Methods for Convex Neural Networks

329

04 Jun 2024

Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check

214

04 Jun 2024

FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation

Kun Chen

Tao Chen

Peng Ye

Wanli Ouyang

201

03 Jun 2024

Estimating Canopy Height at Scale

293

03 Jun 2024

Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging

Antonios Deligiannakis

FedML

444

31 May 2024

Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario

Debajyoti Mazumder

Aakash Kumar

Jasabanta Patro

166

31 May 2024

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Bo Wang

...

300

30 May 2024

Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection

283

30 May 2024

MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning

267

29 May 2024

Multi-objective Cross-task Learning via Goal-conditioned GPT-based Decision Transformers for Surgical Robot Task Automation

301

29 May 2024

VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers

Xiaodan Liang

287

28 May 2024

AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario

198

28 May 2024

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

Keming Lu

Bowen Yu

Fei Huang

Yang Fan

Runji Lin

Chang Zhou

MoMe

205

28 May 2024

Language-Driven Interactive Traffic Trajectory Generation

Siheng Chen

312

24 May 2024

Distill-then-prune: An Efficient Compression Framework for Real-time Stereo Matching Network on Edge Devices

191

20 May 2024

Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network

Min Hun Lee

AI4TS ViT FAtt

221

18 May 2024

DINO as a von Mises-Fisher mixture modelInternational Conference on Learning Representations (ICLR), 2024

Hariprasath Govindarajan

Per Sidén

Jacob Roll

Fredrik Lindsten

264

17 May 2024

FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language modelsComputer Vision and Pattern Recognition (CVPR), 2024

Adrian Bulat

Yassine Ouali

Georgios Tzimiropoulos

VLM

262

16 May 2024

NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge

Radu Timofte

Lei Zhang

292

16 May 2024

Desk-AId: Humanitarian Aid Desk Assessment with Geospatial AI for Predicting Landmine Areas

164

15 May 2024

Using Machine Translation to Augment Multilingual ClassificationEuropean Association for Machine Translation Conferences/Workshops (EAMT), 2024

Adam King

207

09 May 2024

BenthicNet: A global compilation of seafloor images for deep learning applications

Joakim Bruslund Haurum

...

346

08 May 2024

Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking

Emre Can Acikgoz

Mete Erdogan

Deniz Yuret

217

07 May 2024

Topicwise Separable Sentence Retrieval for Medical Report Generation

227

07 May 2024

Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification ReframingAAAI Conference on Artificial Intelligence (AAAI), 2024

144

06 May 2024

AB-Training: A Communication-Efficient Approach for Distributed Low-Rank Learning

348

02 May 2024

RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation

Chanwoo Park

Mingyang Liu

Dingwen Kong

Kaiqing Zhang

Asuman Ozdaglar

421

30 Apr 2024

Performance-Aligned LLMs for Generating Fast Code

217

29 Apr 2024

Event-based Video Frame Interpolation with Edge Guided Motion Refinement

Hao Chen

248

28 Apr 2024

Improving Smart Contract Security with Contrastive Learning-based Vulnerability Detection

181

27 Apr 2024

3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior

249

25 Apr 2024

Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud

570

25 Apr 2024

Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography

Richard L. J. Qiu

213

24 Apr 2024

Real-Time Compressed Sensing for Joint Hyperspectral Image Transmission and Restoration for CubeSat

Chih-Chung Hsu

Chih-Yu Jian

Eng-Shen Tu

Chia-Ming Lee

Guan-Lin Chen

133

24 Apr 2024

Better Synthetic Data by Retrieving and Transforming Existing Datasets

Graham Neubig

401

22 Apr 2024

MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets

Jiaheng Liu

Junran Peng

294

22 Apr 2024

360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos

326

22 Apr 2024

Find The Gap: Knowledge Base Reasoning For Visual Question Answering

Elham J. Barezi

Parisa Kordjamshidi

215

16 Apr 2024

Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

Zaid Khan

Yun Fu

AAML

252

16 Apr 2024

3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow

185

15 Apr 2024

Magic Clothing: Controllable Garment-Driven Image Synthesis

203

15 Apr 2024

GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning

207

14 Apr 2024

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

Guochao Jiang

184

14 Apr 2024

From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation

319

14 Apr 2024

Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective

Yuguang Shi

DiffM

239

13 Apr 2024

OPSD: an Offensive Persian Social media Dataset and its baseline evaluations

Amir Hossein Mansouri

Mohammad Bisheh-Niasar

Zahra Pourbahman

101

08 Apr 2024

Progressive Alignment with VLM-LLM Feature to Augment Defect Classification for the ASE Dataset

Chih-Chung Hsu

Chia-Ming Lee

Chun-Hung Sun

Kuang-Ming Wu

157

08 Apr 2024

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

Qi Chen

Qi Wu

194

07 Apr 2024

PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian TweetsInternational Conference on Language Resources and Evaluation (LREC), 2024

Arianna Muti

Alberto Barrón-Cedeño

107

03 Apr 2024

Adaptive Cross-lingual Text Classification through In-Context One-Shot DemonstrationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Emilio Villa-Cueva

A. P. López-Monroy

Fernando Sánchez-Vega

Thamar Solorio

VLM

196

03 Apr 2024