v1v2v3 (latest)

Decoupled Weight Decay Regularization

14 November 2017

I. Loshchilov

Katharina Eggensperger

OffRL

ArXiv (abs)PDF HTML Github (275★)

Papers citing "Decoupled Weight Decay Regularization"

50 / 1,216 papers shown

ChainNet: Structured Metaphor and Metonymy in WordNet

125

29 Mar 2024

Noise-Robust Keyword Spotting through Self-supervised Pretraining

220

27 Mar 2024

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

181

27 Mar 2024

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models

237

27 Mar 2024

MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD MappingEuropean Conference on Computer Vision (ECCV), 2024

272

23 Mar 2024

M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling

...

393

20 Mar 2024

CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation

Wenqi Zhu

Jiale Cao

Jin Xie

Shuangming Yang

Yanwei Pang

VLM CLIP

292

19 Mar 2024

FinLlama: Financial Sentiment Classification for Algorithmic Trading Applications

Thanos Konstantinidis

183

18 Mar 2024

A Versatile Framework for Multi-scene Person Re-identification

327

17 Mar 2024

Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge ExtractionInternational Conference on Language Resources and Evaluation (LREC), 2024

Liang Ding

236

15 Mar 2024

Single Domain Generalization for Crowd CountingComputer Vision and Pattern Recognition (CVPR), 2024

Zhuoxuan Peng

S.-H. Gary Chan

262

14 Mar 2024

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF PriorsEuropean Conference on Computer Vision (ECCV), 2024

Yue Wang

Hang Zhao

283

14 Mar 2024

Identity-aware Dual-constraint Network for Cloth-Changing Person Re-identification

422

13 Mar 2024

Fine-tuning Large Language Models with Sequential InstructionsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

413

12 Mar 2024

generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation

Mennatallah El-Assady

275

12 Mar 2024

CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization PerspectiveComputer Vision and Pattern Recognition (CVPR), 2024

Shunsuke Yasuki

Masato Taki

AAML

320

11 Mar 2024

Probabilistic Neural CircuitsAAAI Conference on Artificial Intelligence (AAAI), 2024

Pedro Zuidberg Dos Martires

TPM

159

10 Mar 2024

Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation

Paweł Antoni Pierzchlewicz

Caio da Silva

R. J. Cotton

Fabian H. Sinz

237

10 Mar 2024

Calibrating Large Language Models Using Their Generations OnlyAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

782

09 Mar 2024

XPSR: Cross-modal Priors for Diffusion-based Image Super-ResolutionEuropean Conference on Computer Vision (ECCV), 2024

262

08 Mar 2024

The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Random Access in Cellular IoTIEEE Internet of Things Journal (IEEE IoT J.), 2024

Xin Zhu

Ahmet Enis Cetin

204

08 Mar 2024

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

...

2.4K

2,755

05 Mar 2024

SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection

347

05 Mar 2024

RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models

Saeed Najafi

Alona Fyshe

254

04 Mar 2024

OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

306

132

04 Mar 2024

NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions

Marta Andronic

George A. Constantinides

228

29 Feb 2024

PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds

Guang Chen

207

29 Feb 2024

Exploring Data-Efficient Adaptation of Large Language Models for Code Generation

363

29 Feb 2024

ConvDTW-ACS: Audio Segmentation for Track Type Detection During Car Manufacturing

Friedrich Wolf-Monheim

Sam Michiels

Danny Hughes

183

28 Feb 2024

Where Do We Go from Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions

158

26 Feb 2024

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

Zhe Yang

Peiyi Wang

Qingxiu Dong

Liang Chen

Zhifang Sui

276

25 Feb 2024

Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving

Masayoshi Tomizuka

Wei Zhan

Yuning Chai

Xin Huang

3DPC

198

23 Feb 2024

Hands-Free VR

J. Fernandez

Jae Joong Lee

Santiago Andrés Serrano Vacca

Alejandra Magana

Bedrich Benes

V. Popescu

131

23 Feb 2024

Do Efficient Transformers Really Save Computation?

259

21 Feb 2024

LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation

Ikuya Yamada

Ryokan Ri

KELM

282

18 Feb 2024

Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish

Szymon Ruciñski

233

15 Feb 2024

Can LLMs Learn New Concepts Incrementally without Forgetting?

268

13 Feb 2024

EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages

Johnathan Mercer

104

12 Feb 2024

KVQ: Kwai Video Quality Assessment for Short-form VideosComputer Vision and Pattern Recognition (CVPR), 2024

Xin Li

297

11 Feb 2024

Pushing Boundaries: Mixup's Influence on Neural Collapse

209

09 Feb 2024

Mesoscale Traffic Forecasting for Real-Time Bottleneck and Shockwave Prediction

169

08 Feb 2024

Question Aware Vision Transformer for Multimodal Reasoning

299

08 Feb 2024

Improved Generalization of Weight Space Networks via Augmentations

330

06 Feb 2024

Sentiment-enhanced Graph-based Sarcasm Explanation in DialogueIEEE transactions on multimedia (IEEE TMM), 2024

449

06 Feb 2024

Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

169

05 Feb 2024

Stable and Robust Deep Learning By Hyperbolic Tangent Exponential Linear Unit (TeLU)

Alfredo Fernandez

Ankur Mali

121

05 Feb 2024

Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization

278

03 Feb 2024

From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers

412

02 Feb 2024

Sample, estimate, aggregate: A recipe for causal discovery foundation models

488

02 Feb 2024

Development and Adaptation of Robotic Vision in the Real-World: the Challenge of Door DetectionJournal of Field Robotics (JFR), 2024

279

31 Jan 2024