ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.05101
  4. Cited By
Decoupled Weight Decay Regularization
v1v2v3 (latest)

Decoupled Weight Decay Regularization

14 November 2017
I. Loshchilov
Katharina Eggensperger
    OffRL
ArXiv (abs)PDFHTMLGithub (275★)

Papers citing "Decoupled Weight Decay Regularization"

50 / 1,216 papers shown
Randomized Geometric Algebra Methods for Convex Neural Networks
Randomized Geometric Algebra Methods for Convex Neural Networks
Yifei Wang
Sungyoon Kim
Paul Chu
Indu Subramaniam
Mert Pilanci
AAML
329
1
0
04 Jun 2024
Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework
  for Chinese Spelling Check
Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Haiming Wu
Hanqing Zhang
Richeng Xuan
Dawei Song
214
3
0
04 Jun 2024
FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation
FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation
Kun Chen
Tao Chen
Peng Ye
Hao Chen
Kang Chen
Tao Han
Wanli Ouyang
Mengwei He
201
11
0
03 Jun 2024
Estimating Canopy Height at Scale
Estimating Canopy Height at Scale
Jan Pauls
Max Zimmer
Una M. Kelly
Martin Schwartz
Sassan Saatchi
P. Ciais
Sebastian Pokutta
Martin Brandt
Fabian Gieseke
293
19
0
03 Jun 2024
Communication-Efficient Distributed Deep Learning via Federated Dynamic
  Averaging
Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Michail Theologitis
Georgios Frangias
Georgios Anestis
V. Samoladas
Antonios Deligiannakis
FedML
444
2
0
31 May 2024
Improving code-mixed hate detection by native sample mixing: A case
  study for Hindi-English code-mixed scenario
Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario
Debajyoti Mazumder
Aakash Kumar
Jasabanta Patro
166
5
0
31 May 2024
Jina CLIP: Your CLIP Model Is Also Your Text Retriever
Jina CLIP: Your CLIP Model Is Also Your Text Retriever
Andreas Koukounas
Georgios Mastrapas
Michael Gunther
Bo Wang
Scott Martens
...
Saahil Ognawala
Susana Guzman
Maximilian Werk
Nan Wang
Han Xiao
VLM
300
38
0
30 May 2024
Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection
Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection
Prashanth Chandran
Gaspard Zoss
Paulo F. U. Gotardo
Derek Bradley
CVBM
283
4
0
30 May 2024
MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Junjie Wang
Guangjing Yang
Wentao Chen
Huahui Yi
Xiaohu Wu
Qicheng Lao
MoEALM
267
0
0
29 May 2024
Multi-objective Cross-task Learning via Goal-conditioned GPT-based
  Decision Transformers for Surgical Robot Task Automation
Multi-objective Cross-task Learning via Goal-conditioned GPT-based Decision Transformers for Surgical Robot Task Automation
Jiawei Fu
Yonghao Long
Kai-xiang Chen
Wang Wei
Qi Dou
MedIm
301
5
0
29 May 2024
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via
  Diffusion Transformers
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Jun Zheng
Fuwei Zhao
Youjiang Xu
Xin Dong
Xiaodan Liang
VGenDiffM
287
9
0
28 May 2024
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across
  Any Scenario
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
Yuhan Li
Hao Zhou
Wenxiang Shang
Ran Lin
Xuanhong Chen
Bingbing Ni
DiffM
198
15
0
28 May 2024
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in
  Alignment
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
Keming Lu
Bowen Yu
Fei Huang
Yang Fan
Runji Lin
Chang Zhou
MoMe
205
27
0
28 May 2024
Language-Driven Interactive Traffic Trajectory Generation
Language-Driven Interactive Traffic Trajectory Generation
Junkai Xia
Chenxin Xu
Qingyao Xu
Chen Xie
Yanfeng Wang
Siheng Chen
312
17
0
24 May 2024
Distill-then-prune: An Efficient Compression Framework for Real-time
  Stereo Matching Network on Edge Devices
Distill-then-prune: An Efficient Compression Framework for Real-time Stereo Matching Network on Edge Devices
Baiyu Pan
Jichao Jiao
Jianxin Pang
Jun Cheng
191
8
0
20 May 2024
Towards Gradient-based Time-Series Explanations through a SpatioTemporal
  Attention Network
Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network
Min Hun Lee
AI4TSViTFAtt
221
3
0
18 May 2024
DINO as a von Mises-Fisher mixture model
DINO as a von Mises-Fisher mixture modelInternational Conference on Learning Representations (ICLR), 2024
Hariprasath Govindarajan
Per Sidén
Jacob Roll
Fredrik Lindsten
264
17
0
17 May 2024
FFF: Fixing Flawed Foundations in contrastive pre-training results in
  very strong Vision-Language models
FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language modelsComputer Vision and Pattern Recognition (CVPR), 2024
Adrian Bulat
Yassine Ouali
Georgios Tzimiropoulos
VLM
262
8
0
16 May 2024
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Jie Liang
Radu Timofte
Qiaosi Yi
Shuai Liu
Lingchen Sun
Rongyuan Wu
Xindong Zhang
Huiyu Zeng
Lei Zhang
292
21
0
16 May 2024
Desk-AId: Humanitarian Aid Desk Assessment with Geospatial AI for
  Predicting Landmine Areas
Desk-AId: Humanitarian Aid Desk Assessment with Geospatial AI for Predicting Landmine Areas
Flavio Cirillo
Gürkan Solmaz
Yi-Hsuan Peng
Christian Bizer
Martin Jebens
164
0
0
15 May 2024
Using Machine Translation to Augment Multilingual Classification
Using Machine Translation to Augment Multilingual ClassificationEuropean Association for Machine Translation Conferences/Workshops (EAMT), 2024
Adam King
207
1
0
09 May 2024
BenthicNet: A global compilation of seafloor images for deep learning applications
BenthicNet: A global compilation of seafloor images for deep learning applications
Joakim Bruslund Haurum
B. Misiuk
Isaac Xu
Shakhboz Abdulazizov
A. R. Baroi
...
Jordan A. Thomson
Brittany R. Wilson
Melisa C. Wong
Craig J. Brown
Thomas Trappenberg
346
12
0
08 May 2024
Bridging the Bosphorus: Advancing Turkish Large Language Models through
  Strategies for Low-Resource Language Adaptation and Benchmarking
Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking
Emre Can Acikgoz
Mete Erdogan
Deniz Yuret
217
20
0
07 May 2024
Topicwise Separable Sentence Retrieval for Medical Report Generation
Topicwise Separable Sentence Retrieval for Medical Report Generation
Junting Zhao
Yang Zhou
Zhihao Chen
Huazhu Fu
Liang Wan
MedIm
227
3
0
07 May 2024
Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text
  Classification via Anchor Generation and Classification Reframing
Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification ReframingAAAI Conference on Artificial Intelligence (AAAI), 2024
Han Liu
Siyang Zhao
Xiaotong Zhang
Feng Zhang
Wei Wang
Fenglong Ma
Hongyang Chen
Hong Yu
Xianchao Zhang
VLM
144
6
0
06 May 2024
AB-Training: A Communication-Efficient Approach for Distributed Low-Rank
  Learning
AB-Training: A Communication-Efficient Approach for Distributed Low-Rank Learning
D. Coquelin
Katherina Flügel
Marie Weiel
Nicholas Kiefer
Muhammed Öz
Charlotte Debus
Achim Streit
Markus Goetz
348
0
0
02 May 2024
RLHF from Heterogeneous Feedback via Personalization and Preference
  Aggregation
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
421
57
0
30 Apr 2024
Performance-Aligned LLMs for Generating Fast Code
Performance-Aligned LLMs for Generating Fast Code
Daniel Nichols
Pranav Polasam
Harshitha Menon
Aniruddha Marathe
T. Gamblin
A. Bhatele
217
20
0
29 Apr 2024
Event-based Video Frame Interpolation with Edge Guided Motion Refinement
Event-based Video Frame Interpolation with Edge Guided Motion Refinement
Yuhan Liu
Yongjian Deng
Hao Chen
Bochen Xie
Youfu Li
Zhen Yang
248
1
0
28 Apr 2024
Improving Smart Contract Security with Contrastive Learning-based
  Vulnerability Detection
Improving Smart Contract Security with Contrastive Learning-based Vulnerability Detection
Yizhou Chen
Zeyu Sun
Zhihao Gong
Dan Hao
AAML
181
48
0
27 Apr 2024
3D Face Modeling via Weakly-supervised Disentanglement Network joint
  Identity-consistency Prior
3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior
Guohao Li
Hongyu Yang
Di Huang
Yun Wang
CVBMCoGe
249
2
0
25 Apr 2024
Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
Ayumu Saito
Prachi Kudeshia
Jiju Poovvancheri
3DPC
570
15
0
25 Apr 2024
Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP)
  for Enhanced Breast Cancer Diagnosis with Multi-view Mammography
Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography
Xuxin Chen
Yuheng Li
Mingzhe Hu
Ella Salari
Xiaoqian Chen
Richard L. J. Qiu
Bin Zheng
Xiaofeng Yang
VLM
213
13
0
24 Apr 2024
Real-Time Compressed Sensing for Joint Hyperspectral Image Transmission
  and Restoration for CubeSat
Real-Time Compressed Sensing for Joint Hyperspectral Image Transmission and Restoration for CubeSat
Chih-Chung Hsu
Chih-Yu Jian
Eng-Shen Tu
Chia-Ming Lee
Guan-Lin Chen
133
11
0
24 Apr 2024
Better Synthetic Data by Retrieving and Transforming Existing Datasets
Better Synthetic Data by Retrieving and Transforming Existing Datasets
Saumya Gandhi
Ritu Gala
Vijay Viswanathan
Tongshuang Wu
Graham Neubig
SyDa
401
42
0
22 Apr 2024
MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets
MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets
Zeyu Li
Ruitong Gan
Chuanchen Luo
Yuxi Wang
Jiaheng Liu
Ziwei Zhu
Qing Li
Xucheng Yin
Zhaoxiang Zhang
Junran Peng
DiffM
294
3
0
22 Apr 2024
360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos
360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos
Yinzhe Xu
Huajian Huang
Yingshu Chen
Sai-Kit Yeung
VOS
326
5
0
22 Apr 2024
Find The Gap: Knowledge Base Reasoning For Visual Question Answering
Find The Gap: Knowledge Base Reasoning For Visual Question Answering
Elham J. Barezi
Parisa Kordjamshidi
215
3
0
16 Apr 2024
Consistency and Uncertainty: Identifying Unreliable Responses From
  Black-Box Vision-Language Models for Selective Visual Question Answering
Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Zaid Khan
Yun Fu
AAML
252
21
0
16 Apr 2024
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
Felix Taubner
Prashant Raina
Mathieu Tuli
Eu Wern Teh
Chul Lee
Jinmiao Huang
3DHCVBM
185
10
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
203
28
0
15 Apr 2024
GeMQuAD : Generating Multilingual Question Answering Datasets from Large
  Language Models using Few Shot Learning
GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning
Amani Namboori
Shivam Mangale
Andrew Rosenbaum
Saleh Soltan
207
3
0
14 Apr 2024
ToNER: Type-oriented Named Entity Recognition with Generative Language
  Model
ToNER: Type-oriented Named Entity Recognition with Generative Language Model
Guochao Jiang
Ziqin Luo
Yuchen Shi
Dixuan Wang
Jiaqing Liang
Deqing Yang
184
20
0
14 Apr 2024
From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian
  Language Representation
From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation
Artur Kiulian
Anton Polishko
M. Khandoga
Oryna Chubych
Jack Connor
Raghav Ravishankar
A. Shirawalmath
319
11
0
14 Apr 2024
Rethinking Iterative Stereo Matching from Diffusion Bridge Model
  Perspective
Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective
Yuguang Shi
DiffM
239
2
0
13 Apr 2024
OPSD: an Offensive Persian Social media Dataset and its baseline
  evaluations
OPSD: an Offensive Persian Social media Dataset and its baseline evaluations
M. Safayani
Amir Sartipi
Amir Hossein Ahmadi
Parniyan Jalali
Amir Hossein Mansouri
Mohammad Bisheh-Niasar
Zahra Pourbahman
101
0
0
08 Apr 2024
Progressive Alignment with VLM-LLM Feature to Augment Defect
  Classification for the ASE Dataset
Progressive Alignment with VLM-LLM Feature to Augment Defect Classification for the ASE Dataset
Chih-Chung Hsu
Chia-Ming Lee
Chun-Hung Sun
Kuang-Ming Wu
157
0
0
08 Apr 2024
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Yutong Xie
Qi Chen
Sinuo Wang
Minh-Son To
Iris Lee
Ee Win Khoo
Kerolos Hendy
Daniel Koh
Yong-quan Xia
Qi Wu
MedImLM&MA
194
12
0
07 Apr 2024
PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny
  Detection in Italian Tweets
PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian TweetsInternational Conference on Language Resources and Evaluation (LREC), 2024
Arianna Muti
Federico Ruggeri
Cagri Toraman
Lorenzo Musetti
Samuel Algherini
Silvia Ronchi
G. Saretto
Caterina Zapparoli
Alberto Barrón-Cedeño
107
7
0
03 Apr 2024
Adaptive Cross-lingual Text Classification through In-Context One-Shot
  Demonstrations
Adaptive Cross-lingual Text Classification through In-Context One-Shot DemonstrationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Emilio Villa-Cueva
A. P. López-Monroy
Fernando Sánchez-Vega
Thamar Solorio
VLM
196
7
0
03 Apr 2024
Previous
123...789...232425
Next
Page 8 of 25
Pageof 25