ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.05101
  4. Cited By
Decoupled Weight Decay Regularization
v1v2v3 (latest)

Decoupled Weight Decay Regularization

14 November 2017
I. Loshchilov
Katharina Eggensperger
    OffRL
ArXiv (abs)PDFHTMLGithub (275★)

Papers citing "Decoupled Weight Decay Regularization"

50 / 1,216 papers shown
FAMSeg: Fetal Femur and Cranial Ultrasound Segmentation Using Feature-Aware Attention and Mamba Enhancement
FAMSeg: Fetal Femur and Cranial Ultrasound Segmentation Using Feature-Aware Attention and Mamba Enhancement
Jie He
Minglang Chen
Minying Lu
Bocheng Liang
Junming Wei
Guiyan Peng
Jiaxi Chen
Ying Tan
Mamba
182
0
0
09 Jun 2025
Cultural Bias Matters: A Cross-Cultural Benchmark Dataset and Sentiment-Enriched Model for Understanding Multimodal Metaphors
Cultural Bias Matters: A Cross-Cultural Benchmark Dataset and Sentiment-Enriched Model for Understanding Multimodal MetaphorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Senqi Yang
Dongyu Zhang
Jing Ren
Ziqi Xu
Xiuzhen Zhang
Yiliao Song
Hongfei Lin
Xiwei Xu
207
3
0
08 Jun 2025
Debiasing Online Preference Learning via Preference Feature Preservation
Debiasing Online Preference Learning via Preference Feature PreservationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Dongyoung Kim
Jinsung Yoon
Jinwoo Shin
Jaehyung Kim
210
0
0
06 Jun 2025
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media ManipulationComputer Vision and Pattern Recognition (CVPR), 2025
Yiheng Li
Yang Yang
Zichang Tan
Huan Liu
Weihua Chen
Xu Zhou
Zhen Lei
191
2
0
06 Jun 2025
When can in-context learning generalize out of task distribution?
When can in-context learning generalize out of task distribution?
Chase Goddard
Lindsay M. Smith
Vudtiwat Ngampruetikorn
David J. Schwab
OOD
157
3
0
05 Jun 2025
Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning
Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning
Liang Chen
Xueting Han
Li Shen
Jing Bai
Kam-Fai Wong
AAML
314
5
0
04 Jun 2025
DS-VTON: An Enhanced Dual-Scale Coarse-to-Fine Framework for Virtual Try-On
DS-VTON: An Enhanced Dual-Scale Coarse-to-Fine Framework for Virtual Try-On
Xianbing Sun
Y. Hong
Jiahui Zhan
Jun Lan
Huijia Zhu
Weiqiang Wang
Liqing Zhang
Jianfu Zhang
DiffM
235
1
0
01 Jun 2025
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection
Wayne Zhang
Changjiang Jiang
Zhonghao Zhang
Chenyang Si
Fengchang Yu
...
Xinbin Yuan
Yifei Bi
Ming Zhao
Zian Zhou
Caifeng Shan
328
8
0
01 Jun 2025
FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models
FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models
Xuan Xu
Fufang Wen
Beilin Chu
Zhibing Fu
Qinhong Lin
Jiaqi Liu
Binjie Fei
Zhongliang Yang
Linna Zhou
Yu Li
263
4
0
31 May 2025
MGS3: A Multi-Granularity Self-Supervised Code Search Framework
MGS3: A Multi-Granularity Self-Supervised Code Search FrameworkKnowledge Discovery and Data Mining (KDD), 2025
Rui Li
Junfeng Kang
Qi Liu
Liyang He
Zheng Zhang
Yunhao Sha
Linbo Zhu
Zhenya Huang
178
2
0
30 May 2025
Taming Transformer Without Using Learning Rate Warmup
Taming Transformer Without Using Learning Rate WarmupInternational Conference on Learning Representations (ICLR), 2025
Xianbiao Qi
Yelin He
Jiaquan Ye
Chun-Guang Li
Bojia Zi
Xili Dai
Qin Zou
Rong Xiao
173
3
0
28 May 2025
Hierarchical Material Recognition from Local Appearance
Hierarchical Material Recognition from Local Appearance
Matthew Beveridge
Shree K. Nayar
345
3
0
28 May 2025
Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings
Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings
Angéline Pouget
Mohammad Yaghini
Stephan Rabanser
Nicolas Papernot
198
1
0
28 May 2025
STACI: Spatio-Temporal Aleatoric Conformal Inference
STACI: Spatio-Temporal Aleatoric Conformal Inference
Brandon Feng
David K. Park
Xihaier Luo
Arantxa Urdangarin
Shinjae Yoo
Brian J. Reich
197
0
0
27 May 2025
How Do Transformers Learn Variable Binding in Symbolic Programs?
How Do Transformers Learn Variable Binding in Symbolic Programs?
Yiwei Wu
Atticus Geiger
Raphaël Millière
NAI
174
7
0
27 May 2025
Frictional Agent Alignment Framework: Slow Down and Don't Break Things
Frictional Agent Alignment Framework: Slow Down and Don't Break ThingsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Abhijnan Nath
Carine Graff
Andrei Bachinin
Nikhil Krishnaswamy
305
4
0
26 May 2025
Advancing Video Self-Supervised Learning via Image Foundation Models
Advancing Video Self-Supervised Learning via Image Foundation ModelsPattern Recognition Letters (Pattern Recogn. Lett.), 2025
Jingwei Wu
Zhewei Huang
Chang Liu
208
0
0
25 May 2025
Latent Mamba Operator for Partial Differential Equations
Latent Mamba Operator for Partial Differential Equations
Karn Tiwari
Niladri Dutta
N. M. A. Krishnan
P. PrathoshA
MambaAI4CE
300
0
0
25 May 2025
What Do You Need for Diverse Trajectory Composition in Diffusion Planning?
What Do You Need for Diverse Trajectory Composition in Diffusion Planning?
Quentin Clark
Florian Shkurti
1.1K
0
0
23 May 2025
High-Fidelity Functional Ultrasound Reconstruction via A Visual Auto-Regressive Framework
High-Fidelity Functional Ultrasound Reconstruction via A Visual Auto-Regressive Framework
Xuhang Chen
Zhuo Li
Yanyan Shen
Mufti Mahmud
Hieu Pham
Chi-Man Pun
Shuqiang Wang
203
3
0
23 May 2025
Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression
Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression
Linfeng Qi
Zhaoyang Jia
Jiahao Li
Bin Li
Houqiang Li
Yan Lu
525
7
0
22 May 2025
PaTH Attention: Position Encoding via Accumulating Householder Transformations
PaTH Attention: Position Encoding via Accumulating Householder Transformations
Songlin Yang
Yikang Shen
Kaiyue Wen
Shawn Tan
Mayank Mishra
Liliang Ren
Rameswar Panda
Yoon Kim
866
12
0
22 May 2025
DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution
DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution
Zheng Chen
Zichen Zou
Kewei Zhang
Xiongfei Su
Xin Yuan
Yong Guo
Yulun Zhang
DiffMVGen
443
9
0
22 May 2025
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning
Biao Yi
Tiansheng Huang
Baolei Zhang
Tong Li
Lihai Nie
Zheli Liu
Li Shen
MUAAML
307
5
0
22 May 2025
Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
Thibaud Gloaguen
Mark Vero
Robin Staab
Martin Vechev
AAML
480
0
0
22 May 2025
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
DiffM
320
3
0
22 May 2025
From Generic Empathy to Personalized Emotional Support: A Self-Evolution Framework for User Preference Alignment
From Generic Empathy to Personalized Emotional Support: A Self-Evolution Framework for User Preference Alignment
Jing Ye
Lu Xiang
Yaping Zhang
Chengqing Zong
247
4
0
22 May 2025
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document RankingInternational Conference on Information and Knowledge Management (CIKM), 2024
Songhao Wu
Quan Tu
Mingjie Zhong
Hong Liu
Jia Xu
Jinjie Gu
Rui Yan
319
0
0
20 May 2025
Unify Graph Learning with Text: Unleashing LLM Potentials for Session Search
Unify Graph Learning with Text: Unleashing LLM Potentials for Session SearchThe Web Conference (WWW), 2024
Songhao Wu
Quan Tu
Hong Liu
Jia Xu
Zhongyi Liu
Guannan Zhang
Ran Wang
Xiuying Chen
Rui Yan
365
9
0
20 May 2025
Flexible-weighted Chamfer Distance: Enhanced Objective Function for Point Cloud Completion
Flexible-weighted Chamfer Distance: Enhanced Objective Function for Point Cloud Completion
Jie Li
Shengwei Tian
Long Yu
Xin Ning
448
0
0
20 May 2025
Krikri: Advancing Open Large Language Models for Greek
Krikri: Advancing Open Large Language Models for Greek
Dimitris Roussis
Leon Voukoutis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
ALM
409
6
0
19 May 2025
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Fitsum Gaim
Hoyun Song
Huije Lee
Changgeon Ko
Eui Jun Hwang
Jong C. Park
221
2
0
17 May 2025
X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models
X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models
Valentina Bazyleva
Nicolo Bonettini
Gaurav Bharaj
DiffM
259
2
0
16 May 2025
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis
Weiming Zhang
Lingyue Fu
Qingyao Li
Kounianhua Du
Jianghao Lin
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
176
0
0
14 May 2025
Contactless Cardiac Pulse Monitoring Using Event Cameras
Contactless Cardiac Pulse Monitoring Using Event Cameras
Mohamed Moustafa
Joseph Lemley
Peter Corcoran
199
1
0
14 May 2025
ExEBench: Benchmarking Foundation Models on Extreme Earth Events
ExEBench: Benchmarking Foundation Models on Extreme Earth Events
Shan Zhao
Zhitong Xiong
Jie Zhao
Xiao Xiang Zhu
229
2
0
13 May 2025
Adaptive Latent-Space Constraints in Personalized Federated Learning
Adaptive Latent-Space Constraints in Personalized Federated Learning
Sana Ayromlou
Fatemeh Tavakoli
D. B. Emerson
FedML
256
0
0
12 May 2025
MedEIR: A Specialized Medical Embedding Model for Enhanced Information Retrieval
MedEIR: A Specialized Medical Embedding Model for Enhanced Information Retrieval
Anand Selvadurai
Jasheen Shaik
Girish Chandrasekar
ShriRadhaKrishnan Balamurugan
Eswara Reddy
RALM
72
0
0
12 May 2025
Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion
Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion
Timing Li
Bing Cao
Q. Hu
Bin Xiao
Qinghua Hu
218
0
0
11 May 2025
Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping
Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping
Jiepan Li
He Huang
Yu Sheng
Xu Tan
Wei He
228
2
0
08 May 2025
Quiet Feature Learning in Algorithmic Tasks
Quiet Feature Learning in Algorithmic Tasks
Prudhviraj Naidu
Zixian Wang
Leon Bergen
R. Paturi
VLM
337
0
0
06 May 2025
Variational diffusion transformers for conditional sampling of supernovae spectra
Variational diffusion transformers for conditional sampling of supernovae spectra
Yunyi Shen
Alexander T. Gagliano
DiffM
200
2
0
05 May 2025
MISE: Meta-knowledge Inheritance for Social Media-Based Stressor Estimation
MISE: Meta-knowledge Inheritance for Social Media-Based Stressor EstimationThe Web Conference (WWW), 2025
Xin Wang
Ling Feng
Huijun Zhang
Lei Cao
Kaisheng Zeng
Qi Li
Yang Ding
Yi Dai
David A. Clifton
322
2
0
03 May 2025
GENMO: A GENeralist Model for Human MOtion
GENMO: A GENeralist Model for Human MOtion
Jiefeng Li
Jinkun Cao
Haotian Zhang
Davis Rempe
Jan Kautz
Umar Iqbal
Ye Yuan
DiffMVGen
301
7
0
02 May 2025
Enhancing Health Mention Classification Performance: A Study on Advancements in Parameter Efficient Tuning
Enhancing Health Mention Classification Performance: A Study on Advancements in Parameter Efficient Tuning
Reem Abdel-Salam
M. Adewunmi
299
0
0
30 Apr 2025
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
Roman Abramov
Felix Steinbauer
Gjergji Kasneci
896
7
0
29 Apr 2025
MERA: Multimodal and Multiscale Self-Explanatory Model with Considerably Reduced Annotation for Lung Nodule Diagnosis
MERA: Multimodal and Multiscale Self-Explanatory Model with Considerably Reduced Annotation for Lung Nodule Diagnosis
Jiahao Lu
Chong Yin
Silvia Ingala
S. Darkner
M. Nielsen
Kenny Erleben
274
0
0
27 Apr 2025
PCF-Grasp: Converting Point Completion to Geometry Feature to Enhance 6-DoF Grasp
PCF-Grasp: Converting Point Completion to Geometry Feature to Enhance 6-DoF Grasp
Yaofeng Cheng
Fusheng Zha
Wei Guo
Pengfei Wang
Chao Zeng
Lining Sun
Chenguang Yang
3DPC
306
1
0
22 Apr 2025
HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis
HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis
Xuanhua Yin
Dingxin Zhang
Jianhui Yu
Weidong Cai
235
1
0
19 Apr 2025
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
CheXWorld: Exploring Image World Modeling for Radiograph Representation LearningComputer Vision and Pattern Recognition (CVPR), 2025
Yang Yue
Yulin Wang
Chenxin Tao
Pan Liu
Shiji Song
Gao Huang
MedIm
325
3
0
18 Apr 2025
Previous
123456...232425
Next