Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.05101
Cited By
v1
v2
v3 (latest)
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Katharina Eggensperger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (275★)
Papers citing
"Decoupled Weight Decay Regularization"
50 / 1,216 papers shown
FAMSeg: Fetal Femur and Cranial Ultrasound Segmentation Using Feature-Aware Attention and Mamba Enhancement
Jie He
Minglang Chen
Minying Lu
Bocheng Liang
Junming Wei
Guiyan Peng
Jiaxi Chen
Ying Tan
Mamba
182
0
0
09 Jun 2025
Cultural Bias Matters: A Cross-Cultural Benchmark Dataset and Sentiment-Enriched Model for Understanding Multimodal Metaphors
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Senqi Yang
Dongyu Zhang
Jing Ren
Ziqi Xu
Xiuzhen Zhang
Yiliao Song
Hongfei Lin
Xiwei Xu
207
3
0
08 Jun 2025
Debiasing Online Preference Learning via Preference Feature Preservation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Dongyoung Kim
Jinsung Yoon
Jinwoo Shin
Jaehyung Kim
210
0
0
06 Jun 2025
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Computer Vision and Pattern Recognition (CVPR), 2025
Yiheng Li
Yang Yang
Zichang Tan
Huan Liu
Weihua Chen
Xu Zhou
Zhen Lei
191
2
0
06 Jun 2025
When can in-context learning generalize out of task distribution?
Chase Goddard
Lindsay M. Smith
Vudtiwat Ngampruetikorn
David J. Schwab
OOD
157
3
0
05 Jun 2025
Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning
Liang Chen
Xueting Han
Li Shen
Jing Bai
Kam-Fai Wong
AAML
314
5
0
04 Jun 2025
DS-VTON: An Enhanced Dual-Scale Coarse-to-Fine Framework for Virtual Try-On
Xianbing Sun
Y. Hong
Jiahui Zhan
Jun Lan
Huijia Zhu
Weiqiang Wang
Liqing Zhang
Jianfu Zhang
DiffM
235
1
0
01 Jun 2025
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection
Wayne Zhang
Changjiang Jiang
Zhonghao Zhang
Chenyang Si
Fengchang Yu
...
Xinbin Yuan
Yifei Bi
Ming Zhao
Zian Zhou
Caifeng Shan
328
8
0
01 Jun 2025
FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models
Xuan Xu
Fufang Wen
Beilin Chu
Zhibing Fu
Qinhong Lin
Jiaqi Liu
Binjie Fei
Zhongliang Yang
Linna Zhou
Yu Li
263
4
0
31 May 2025
MGS3: A Multi-Granularity Self-Supervised Code Search Framework
Knowledge Discovery and Data Mining (KDD), 2025
Rui Li
Junfeng Kang
Qi Liu
Liyang He
Zheng Zhang
Yunhao Sha
Linbo Zhu
Zhenya Huang
178
2
0
30 May 2025
Taming Transformer Without Using Learning Rate Warmup
International Conference on Learning Representations (ICLR), 2025
Xianbiao Qi
Yelin He
Jiaquan Ye
Chun-Guang Li
Bojia Zi
Xili Dai
Qin Zou
Rong Xiao
173
3
0
28 May 2025
Hierarchical Material Recognition from Local Appearance
Matthew Beveridge
Shree K. Nayar
345
3
0
28 May 2025
Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings
Angéline Pouget
Mohammad Yaghini
Stephan Rabanser
Nicolas Papernot
198
1
0
28 May 2025
STACI: Spatio-Temporal Aleatoric Conformal Inference
Brandon Feng
David K. Park
Xihaier Luo
Arantxa Urdangarin
Shinjae Yoo
Brian J. Reich
197
0
0
27 May 2025
How Do Transformers Learn Variable Binding in Symbolic Programs?
Yiwei Wu
Atticus Geiger
Raphaël Millière
NAI
174
7
0
27 May 2025
Frictional Agent Alignment Framework: Slow Down and Don't Break Things
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Abhijnan Nath
Carine Graff
Andrei Bachinin
Nikhil Krishnaswamy
305
4
0
26 May 2025
Advancing Video Self-Supervised Learning via Image Foundation Models
Pattern Recognition Letters (Pattern Recogn. Lett.), 2025
Jingwei Wu
Zhewei Huang
Chang Liu
208
0
0
25 May 2025
Latent Mamba Operator for Partial Differential Equations
Karn Tiwari
Niladri Dutta
N. M. A. Krishnan
P. PrathoshA
Mamba
AI4CE
300
0
0
25 May 2025
What Do You Need for Diverse Trajectory Composition in Diffusion Planning?
Quentin Clark
Florian Shkurti
1.1K
0
0
23 May 2025
High-Fidelity Functional Ultrasound Reconstruction via A Visual Auto-Regressive Framework
Xuhang Chen
Zhuo Li
Yanyan Shen
Mufti Mahmud
Hieu Pham
Chi-Man Pun
Shuqiang Wang
203
3
0
23 May 2025
Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression
Linfeng Qi
Zhaoyang Jia
Jiahao Li
Bin Li
Houqiang Li
Yan Lu
525
7
0
22 May 2025
PaTH Attention: Position Encoding via Accumulating Householder Transformations
Songlin Yang
Yikang Shen
Kaiyue Wen
Shawn Tan
Mayank Mishra
Liliang Ren
Rameswar Panda
Yoon Kim
866
12
0
22 May 2025
DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution
Zheng Chen
Zichen Zou
Kewei Zhang
Xiongfei Su
Xin Yuan
Yong Guo
Yulun Zhang
DiffM
VGen
443
9
0
22 May 2025
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning
Biao Yi
Tiansheng Huang
Baolei Zhang
Tong Li
Lihai Nie
Zheli Liu
Li Shen
MU
AAML
307
5
0
22 May 2025
Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
Thibaud Gloaguen
Mark Vero
Robin Staab
Martin Vechev
AAML
480
0
0
22 May 2025
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
DiffM
320
3
0
22 May 2025
From Generic Empathy to Personalized Emotional Support: A Self-Evolution Framework for User Preference Alignment
Jing Ye
Lu Xiang
Yaping Zhang
Chengqing Zong
247
4
0
22 May 2025
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking
International Conference on Information and Knowledge Management (CIKM), 2024
Songhao Wu
Quan Tu
Mingjie Zhong
Hong Liu
Jia Xu
Jinjie Gu
Rui Yan
319
0
0
20 May 2025
Unify Graph Learning with Text: Unleashing LLM Potentials for Session Search
The Web Conference (WWW), 2024
Songhao Wu
Quan Tu
Hong Liu
Jia Xu
Zhongyi Liu
Guannan Zhang
Ran Wang
Xiuying Chen
Rui Yan
365
9
0
20 May 2025
Flexible-weighted Chamfer Distance: Enhanced Objective Function for Point Cloud Completion
Jie Li
Shengwei Tian
Long Yu
Xin Ning
448
0
0
20 May 2025
Krikri: Advancing Open Large Language Models for Greek
Dimitris Roussis
Leon Voukoutis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
ALM
409
6
0
19 May 2025
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Fitsum Gaim
Hoyun Song
Huije Lee
Changgeon Ko
Eui Jun Hwang
Jong C. Park
221
2
0
17 May 2025
X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models
Valentina Bazyleva
Nicolo Bonettini
Gaurav Bharaj
DiffM
259
2
0
16 May 2025
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis
Weiming Zhang
Lingyue Fu
Qingyao Li
Kounianhua Du
Jianghao Lin
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
176
0
0
14 May 2025
Contactless Cardiac Pulse Monitoring Using Event Cameras
Mohamed Moustafa
Joseph Lemley
Peter Corcoran
199
1
0
14 May 2025
ExEBench: Benchmarking Foundation Models on Extreme Earth Events
Shan Zhao
Zhitong Xiong
Jie Zhao
Xiao Xiang Zhu
229
2
0
13 May 2025
Adaptive Latent-Space Constraints in Personalized Federated Learning
Sana Ayromlou
Fatemeh Tavakoli
D. B. Emerson
FedML
256
0
0
12 May 2025
MedEIR: A Specialized Medical Embedding Model for Enhanced Information Retrieval
Anand Selvadurai
Jasheen Shaik
Girish Chandrasekar
ShriRadhaKrishnan Balamurugan
Eswara Reddy
RALM
72
0
0
12 May 2025
Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion
Timing Li
Bing Cao
Q. Hu
Bin Xiao
Qinghua Hu
218
0
0
11 May 2025
Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping
Jiepan Li
He Huang
Yu Sheng
Xu Tan
Wei He
228
2
0
08 May 2025
Quiet Feature Learning in Algorithmic Tasks
Prudhviraj Naidu
Zixian Wang
Leon Bergen
R. Paturi
VLM
337
0
0
06 May 2025
Variational diffusion transformers for conditional sampling of supernovae spectra
Yunyi Shen
Alexander T. Gagliano
DiffM
200
2
0
05 May 2025
MISE: Meta-knowledge Inheritance for Social Media-Based Stressor Estimation
The Web Conference (WWW), 2025
Xin Wang
Ling Feng
Huijun Zhang
Lei Cao
Kaisheng Zeng
Qi Li
Yang Ding
Yi Dai
David A. Clifton
322
2
0
03 May 2025
GENMO: A GENeralist Model for Human MOtion
Jiefeng Li
Jinkun Cao
Haotian Zhang
Davis Rempe
Jan Kautz
Umar Iqbal
Ye Yuan
DiffM
VGen
301
7
0
02 May 2025
Enhancing Health Mention Classification Performance: A Study on Advancements in Parameter Efficient Tuning
Reem Abdel-Salam
M. Adewunmi
299
0
0
30 Apr 2025
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
Roman Abramov
Felix Steinbauer
Gjergji Kasneci
896
7
0
29 Apr 2025
MERA: Multimodal and Multiscale Self-Explanatory Model with Considerably Reduced Annotation for Lung Nodule Diagnosis
Jiahao Lu
Chong Yin
Silvia Ingala
S. Darkner
M. Nielsen
Kenny Erleben
274
0
0
27 Apr 2025
PCF-Grasp: Converting Point Completion to Geometry Feature to Enhance 6-DoF Grasp
Yaofeng Cheng
Fusheng Zha
Wei Guo
Pengfei Wang
Chao Zeng
Lining Sun
Chenguang Yang
3DPC
306
1
0
22 Apr 2025
HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis
Xuanhua Yin
Dingxin Zhang
Jianhui Yu
Weidong Cai
235
1
0
19 Apr 2025
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Yang Yue
Yulin Wang
Chenxin Tao
Pan Liu
Shiji Song
Gao Huang
MedIm
325
3
0
18 Apr 2025
Previous
1
2
3
4
5
6
...
23
24
25
Next