ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.05101
  4. Cited By
Decoupled Weight Decay Regularization
v1v2v3 (latest)

Decoupled Weight Decay Regularization

14 November 2017
I. Loshchilov
Katharina Eggensperger
    OffRL
ArXiv (abs)PDFHTMLGithub (275★)

Papers citing "Decoupled Weight Decay Regularization"

50 / 1,216 papers shown
Title
Physics-Grounded Differentiable Simulation for Soft Growing Robots
Physics-Grounded Differentiable Simulation for Soft Growing RobotsInternational Conference on Soft Robotics (RoboSoft), 2025
Lucas Chen
Yitian Gao
Sicheng Wang
Francesco Fuentes
Laura H. Blumenschein
Zachary Kingston
270
5
0
29 Jan 2025
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
Hamed Firooz
Maziar Sanjabi
Adrian Englhardt
Aman Gupta
Ben Levine
...
Vignesh Kothapalli
Xiaoling Zhai
Ya Xu
Yu Wang
Yun Dai
ALM
562
12
0
27 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward PassComputer Vision and Pattern Recognition (CVPR), 2025
Jianing Yang
Alexander Sax
Kevin J. Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
733
159
0
23 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
168
6
0
20 Jan 2025
PolyLUT: Ultra-low Latency Polynomial Inference with Hardware-Aware Structured Pruning
PolyLUT: Ultra-low Latency Polynomial Inference with Hardware-Aware Structured PruningIEEE transactions on computers (IEEE Trans. Comput.), 2025
Marta Andronic
Jiawen Li
George A. Constantinides
156
6
0
14 Jan 2025
Optimizing Small Language Models for In-Vehicle Function-Calling
Optimizing Small Language Models for In-Vehicle Function-Calling
Yahya Sowti Khiabani
Farris Atif
Chieh Hsu
Sven Stahlmann
Tobias Michels
Sebastian Kramer
Benedikt Heidrich
M. Saquib Sarfraz
Julian Merten
Faezeh Tafazzoli
141
3
0
04 Jan 2025
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
342
34
0
31 Dec 2024
Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising
Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image DenoisingComputer Vision and Pattern Recognition (CVPR), 2024
Tong Li
Lizhi Wang
Zhiyuan Xu
Lin Zhu
Wanxuan Lu
Hua Huang
386
3
0
21 Dec 2024
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data
Zhiqiang Tang
Zihan Zhong
Tong He
Gerald Friedland
371
4
0
19 Dec 2024
HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages
HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel LanguagesInformation Security Conference (IS), 2024
Aman Chaturvedi
Daniel Nichols
Siddharth Singh
A. Bhatele
271
8
0
19 Dec 2024
Jet: A Modern Transformer-Based Normalizing Flow
Jet: A Modern Transformer-Based Normalizing Flow
Alexander Kolesnikov
André Susano Pinto
Michael Tschannen
226
10
0
19 Dec 2024
GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for
  Efficient Multi-Frame Interpolation in DSA Images
GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA ImagesAAAI Conference on Artificial Intelligence (AAAI), 2024
Ziyang Xu
Huangxuan Zhao
Wen Liu
Xinyu Wang
279
1
0
18 Dec 2024
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic
  Approach
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic ApproachInternational Conference on Computational Linguistics (COLING), 2024
Daiki Shirafuji
Makoto Takenaka
Shinya Taguchi
LLMAG
248
11
0
16 Dec 2024
Learning Implicit Features with Flow Infused Attention for Realistic
  Virtual Try-On
Learning Implicit Features with Flow Infused Attention for Realistic Virtual Try-On
Delong Zhang
Qiwei Huang
Yuanliu Liu
Yang Sun
Wei-Shi Zheng
Pengfei Xiong
Wei Zhang
3DH
262
1
0
16 Dec 2024
Bayesian Flow Is All You Need to Sample Out-of-Distribution Chemical Spaces
Bayesian Flow Is All You Need to Sample Out-of-Distribution Chemical Spaces
Nianze Tao
OODOODDBDL
620
0
0
16 Dec 2024
APAR: Modeling Irregular Target Functions in Tabular Regression via
  Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-Tuning
APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-TuningAAAI Conference on Artificial Intelligence (AAAI), 2024
Hong-Wei Wu
Wei Wang
Kuang-Da Wang
Chao-Han Huck Yang
LMTD
374
1
0
14 Dec 2024
Exploring Grokking: Experimental and Mechanistic Investigations
Exploring Grokking: Experimental and Mechanistic Investigations
Hu Qiye
Zhou Hao
Yu RuoXi
357
1
0
14 Dec 2024
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng
Jing Wang
Fuwei Zhao
Xujie Zhang
Xiaodan Liang
VGenDiffM
315
2
0
13 Dec 2024
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern GreekInternational Conference on Computational Linguistics (COLING), 2024
Lefteris Loukas
Nikolaos Smyrnioudis
Chrysa Dikonomaki
Spyros Barbakos
Anastasios Toumazatos
...
Manolis Kyriakakis
Mary Georgiou
Stavros Vassos
John Pavlopoulos
Ion Androutsopoulos
AILaw
267
2
0
11 Dec 2024
SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse
  Scenarios Handling Emotional Support Agent
SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support AgentInternational Conference on Computational Linguistics (COLING), 2024
Jing Ye
Lu Xiang
Yaping Zhang
Chengqing Zong
422
19
0
11 Dec 2024
FlashSloth: Lightning Multimodal Large Language Models via Embedded
  Visual Compression
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual CompressionComputer Vision and Pattern Recognition (CVPR), 2024
Bo Tong
Bokai Lai
Weihao Ye
Gen Luo
Chunjiang Ge
Ke Li
Xiaoshuai Sun
Rongrong Ji
VLMMLLM
225
4
0
05 Dec 2024
Reinforcement Learning from Wild Animal Videos
Reinforcement Learning from Wild Animal Videos
Elliot Chane-Sane
Constant Roux
O. Stasse
Nicolas Mansard
925
1
0
05 Dec 2024
Unified Framework for Open-World Compositional Zero-shot Learning
Unified Framework for Open-World Compositional Zero-shot LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Hirunima Jayasekara
Khoi Pham
Nirat Saini
Abhinav Shrivastava
266
1
0
05 Dec 2024
GuARD: Effective Anomaly Detection through a Text-Rich and Graph-Informed Language Model
GuARD: Effective Anomaly Detection through a Text-Rich and Graph-Informed Language Model
Yunhe Pang
Bo Chen
Fanjin Zhang
Yanghui Rao
Jie Tang
Jie Tang
292
0
0
05 Dec 2024
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion
  Prior
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
Li-Yuan Tsao
Hao-Wei Chen
Hao-Wei Chung
Deqing Sun
Chun-Yi Lee
Kelvin Chan
Ming-Hsuan Yang
DiffM
215
7
0
27 Nov 2024
Cautious Optimizers: Improving Training with One Line of Code
Cautious Optimizers: Improving Training with One Line of Code
Kaizhao Liang
Lizhang Chen
B. Liu
Qiang Liu
ODL
686
20
0
25 Nov 2024
RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks
RECAST: Reparameterized, Compact weight Adaptation for Sequential TasksInternational Conference on Learning Representations (ICLR), 2024
Nazia Tasnim
Bryan A. Plummer
CLLOffRL
444
0
0
25 Nov 2024
Beyond adaptive gradient: Fast-Controlled Minibatch Algorithm for
  large-scale optimization
Beyond adaptive gradient: Fast-Controlled Minibatch Algorithm for large-scale optimization
Corrado Coppola
Lorenzo Papa
Irene Amerini
L. Palagi
ODL
384
0
0
24 Nov 2024
Financial Risk Assessment via Long-term Payment Behavior Sequence
  Folding
Financial Risk Assessment via Long-term Payment Behavior Sequence FoldingIndustrial Conference on Data Mining (IDM), 2024
Yiran Qiao
Yateng Tang
Xiang Ao
Qi Yuan
Ziming Liu
Chen Shen
Xuehao Zheng
218
0
0
22 Nov 2024
Entropy Bootstrapping for Weakly Supervised Nuclei Detection
Entropy Bootstrapping for Weakly Supervised Nuclei Detection
James Willoughby
Irina Voiculescu
UQCV
224
0
0
20 Nov 2024
A Theory for Compressibility of Graph Transformers for Transductive
  Learning
A Theory for Compressibility of Graph Transformers for Transductive Learning
Hamed Shirzad
Honghao Lin
A. Velingker
B. Venkatachalam
David P. Woodruff
Danica J. Sutherland
291
2
0
20 Nov 2024
FitDiT: Advancing the Authentic Garment Details for High-fidelity
  Virtual Try-on
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Qu He
C. Xu
Jinlong Peng
Jing Zhang
Chengjie Wang
Yunsheng Wu
Yanwei Fu
DiffM
263
25
0
15 Nov 2024
Pay Attention to the Keys: Visual Piano Transcription Using Transformers
Pay Attention to the Keys: Visual Piano Transcription Using TransformersInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Uros Zivanovic
Ivan Pilkov
Carlos Eduardo Cancino-Chacón
ViT
160
0
0
13 Nov 2024
MEANT: Multimodal Encoder for Antecedent Information
MEANT: Multimodal Encoder for Antecedent InformationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Benjamin Iyoya Irving
Annika Marie Schoene
AIFin
172
0
1
10 Nov 2024
Multi-View Majority Vote Learning Algorithms: Direct Minimization of PAC-Bayesian Bounds
Multi-View Majority Vote Learning Algorithms: Direct Minimization of PAC-Bayesian Bounds
Mehdi Hennequin
Abdelkrim Zitouni
K. Benabdeslem
H. Elghazel
Yacine Gaci
301
1
0
09 Nov 2024
Few-Shot Task Learning through Inverse Generative Modeling
Few-Shot Task Learning through Inverse Generative ModelingNeural Information Processing Systems (NeurIPS), 2024
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
472
4
0
07 Nov 2024
Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual
  Visual Answer Localization
Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual Visual Answer Localization
Zhibin Wen
Bin Li
175
3
0
05 Nov 2024
Expanding Sparse Tuning for Low Memory Usage
Expanding Sparse Tuning for Low Memory UsageNeural Information Processing Systems (NeurIPS), 2024
Shufan Shen
Junshu Sun
Xiangyang Ji
Qingming Huang
Shuhui Wang
317
9
0
04 Nov 2024
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model
  with Frozen LLM
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Xiong Wang
Yangze Li
Chaoyou Fu
Chunjiang Ge
Lei Xie
Ke Li
Xing Sun
Long Ma
AuLLMMLLM
403
99
0
01 Nov 2024
Joint Extraction and Classification of Danish Competences for Job
  Matching
Joint Extraction and Classification of Danish Competences for Job MatchingEuropean Conference on Information Retrieval (ECIR), 2024
Qiuchi Li
Christina Lioma
128
0
0
29 Oct 2024
USpeech: Ultrasound-Enhanced Speech with Minimal Human Effort via Cross-Modal Synthesis
USpeech: Ultrasound-Enhanced Speech with Minimal Human Effort via Cross-Modal SynthesisProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2024
Luca Jiang-Tao Yu
Running Zhao
Sijie Ji
Edith C.H. Ngai
Chenshu Wu
186
3
0
29 Oct 2024
Super-resolution in disordered media using neural networks
Super-resolution in disordered media using neural networks
Alexander Christie
Matan Leibovich
Miguel Moscoso
A. Novikov
George Papanicolaou
C. Tsogka
192
0
0
28 Oct 2024
Mixture of Parrots: Experts improve memorization more than reasoning
Mixture of Parrots: Experts improve memorization more than reasoningInternational Conference on Learning Representations (ICLR), 2024
Samy Jelassi
Clara Mohri
David Brandfonbrener
Alex Gu
Nikhil Vyas
Nikhil Anand
David Alvarez-Melis
Yuanzhi Li
Sham Kakade
Eran Malach
MoE
341
14
0
24 Oct 2024
Lightweight Neural App Control
Lightweight Neural App ControlInternational Conference on Learning Representations (ICLR), 2024
Filippos Christianos
Georgios Papoudakis
Thomas Coste
Jianye Hao
Jun Wang
Youssef Attia El Hili
LM&Ro
239
9
0
23 Oct 2024
Publishing Neural Networks in Drug Discovery Might Compromise Training
  Data Privacy
Publishing Neural Networks in Drug Discovery Might Compromise Training Data PrivacyJournal of Cheminformatics (J Cheminform), 2024
Fabian P. Krüger
Johan Östman
Lewis H. Mervin
Igor V. Tetko
Ola Engkvist
223
5
0
22 Oct 2024
Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding
Joint Top-Down and Bottom-Up Frameworks for 3D Visual GroundingInternational Conference on Pattern Recognition (ICPR), 2024
Yang Liu
Daizong Liu
Wei Hu
3DPC
366
9
0
21 Oct 2024
Catastrophic Failure of LLM Unlearning via Quantization
Catastrophic Failure of LLM Unlearning via QuantizationInternational Conference on Learning Representations (ICLR), 2024
Zhiwei Zhang
Fali Wang
Xiaomin Li
Zongyu Wu
Xianfeng Tang
Hui Liu
Qi He
Wenpeng Yin
Suhang Wang
MU
294
5
0
21 Oct 2024
Non-invasive Neural Decoding in Source Reconstructed Brain Space
Non-invasive Neural Decoding in Source Reconstructed Brain Space
Yonatan Gideoni
Ryan Charles Timms
Oiwi Parker Jones
210
3
0
20 Oct 2024
Physically Guided Deep Unsupervised Inversion for 1D Magnetotelluric Models
Physically Guided Deep Unsupervised Inversion for 1D Magnetotelluric ModelsIEEE Geoscience and Remote Sensing Letters (GRSL), 2024
Paul Goyes-Peñafiel
Umair bin Waheed
Henry Arguello
114
1
0
20 Oct 2024
Cliqueformer: Model-Based Optimization with Structured Transformers
Cliqueformer: Model-Based Optimization with Structured Transformers
J. Kuba
Pieter Abbeel
Sergey Levine
OffRLAI4CE
447
4
0
17 Oct 2024
Previous
123456...232425
Next