ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.05101
  4. Cited By
Decoupled Weight Decay Regularization
v1v2v3 (latest)

Decoupled Weight Decay Regularization

14 November 2017
I. Loshchilov
Katharina Eggensperger
    OffRL
ArXiv (abs)PDFHTMLGithub (275★)

Papers citing "Decoupled Weight Decay Regularization"

50 / 1,216 papers shown
ChainNet: Structured Metaphor and Metonymy in WordNet
ChainNet: Structured Metaphor and Metonymy in WordNet
Rowan Hall Maudslay
Simone Teufel
Francis Bond
James Pustejovsky
NAIAI4CE
125
1
0
29 Mar 2024
Noise-Robust Keyword Spotting through Self-supervised Pretraining
Noise-Robust Keyword Spotting through Self-supervised Pretraining
Jacob Mork
H. S. Bovbjerg
Gergely Kiss
Zheng-Hua Tan
220
6
0
27 Mar 2024
DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via
  Structural Word Alignment
DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment
Haitao Li
Jiaxin Mao
Xinyan Han
Jia Chen
Qian Dong
Yiqun Liu
Chong Chen
Qi Tian
AILaw
181
11
0
27 Mar 2024
BLADE: Enhancing Black-box Large Language Models with Small
  Domain-Specific Models
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Haitao Li
Jiaxin Mao
Jia Chen
Qian Dong
Zhijing Wu
Yiqun Liu
Chong Chen
Qi Tian
AILaw
237
24
0
27 Mar 2024
MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD
  Mapping
MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD MappingEuropean Conference on Computer Vision (ECCV), 2024
Jiacheng Chen
Yuefan Wu
Jiaqi Tan
Hang Ma
Yasutaka Furukawa
272
54
0
23 Mar 2024
M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling
M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling
Xudong Sun
Nutan Chen
Alexej Gossmann
Yu Xing
Carla Feistner
...
Felix Drost
Daniele Scarcella
Lisa Beer
Carsten Marr
Carsten Marr
393
1
0
20 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLMCLIP
292
10
0
19 Mar 2024
FinLlama: Financial Sentiment Classification for Algorithmic Trading
  Applications
FinLlama: Financial Sentiment Classification for Algorithmic Trading Applications
Thanos Konstantinidis
Giorgos Iacovides
Mingxue Xu
T. Constantinides
Danilo Mandic
AIFin
183
22
0
18 Mar 2024
A Versatile Framework for Multi-scene Person Re-identification
A Versatile Framework for Multi-scene Person Re-identification
Wei-Shi Zheng
Junkai Yan
Yi-Xing Peng
VLM
327
17
0
17 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias
  in Factual Knowledge Extraction
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge ExtractionInternational Conference on Language Resources and Evaluation (LREC), 2024
Ziyang Xu
Keqin Peng
Liang Ding
Dacheng Tao
Xiliang Lu
236
19
0
15 Mar 2024
Single Domain Generalization for Crowd Counting
Single Domain Generalization for Crowd CountingComputer Vision and Pattern Recognition (CVPR), 2024
Zhuoxuan Peng
S.-H. Gary Chan
262
27
0
14 Mar 2024
PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF
  Priors
PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF PriorsEuropean Conference on Computer Vision (ECCV), 2024
Tianyuan Yuan
Yucheng Mao
Jiawei Yang
Yicheng Liu
Yue Wang
Hang Zhao
283
21
0
14 Mar 2024
Identity-aware Dual-constraint Network for Cloth-Changing Person
  Re-identification
Identity-aware Dual-constraint Network for Cloth-Changing Person Re-identification
Peini Guo
Mengyuan Liu
Hong Liu
Ruijia Fan
Guoquan Wang
Bin He
422
1
0
13 Mar 2024
Fine-tuning Large Language Models with Sequential Instructions
Fine-tuning Large Language Models with Sequential InstructionsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hanxu Hu
Simon Yu
Pinzhen Chen
Edoardo Ponti
ALMLRM
413
21
0
12 Mar 2024
generAItor: Tree-in-the-Loop Text Generation for Language Model
  Explainability and Adaptation
generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation
Thilo Spinner
Rebecca Kehlbeck
Rita Sevastjanova
Tobias Stähle
Daniel A. Keim
Oliver Deussen
Mennatallah El-Assady
275
5
0
12 Mar 2024
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object
  Localization Perspective
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization PerspectiveComputer Vision and Pattern Recognition (CVPR), 2024
Shunsuke Yasuki
Masato Taki
AAML
320
5
0
11 Mar 2024
Probabilistic Neural Circuits
Probabilistic Neural CircuitsAAAI Conference on Artificial Intelligence (AAAI), 2024
Pedro Zuidberg Dos Martires
TPM
159
7
0
10 Mar 2024
Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion
  Estimation
Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation
Paweł Antoni Pierzchlewicz
Caio da Silva
R. J. Cotton
Fabian H. Sinz
237
2
0
10 Mar 2024
Calibrating Large Language Models Using Their Generations Only
Calibrating Large Language Models Using Their Generations OnlyAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Dennis Ulmer
Martin Gubri
Hwaran Lee
Sangdoo Yun
Seong Joon Oh
UQLM
782
54
1
09 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
XPSR: Cross-modal Priors for Diffusion-based Image Super-ResolutionEuropean Conference on Computer Vision (ECCV), 2024
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
262
33
0
08 Mar 2024
The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Random Access in Cellular IoT
The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Random Access in Cellular IoTIEEE Internet of Things Journal (IEEE IoT J.), 2024
Xin Zhu
Ahmet Enis Cetin
204
0
0
08 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
2.4K
2,755
0
05 Mar 2024
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context
  Misinformation Detection
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
Peng Qi
Zehong Yan
Wynne Hsu
Yang Deng
MLLM
347
88
0
05 Mar 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language
  Models
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi
Alona Fyshe
254
3
0
04 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable
  Virtual Try-on
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
306
132
0
04 Mar 2024
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable
  Functions
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions
Marta Andronic
George A. Constantinides
228
18
0
29 Feb 2024
PCDepth: Pattern-based Complementary Learning for Monocular Depth
  Estimation by Best of Both Worlds
PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds
Haotian Liu
Sanqing Qu
Fan Lu
Zongtao Bu
Florian Roehrbein
Alois Knoll
Guang Chen
MDE
207
4
0
29 Feb 2024
Exploring Data-Efficient Adaptation of Large Language Models for Code Generation
Exploring Data-Efficient Adaptation of Large Language Models for Code Generation
Xue Jiang
Yihong Dong
Zhi Jin
Ge Li
Wenpin Jiao
Ge Li
VLM
363
6
0
29 Feb 2024
ConvDTW-ACS: Audio Segmentation for Track Type Detection During Car
  Manufacturing
ConvDTW-ACS: Audio Segmentation for Track Type Detection During Car Manufacturing
Álvaro López-Chilet
Zhaoyi Liu
Jon Ander Gómez
Carlos Alvarez
Marivi Alonso Ortiz
Andres Orejuela Mesa
David Newton
Friedrich Wolf-Monheim
Sam Michiels
Danny Hughes
183
2
0
28 Feb 2024
Where Do We Go from Here? Multi-scale Allocentric Relational Inference
  from Natural Spatial Descriptions
Where Do We Go from Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions
Tzuf Paz-Argaman
Sayali Kulkarni
John Palowitch
Jason Baldridge
Reut Tsarfaty
158
4
0
26 Feb 2024
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Xiangdi Meng
Damai Dai
Weiyao Luo
Zhe Yang
Shaoxiang Wu
Xiaochen Wang
Peiyi Wang
Qingxiu Dong
Liang Chen
Zhifang Sui
276
18
0
25 Feb 2024
Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation
  Learning of Vision-based Autonomous Driving
Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving
Yichen Xie
Hongge Chen
Gregory P. Meyer
Yong Jae Lee
Eric M. Wolff
Masayoshi Tomizuka
Wei Zhan
Yuning Chai
Xin Huang
3DPC
198
2
0
23 Feb 2024
Hands-Free VR
Hands-Free VR
J. Fernandez
Jae Joong Lee
Santiago Andrés Serrano Vacca
Alejandra Magana
Bedrich Benes
V. Popescu
131
2
0
23 Feb 2024
Do Efficient Transformers Really Save Computation?
Do Efficient Transformers Really Save Computation?
Kai-Bo Yang
Jan Ackermann
Zhenyu He
Guhao Feng
Bohang Zhang
Yunzhen Feng
Qiwei Ye
Di He
Liwei Wang
259
28
0
21 Feb 2024
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models
  with Entity-based Data Augmentation
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
Ikuya Yamada
Ryokan Ri
KELM
282
3
0
18 Feb 2024
Efficient Language Adaptive Pre-training: Extending State-of-the-Art
  Large Language Models for Polish
Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish
Szymon Ruciñski
233
5
0
15 Feb 2024
Can LLMs Learn New Concepts Incrementally without Forgetting?
Can LLMs Learn New Concepts Incrementally without Forgetting?
Junhao Zheng
Shengjie Qiu
Qianli Ma
CLL
268
0
0
13 Feb 2024
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math
  Languages
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages
Johnathan Mercer
104
0
0
12 Feb 2024
KVQ: Kwai Video Quality Assessment for Short-form Videos
KVQ: Kwai Video Quality Assessment for Short-form VideosComputer Vision and Pattern Recognition (CVPR), 2024
Yiting Lu
Xin Li
Yajing Pei
Kun Yuan
Qizhi Xie
Yunpeng Qu
Ming Sun
Chao Zhou
Zhibo Chen
297
43
0
11 Feb 2024
Pushing Boundaries: Mixup's Influence on Neural Collapse
Pushing Boundaries: Mixup's Influence on Neural Collapse
Quinn Fisher
Haoming Meng
Vardan Papyan
AAMLUQCV
209
7
0
09 Feb 2024
Mesoscale Traffic Forecasting for Real-Time Bottleneck and Shockwave
  Prediction
Mesoscale Traffic Forecasting for Real-Time Bottleneck and Shockwave Prediction
Raphael Chekroun
Han Wang
Jonathan W. Lee
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
M. D. Monache
169
0
0
08 Feb 2024
Question Aware Vision Transformer for Multimodal Reasoning
Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz
Yair Kittenplon
Aviad Aberdam
Elad Ben Avraham
Oren Nuriel
Shai Mazor
Ron Litman
299
36
0
08 Feb 2024
Improved Generalization of Weight Space Networks via Augmentations
Improved Generalization of Weight Space Networks via Augmentations
Aviv Shamsian
Aviv Navon
David W. Zhang
Yan Zhang
Ethan Fetaya
Gal Chechik
Haggai Maron
330
18
0
06 Feb 2024
Sentiment-enhanced Graph-based Sarcasm Explanation in Dialogue
Sentiment-enhanced Graph-based Sarcasm Explanation in DialogueIEEE transactions on multimedia (IEEE TMM), 2024
Kun Ouyang
Liqiang Jing
Xuemeng Song
Meng Liu
Yupeng Hu
Liqiang Nie
449
7
0
06 Feb 2024
Deal, or no deal (or who knows)? Forecasting Uncertainty in
  Conversations using Large Language Models
Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Anthony Sicilia
Hyunwoo J. Kim
Khyathi Chandu
Malihe Alikhani
Jack Hessel
169
3
0
05 Feb 2024
Stable and Robust Deep Learning By Hyperbolic Tangent Exponential Linear
  Unit (TeLU)
Stable and Robust Deep Learning By Hyperbolic Tangent Exponential Linear Unit (TeLU)
Alfredo Fernandez
Ankur Mali
121
3
0
05 Feb 2024
Zero-shot sketch-based remote sensing image retrieval based on
  multi-level and attention-guided tokenization
Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization
Bo Yang
Chen Wang
Xiaoshuang Ma
Beiping Song
Zhuang Liu
Fangde Sun
278
6
0
03 Feb 2024
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing
  Activation Density in Transformers
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
Bharat Runwal
Tejaswini Pedapati
Pin-Yu Chen
MoE
412
8
0
02 Feb 2024
Sample, estimate, aggregate: A recipe for causal discovery foundation models
Sample, estimate, aggregate: A recipe for causal discovery foundation models
Menghua Wu
Yujia Bao
Regina Barzilay
Tommi Jaakkola
CML
488
8
0
02 Feb 2024
Development and Adaptation of Robotic Vision in the Real-World: the Challenge of Door Detection
Development and Adaptation of Robotic Vision in the Real-World: the Challenge of Door DetectionJournal of Field Robotics (JFR), 2024
Michele Antonazzi
Matteo Luperto
N. A. Borghese
Nicola Basilico
279
2
0
31 Jan 2024
Previous
123...8910...232425
Next
Page 9 of 25
Pageof 25