1711.05101
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Frank Hutter
OffRL
Github (275★)
Papers citing "Decoupled Weight Decay Regularization"
50 / 1,216 papers shown
A mean teacher algorithm for unlearning of language models
Yegor Klochkov
MU
648
0
0
18 Apr 2025
NNTile: a machine learning framework capable of training extremely large GPT language models on a single node
A. Mikhalev
Aleksandr Katrutsa
Konstantin Sozykin
Ivan Oseledets
165
0
0
17 Apr 2025
Multi-Object Grounding via Hierarchical Contrastive Siamese Transformers
Chengyi Du
Keyan Jin
235
0
0
14 Apr 2025
Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware
Muhammad Fasih Tariq
Muhammad Azeem Javed
ObjD
277
4
0
14 Apr 2025
Decoupled Diffusion Sparks Adaptive Scene Generation
Yunsong Zhou
Naisheng Ye
William Ljungbergh
Tianyu Li
Jiazhi Yang
Zetong Yang
Hongzi Zhu
Christoffer Petersson
Hongyang Li
262
9
0
14 Apr 2025
Towards Quantifying Commonsense Reasoning with Mechanistic Insights
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Abhinav Joshi
A. Ahmad
Divyaksh Shukla
Ashutosh Modi
ReLM
LRM
252
4
0
14 Apr 2025
MatterTune: An Integrated, User-Friendly Platform for Fine-Tuning Atomistic Foundation Models to Accelerate Materials Simulation and Discovery
Digital Discovery (DD), 2025
Lingyu Kong
Nima Shoghi
Guoxiang Hu
Pan Li
Victor Fung
255
4
0
14 Apr 2025
A Model Zoo of Vision Transformers
Damian Falk
Léo Meynent
Florence Pfammatter
Konstantin Schurholt
Damian Borth
504
2
0
14 Apr 2025
Gradient as Conditions: Rethinking HOG for All-in-one Image Restoration
Jiawei Wu
Zhifei Yang
Zihan Wang
Zhi Jin
314
1
0
12 Apr 2025
SD²: Self-Distilled Sparse Drafters
Mike Lasby
Nish Sinnadurai
Valavan Manohararajah
Sean Lie
Yani Andrew Ioannou
Vithursan Thangarasa
786
1
0
10 Apr 2025
Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment
Computer Vision and Pattern Recognition (CVPR), 2025
Fatemeh Behrad
Tinne Tuytelaars
Johan Wagemans
ViT
304
3
0
03 Apr 2025
NeuraLUT-Assemble: Hardware-aware Assembling of Sub-Neural Networks for Efficient LUT Inference
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2025
Marta Andronic
George A. Constantinides
326
7
0
01 Apr 2025
FIESTA: Fisher Information-based Efficient Selective Test-time Adaptation
Mohammadmahdi Honarmand
O. Mutlu
Parnian Azizian
Saimourya Surabhi
Dennis Paul Wall
TTA
266
0
0
29 Mar 2025
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving
Fuhao Li
Huan Jin
Bin-Bin Gao
Liaoyuan Fan
Lihui Jiang
Long Zeng
372
8
0
28 Mar 2025
SChanger: Change Detection from a Semantic Change and Spatial Consistency Perspective
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE J-STARS), 2025
Ziyu Zhou
Keyan Hu
Yutian Fang
Xiaoping Rui
405
4
0
26 Mar 2025
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
Computer Vision and Pattern Recognition (CVPR), 2025
Yaojie Lu
Qichao Wang
H. Cao
Xierui Wang
Xiaoyin Xu
Min Zhang
329
8
0
24 Mar 2025
LeanStereo: A Leaner Backbone based Stereo Network
IEEE International Joint Conference on Neural Networks (IJCNN), 2023
Rafia Rahim
Samuel Woerz
A. Zell
3DV
329
2
0
24 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Image Classification Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
444
0
0
21 Mar 2025
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
Yan Zhang
Yao Feng
Alpár Cseke
Nitin Saini
Nathan Bajandas
Nicolas Heron
M. Black
DiffM
VGen
359
5
0
21 Mar 2025
Classification of User Reports for Detection of Faulty Computer Components using NLP Models: A Case Study
Maria de Lourdes M. Silva
André L. C. Mendonça
Eduardo R. D. Neto
Iago C. Chaves
Felipe T. Brito
V. A. E. Farias
Javam C. Machado
128
1
0
20 Mar 2025
Learn Your Scales: Towards Scale-Consistent Generative Novel View Synthesis
Fereshteh Forghani
Jason J. Yu
Tristan Aumentado-Armstrong
Konstantinos G. Derpanis
Marcus A. Brubaker
DiffM
332
0
0
19 Mar 2025
VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning
Y. Tan
Chen Liu
Jingyuan Gao
Banghao Wu
Mingchen Li
...
Lingrong Zhang
Huiqun Yu
Guisheng Fan
Liang Hong
Bingxin Zhou
200
3
0
19 Mar 2025
Quantum EigenGame for excited state calculation
David Quiroga
Jason Han
Anastasios Kyrillidis
280
5
0
17 Mar 2025
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors
Katja Schwarz
Norman Mueller
Peter Kontschieder
3DGS
301
11
0
17 Mar 2025
L2HCount: Generalizing Crowd Counting from Low to High Crowd Density via Density Simulation
Guoliang Xu
Jianqin Yin
Ren Zhang
Yonghao Dang
Feng Zhou
Bo Yu
257
0
0
17 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
364
5
0
14 Mar 2025
Text Compression for Efficient Language Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
David Gu
Peter Belcak
Roger Wattenhofer
241
1
0
14 Mar 2025
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
Computer Vision and Pattern Recognition (CVPR), 2025
Md. Mohaiminul Islam
Tushar Nagarajan
Huiyu Wang
Gedas Bertasius
Lorenzo Torresani
1.0K
10
0
12 Mar 2025
The R2D2 Deep Neural Network Series for Scalable Non-Cartesian Magnetic Resonance Imaging
Yiwei Chen
Amir Aghabiglou
Shijie Chen
Motahare Torki
Chao Tang
Ruud B. van Heeswijk
Yves Wiaux
193
0
0
12 Mar 2025
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos
Chen-Da Liu-Zhang
Lin Sui
Shuming Liu
Fangzhou Mu
Ziyi Wang
Bernard Ghanem
313
3
0
09 Mar 2025
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters
Jiaming Liu
Linghe Kong
Guihai Chen
325
2
0
08 Mar 2025
PointsToWood: A deep learning framework for complete canopy leaf-wood segmentation of TLS data across diverse European forests
H. Owen
Matthew J. Allen
S. Grieve
Phill Wilkes
E. Lines
3DPC
119
1
0
06 Mar 2025
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Siyang Song
Mohammed Irfan Kurpath
Sahal Shaji Mullappilly
Jean Lahoud
Fahad A Khan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
AuLLM
656
5
0
06 Mar 2025
Lead Instrument Detection from Multitrack Music
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Longshen Ou
Yu Takahashi
Ye Wang
188
0
0
05 Mar 2025
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Computer Vision and Pattern Recognition (CVPR), 2025
Xuanchi Ren
Tianchang Shen
Jiahui Huang
Huan Ling
Yifan Lu
Merlin Nimier-David
Thomas Muller
Alexander Keller
Sanja Fidler
Jun Gao
DiffM
VGen
322
125
0
05 Mar 2025
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
Gokul Swamy
Sanjiban Choudhury
Wen Sun
Zhiwei Steven Wu
J. Andrew Bagnell
OffRL
438
45
0
03 Mar 2025
Efficiently Editing Mixture-of-Experts Models with Compressed Experts
Yexiao He
Yang Liu
Chen Liang
Hany Awadalla
MoE
316
3
0
01 Mar 2025
DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model
Lei Zhao
Sizhou Chen
Linfeng Feng
Ju Liu
Xuelong Li
Fangqiu Yi
DiffM
MDE
415
4
0
26 Feb 2025
Reference-Aligned Retrieval-Augmented Question Answering over Heterogeneous Proprietary Documents
Nayoung Choi
Grace Byun
Andrew Chung
Ellie S. Paek
S. Lee
Jinho D. Choi
RALM
771
1
0
26 Feb 2025
FLINT: Learning-based Flow Estimation and Temporal Interpolation for Scientific Ensemble Visualization
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Hamid Gadirov
Jos B. T. M. Roerdink
Steffen Frey
AI4CE
271
3
0
24 Feb 2025
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification
Vasilii Feofanov
Songkang Wen
Marius Alonso
Romain Ilbert
Hongbo Guo
Malik Tiomoko
Lujia Pan
Jianfeng Zhang
I. Redko
AI4TS
VLM
315
14
0
24 Feb 2025
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis
Yinghao Aaron Li
Rithesh Kumar
Zeyu Jin
DiffM
358
0
0
21 Feb 2025
MoM: Linear Sequence Modeling with Mixture-of-Memories
Jusen Du
Weigao Sun
Disen Lan
Jiaxi Hu
Yu Cheng
KELM
553
15
0
19 Feb 2025
ALGEN: Few-shot Inversion Attacks on Textual Embeddings using Alignment and Generation
Yiyi Chen
Qiongkai Xu
Johannes Bjerva
403
4
0
16 Feb 2025
Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Palaash Goel
Dushyant Singh Chauhan
Md. Shad Akhtar
LRM
300
1
0
11 Feb 2025
MatSwap: Light-aware material transfers in images
Ivan Lopes
Valentin Deschaintre
Yannick Hold-Geoffroy
Raoul de Charette
DiffM
508
3
0
11 Feb 2025
AppVLM: A Lightweight Vision Language Model for Online App Control
Georgios Papoudakis
Thomas Coste
Zhihao Wu
Jianye Hao
Jun Wang
Youssef Attia El Hili
299
13
0
10 Feb 2025
deCIFer: Crystal Structure Prediction from Powder Diffraction Data using Autoregressive Language Models
Frederik L. Johansen
Ulrik Friis-Jensen
Erik B. Dam
Kirsten M. Ø. Jensen
Rocío Mercado
Raghavendra Selvan
764
6
0
04 Feb 2025
CoddLLM: Empowering Large Language Models for Data Analytics
Jiani Zhang
Hengrui Zhang
Rishav Chakravarti
Yiqun Hu
Patrick Ng
Asterios Katsifodimos
Huzefa Rangwala
George Karypis
Alon Halevy
SyDa
ELM
890
5
0
01 Feb 2025
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Yun Wang
Tiansheng Huang
Li Shen
Huanjin Yao
Haotian Luo
Rui Liu
Naiqiang Tan
Jiaxing Huang
Dacheng Tao
AAML
MoMe
CLL
410
12
0
30 Jan 2025