1711.05101
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Frank Hutter
OffRL
Papers citing "Decoupled Weight Decay Regularization"
50 / 1,216 papers shown
Title
Physics-Grounded Differentiable Simulation for Soft Growing Robots
International Conference on Soft Robotics (RoboSoft), 2025
Lucas Chen
Yitian Gao
Sicheng Wang
Francesco Fuentes
Laura H. Blumenschein
Zachary Kingston
270
5
0
29 Jan 2025
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
Hamed Firooz
Maziar Sanjabi
Adrian Englhardt
Aman Gupta
Ben Levine
...
Vignesh Kothapalli
Xiaoling Zhai
Ya Xu
Yu Wang
Yun Dai
ALM
562
12
0
27 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Computer Vision and Pattern Recognition (CVPR), 2025
Jianing Yang
Alexander Sax
Kevin J. Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
733
159
0
23 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
168
6
0
20 Jan 2025
PolyLUT: Ultra-low Latency Polynomial Inference with Hardware-Aware Structured Pruning
IEEE Transactions on Computers (IEEE Trans. Comput.), 2025
Marta Andronic
Jiawen Li
George A. Constantinides
156
6
0
14 Jan 2025
Optimizing Small Language Models for In-Vehicle Function-Calling
Yahya Sowti Khiabani
Farris Atif
Chieh Hsu
Sven Stahlmann
Tobias Michels
Sebastian Kramer
Benedikt Heidrich
M. Saquib Sarfraz
Julian Merten
Faezeh Tafazzoli
141
3
0
04 Jan 2025
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
342
34
0
31 Dec 2024
Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising
Computer Vision and Pattern Recognition (CVPR), 2024
Tong Li
Lizhi Wang
Zhiyuan Xu
Lin Zhu
Wanxuan Lu
Hua Huang
386
3
0
21 Dec 2024
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data
Zhiqiang Tang
Zihan Zhong
Tong He
Gerald Friedland
371
4
0
19 Dec 2024
HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages
Information Security Conference (IS), 2024
Aman Chaturvedi
Daniel Nichols
Siddharth Singh
A. Bhatele
271
8
0
19 Dec 2024
Jet: A Modern Transformer-Based Normalizing Flow
Alexander Kolesnikov
André Susano Pinto
Michael Tschannen
226
10
0
19 Dec 2024
GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images
AAAI Conference on Artificial Intelligence (AAAI), 2024
Ziyang Xu
Huangxuan Zhao
Wen Liu
Xinyu Wang
279
1
0
18 Dec 2024
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach
International Conference on Computational Linguistics (COLING), 2024
Daiki Shirafuji
Makoto Takenaka
Shinya Taguchi
LLMAG
248
11
0
16 Dec 2024
Learning Implicit Features with Flow Infused Attention for Realistic Virtual Try-On
Delong Zhang
Qiwei Huang
Yuanliu Liu
Yang Sun
Wei-Shi Zheng
Pengfei Xiong
Wei Zhang
3DH
262
1
0
16 Dec 2024
Bayesian Flow Is All You Need to Sample Out-of-Distribution Chemical Spaces
Nianze Tao
OOD
OODD
BDL
620
0
0
16 Dec 2024
APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-Tuning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Hong-Wei Wu
Wei Wang
Kuang-Da Wang
Chao-Han Huck Yang
LMTD
374
1
0
14 Dec 2024
Exploring Grokking: Experimental and Mechanistic Investigations
Hu Qiye
Zhou Hao
Yu RuoXi
357
1
0
14 Dec 2024
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng
Jing Wang
Fuwei Zhao
Xujie Zhang
Xiaodan Liang
VGen
DiffM
315
2
0
13 Dec 2024
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek
International Conference on Computational Linguistics (COLING), 2024
Lefteris Loukas
Nikolaos Smyrnioudis
Chrysa Dikonomaki
Spyros Barbakos
Anastasios Toumazatos
...
Manolis Kyriakakis
Mary Georgiou
Stavros Vassos
John Pavlopoulos
Ion Androutsopoulos
AILaw
267
2
0
11 Dec 2024
SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent
International Conference on Computational Linguistics (COLING), 2024
Jing Ye
Lu Xiang
Yaping Zhang
Chengqing Zong
422
19
0
11 Dec 2024
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
Computer Vision and Pattern Recognition (CVPR), 2024
Bo Tong
Bokai Lai
Weihao Ye
Gen Luo
Chunjiang Ge
Ke Li
Xiaoshuai Sun
Rongrong Ji
VLM
MLLM
225
4
0
05 Dec 2024
Reinforcement Learning from Wild Animal Videos
Elliot Chane-Sane
Constant Roux
O. Stasse
Nicolas Mansard
925
1
0
05 Dec 2024
Unified Framework for Open-World Compositional Zero-shot Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Hirunima Jayasekara
Khoi Pham
Nirat Saini
Abhinav Shrivastava
266
1
0
05 Dec 2024
GuARD: Effective Anomaly Detection through a Text-Rich and Graph-Informed Language Model
Yunhe Pang
Bo Chen
Fanjin Zhang
Yanghui Rao
Jie Tang
292
0
0
05 Dec 2024
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
Li-Yuan Tsao
Hao-Wei Chen
Hao-Wei Chung
Deqing Sun
Chun-Yi Lee
Kelvin Chan
Ming-Hsuan Yang
DiffM
215
7
0
27 Nov 2024
Cautious Optimizers: Improving Training with One Line of Code
Kaizhao Liang
Lizhang Chen
B. Liu
Qiang Liu
ODL
686
20
0
25 Nov 2024
RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks
International Conference on Learning Representations (ICLR), 2024
Nazia Tasnim
Bryan A. Plummer
CLL
OffRL
444
0
0
25 Nov 2024
Beyond adaptive gradient: Fast-Controlled Minibatch Algorithm for large-scale optimization
Corrado Coppola
Lorenzo Papa
Irene Amerini
L. Palagi
ODL
384
0
0
24 Nov 2024
Financial Risk Assessment via Long-term Payment Behavior Sequence Folding
Industrial Conference on Data Mining (IDM), 2024
Yiran Qiao
Yateng Tang
Xiang Ao
Qi Yuan
Ziming Liu
Chen Shen
Xuehao Zheng
218
0
0
22 Nov 2024
Entropy Bootstrapping for Weakly Supervised Nuclei Detection
James Willoughby
Irina Voiculescu
UQCV
224
0
0
20 Nov 2024
A Theory for Compressibility of Graph Transformers for Transductive Learning
Hamed Shirzad
Honghao Lin
A. Velingker
B. Venkatachalam
David P. Woodruff
Danica J. Sutherland
291
2
0
20 Nov 2024
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Qu He
C. Xu
Jinlong Peng
Jing Zhang
Chengjie Wang
Yunsheng Wu
Yanwei Fu
DiffM
263
25
0
15 Nov 2024
Pay Attention to the Keys: Visual Piano Transcription Using Transformers
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Uros Zivanovic
Ivan Pilkov
Carlos Eduardo Cancino-Chacón
ViT
160
0
0
13 Nov 2024
MEANT: Multimodal Encoder for Antecedent Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Benjamin Iyoya Irving
Annika Marie Schoene
AIFin
172
0
1
10 Nov 2024
Multi-View Majority Vote Learning Algorithms: Direct Minimization of PAC-Bayesian Bounds
Mehdi Hennequin
Abdelkrim Zitouni
K. Benabdeslem
H. Elghazel
Yacine Gaci
301
1
0
09 Nov 2024
Few-Shot Task Learning through Inverse Generative Modeling
Neural Information Processing Systems (NeurIPS), 2024
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
472
4
0
07 Nov 2024
Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual Visual Answer Localization
Zhibin Wen
Bin Li
175
3
0
05 Nov 2024
Expanding Sparse Tuning for Low Memory Usage
Neural Information Processing Systems (NeurIPS), 2024
Shufan Shen
Junshu Sun
Xiangyang Ji
Qingming Huang
Shuhui Wang
317
9
0
04 Nov 2024
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Xiong Wang
Yangze Li
Chaoyou Fu
Chunjiang Ge
Lei Xie
Ke Li
Xing Sun
Long Ma
AuLLM
MLLM
403
99
0
01 Nov 2024
Joint Extraction and Classification of Danish Competences for Job Matching
European Conference on Information Retrieval (ECIR), 2024
Qiuchi Li
Christina Lioma
128
0
0
29 Oct 2024
USpeech: Ultrasound-Enhanced Speech with Minimal Human Effort via Cross-Modal Synthesis
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2024
Luca Jiang-Tao Yu
Running Zhao
Sijie Ji
Edith C.H. Ngai
Chenshu Wu
186
3
0
29 Oct 2024
Super-resolution in disordered media using neural networks
Alexander Christie
Matan Leibovich
Miguel Moscoso
A. Novikov
George Papanicolaou
C. Tsogka
192
0
0
28 Oct 2024
Mixture of Parrots: Experts improve memorization more than reasoning
International Conference on Learning Representations (ICLR), 2024
Samy Jelassi
Clara Mohri
David Brandfonbrener
Alex Gu
Nikhil Vyas
Nikhil Anand
David Alvarez-Melis
Yuanzhi Li
Sham Kakade
Eran Malach
MoE
341
14
0
24 Oct 2024
Lightweight Neural App Control
International Conference on Learning Representations (ICLR), 2024
Filippos Christianos
Georgios Papoudakis
Thomas Coste
Jianye Hao
Jun Wang
Youssef Attia El Hili
LM&Ro
239
9
0
23 Oct 2024
Publishing Neural Networks in Drug Discovery Might Compromise Training Data Privacy
Journal of Cheminformatics (J Cheminform), 2024
Fabian P. Krüger
Johan Östman
Lewis H. Mervin
Igor V. Tetko
Ola Engkvist
223
5
0
22 Oct 2024
Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding
International Conference on Pattern Recognition (ICPR), 2024
Yang Liu
Daizong Liu
Wei Hu
3DPC
366
9
0
21 Oct 2024
Catastrophic Failure of LLM Unlearning via Quantization
International Conference on Learning Representations (ICLR), 2024
Zhiwei Zhang
Fali Wang
Xiaomin Li
Zongyu Wu
Xianfeng Tang
Hui Liu
Qi He
Wenpeng Yin
Suhang Wang
MU
294
5
0
21 Oct 2024
Non-invasive Neural Decoding in Source Reconstructed Brain Space
Yonatan Gideoni
Ryan Charles Timms
Oiwi Parker Jones
210
3
0
20 Oct 2024
Physically Guided Deep Unsupervised Inversion for 1D Magnetotelluric Models
IEEE Geoscience and Remote Sensing Letters (GRSL), 2024
Paul Goyes-Peñafiel
Umair bin Waheed
Henry Arguello
114
1
0
20 Oct 2024
Cliqueformer: Model-Based Optimization with Structured Transformers
J. Kuba
Pieter Abbeel
Sergey Levine
OffRL
AI4CE
447
4
0
17 Oct 2024