Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.05101
Cited By
v1
v2
v3 (latest)
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Katharina Eggensperger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (275★)
Papers citing
"Decoupled Weight Decay Regularization"
50 / 1,216 papers shown
ChainNet: Structured Metaphor and Metonymy in WordNet
Rowan Hall Maudslay
Simone Teufel
Francis Bond
James Pustejovsky
NAI
AI4CE
125
1
0
29 Mar 2024
Noise-Robust Keyword Spotting through Self-supervised Pretraining
Jacob Mork
H. S. Bovbjerg
Gergely Kiss
Zheng-Hua Tan
220
6
0
27 Mar 2024
DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment
Haitao Li
Jiaxin Mao
Xinyan Han
Jia Chen
Qian Dong
Yiqun Liu
Chong Chen
Qi Tian
AILaw
181
11
0
27 Mar 2024
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Haitao Li
Jiaxin Mao
Jia Chen
Qian Dong
Zhijing Wu
Yiqun Liu
Chong Chen
Qi Tian
AILaw
237
24
0
27 Mar 2024
MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping
European Conference on Computer Vision (ECCV), 2024
Jiacheng Chen
Yuefan Wu
Jiaqi Tan
Hang Ma
Yasutaka Furukawa
272
54
0
23 Mar 2024
M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling
Xudong Sun
Nutan Chen
Alexej Gossmann
Yu Xing
Carla Feistner
...
Felix Drost
Daniele Scarcella
Lisa Beer
Carsten Marr
Carsten Marr
393
1
0
20 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLM
CLIP
292
10
0
19 Mar 2024
FinLlama: Financial Sentiment Classification for Algorithmic Trading Applications
Thanos Konstantinidis
Giorgos Iacovides
Mingxue Xu
T. Constantinides
Danilo Mandic
AIFin
183
22
0
18 Mar 2024
A Versatile Framework for Multi-scene Person Re-identification
Wei-Shi Zheng
Junkai Yan
Yi-Xing Peng
VLM
327
17
0
17 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction
International Conference on Language Resources and Evaluation (LREC), 2024
Ziyang Xu
Keqin Peng
Liang Ding
Dacheng Tao
Xiliang Lu
236
19
0
15 Mar 2024
Single Domain Generalization for Crowd Counting
Computer Vision and Pattern Recognition (CVPR), 2024
Zhuoxuan Peng
S.-H. Gary Chan
262
27
0
14 Mar 2024
PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors
European Conference on Computer Vision (ECCV), 2024
Tianyuan Yuan
Yucheng Mao
Jiawei Yang
Yicheng Liu
Yue Wang
Hang Zhao
283
21
0
14 Mar 2024
Identity-aware Dual-constraint Network for Cloth-Changing Person Re-identification
Peini Guo
Mengyuan Liu
Hong Liu
Ruijia Fan
Guoquan Wang
Bin He
422
1
0
13 Mar 2024
Fine-tuning Large Language Models with Sequential Instructions
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hanxu Hu
Simon Yu
Pinzhen Chen
Edoardo Ponti
ALM
LRM
413
21
0
12 Mar 2024
generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation
Thilo Spinner
Rebecca Kehlbeck
Rita Sevastjanova
Tobias Stähle
Daniel A. Keim
Oliver Deussen
Mennatallah El-Assady
275
5
0
12 Mar 2024
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective
Computer Vision and Pattern Recognition (CVPR), 2024
Shunsuke Yasuki
Masato Taki
AAML
320
5
0
11 Mar 2024
Probabilistic Neural Circuits
AAAI Conference on Artificial Intelligence (AAAI), 2024
Pedro Zuidberg Dos Martires
TPM
159
7
0
10 Mar 2024
Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation
Paweł Antoni Pierzchlewicz
Caio da Silva
R. J. Cotton
Fabian H. Sinz
237
2
0
10 Mar 2024
Calibrating Large Language Models Using Their Generations Only
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Dennis Ulmer
Martin Gubri
Hwaran Lee
Sangdoo Yun
Seong Joon Oh
UQLM
782
54
1
09 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
European Conference on Computer Vision (ECCV), 2024
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
262
33
0
08 Mar 2024
The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Random Access in Cellular IoT
IEEE Internet of Things Journal (IEEE IoT J.), 2024
Xin Zhu
Ahmet Enis Cetin
204
0
0
08 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
2.4K
2,755
0
05 Mar 2024
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
Peng Qi
Zehong Yan
Wynne Hsu
Yang Deng
MLLM
347
88
0
05 Mar 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi
Alona Fyshe
254
3
0
04 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
306
132
0
04 Mar 2024
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions
Marta Andronic
George A. Constantinides
228
18
0
29 Feb 2024
PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds
Haotian Liu
Sanqing Qu
Fan Lu
Zongtao Bu
Florian Roehrbein
Alois Knoll
Guang Chen
MDE
207
4
0
29 Feb 2024
Exploring Data-Efficient Adaptation of Large Language Models for Code Generation
Xue Jiang
Yihong Dong
Zhi Jin
Ge Li
Wenpin Jiao
Ge Li
VLM
363
6
0
29 Feb 2024
ConvDTW-ACS: Audio Segmentation for Track Type Detection During Car Manufacturing
Álvaro López-Chilet
Zhaoyi Liu
Jon Ander Gómez
Carlos Alvarez
Marivi Alonso Ortiz
Andres Orejuela Mesa
David Newton
Friedrich Wolf-Monheim
Sam Michiels
Danny Hughes
183
2
0
28 Feb 2024
Where Do We Go from Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions
Tzuf Paz-Argaman
Sayali Kulkarni
John Palowitch
Jason Baldridge
Reut Tsarfaty
158
4
0
26 Feb 2024
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Xiangdi Meng
Damai Dai
Weiyao Luo
Zhe Yang
Shaoxiang Wu
Xiaochen Wang
Peiyi Wang
Qingxiu Dong
Liang Chen
Zhifang Sui
276
18
0
25 Feb 2024
Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving
Yichen Xie
Hongge Chen
Gregory P. Meyer
Yong Jae Lee
Eric M. Wolff
Masayoshi Tomizuka
Wei Zhan
Yuning Chai
Xin Huang
3DPC
198
2
0
23 Feb 2024
Hands-Free VR
J. Fernandez
Jae Joong Lee
Santiago Andrés Serrano Vacca
Alejandra Magana
Bedrich Benes
V. Popescu
131
2
0
23 Feb 2024
Do Efficient Transformers Really Save Computation?
Kai-Bo Yang
Jan Ackermann
Zhenyu He
Guhao Feng
Bohang Zhang
Yunzhen Feng
Qiwei Ye
Di He
Liwei Wang
259
28
0
21 Feb 2024
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
Ikuya Yamada
Ryokan Ri
KELM
282
3
0
18 Feb 2024
Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish
Szymon Ruciñski
233
5
0
15 Feb 2024
Can LLMs Learn New Concepts Incrementally without Forgetting?
Junhao Zheng
Shengjie Qiu
Qianli Ma
CLL
268
0
0
13 Feb 2024
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages
Johnathan Mercer
104
0
0
12 Feb 2024
KVQ: Kwai Video Quality Assessment for Short-form Videos
Computer Vision and Pattern Recognition (CVPR), 2024
Yiting Lu
Xin Li
Yajing Pei
Kun Yuan
Qizhi Xie
Yunpeng Qu
Ming Sun
Chao Zhou
Zhibo Chen
297
43
0
11 Feb 2024
Pushing Boundaries: Mixup's Influence on Neural Collapse
Quinn Fisher
Haoming Meng
Vardan Papyan
AAML
UQCV
209
7
0
09 Feb 2024
Mesoscale Traffic Forecasting for Real-Time Bottleneck and Shockwave Prediction
Raphael Chekroun
Han Wang
Jonathan W. Lee
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
M. D. Monache
169
0
0
08 Feb 2024
Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz
Yair Kittenplon
Aviad Aberdam
Elad Ben Avraham
Oren Nuriel
Shai Mazor
Ron Litman
299
36
0
08 Feb 2024
Improved Generalization of Weight Space Networks via Augmentations
Aviv Shamsian
Aviv Navon
David W. Zhang
Yan Zhang
Ethan Fetaya
Gal Chechik
Haggai Maron
330
18
0
06 Feb 2024
Sentiment-enhanced Graph-based Sarcasm Explanation in Dialogue
IEEE transactions on multimedia (IEEE TMM), 2024
Kun Ouyang
Liqiang Jing
Xuemeng Song
Meng Liu
Yupeng Hu
Liqiang Nie
449
7
0
06 Feb 2024
Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Anthony Sicilia
Hyunwoo J. Kim
Khyathi Chandu
Malihe Alikhani
Jack Hessel
169
3
0
05 Feb 2024
Stable and Robust Deep Learning By Hyperbolic Tangent Exponential Linear Unit (TeLU)
Alfredo Fernandez
Ankur Mali
121
3
0
05 Feb 2024
Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization
Bo Yang
Chen Wang
Xiaoshuang Ma
Beiping Song
Zhuang Liu
Fangde Sun
278
6
0
03 Feb 2024
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
Bharat Runwal
Tejaswini Pedapati
Pin-Yu Chen
MoE
412
8
0
02 Feb 2024
Sample, estimate, aggregate: A recipe for causal discovery foundation models
Menghua Wu
Yujia Bao
Regina Barzilay
Tommi Jaakkola
CML
488
8
0
02 Feb 2024
Development and Adaptation of Robotic Vision in the Real-World: the Challenge of Door Detection
Journal of Field Robotics (JFR), 2024
Michele Antonazzi
Matteo Luperto
N. A. Borghese
Nicola Basilico
279
2
0
31 Jan 2024
Previous
1
2
3
...
8
9
10
...
23
24
25
Next
Page 9 of 25
Page
of 25
Go