Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.05101
Cited By
v1
v2
v3 (latest)
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Katharina Eggensperger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (275★)
Papers citing
"Decoupled Weight Decay Regularization"
50 / 1,216 papers shown
AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning
IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021
Siddharth Singh
A. Bhatele
GNN
316
21
0
25 Oct 2021
MoDeRNN: Towards Fine-grained Motion Details for Spatiotemporal Predictive Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Runnan Li
Zhengzhuo Xu
Chun Yuan
AI4TS
168
5
0
25 Oct 2021
Seeking Patterns, Not just Memorizing Procedures: Contrastive Learning for Solving Math Word Problems
Zhongli Li
Wenxuan Zhang
Chao Yan
Qingyu Zhou
Chao Li
Hongzhi Liu
Yunbo Cao
AIMat
166
60
0
16 Oct 2021
Control Prefixes for Parameter-Efficient Text Generation
Jordan Clive
Kris Cao
Marek Rei
268
35
0
15 Oct 2021
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm
Shaoyi Huang
Dongkuan Xu
Ian En-Hsu Yen
Yijue Wang
Sung-En Chang
...
Shiyang Chen
Mimi Xie
Sanguthevar Rajasekaran
Hang Liu
Caiwen Ding
CLL
VLM
220
36
0
15 Oct 2021
MixQG: Neural Question Generation with Mixed Answer Types
Lidiya Murakhovs'ka
Chien-Sheng Wu
Philippe Laban
Tong Niu
Wenhao Liu
Caiming Xiong
185
52
0
15 Oct 2021
Cross-Lingual Fine-Grained Entity Typing
N. Selvaraj
Yasumasa Onoe
Greg Durrett
133
2
0
15 Oct 2021
SVG-Net: An SVG-based Trajectory Prediction Model
Mohammadhossein Bahari
Vahid Zehtab
Sadegh Khorasani
Sana Ayromlou
Saeed Saadatnejad
Alexandre Alahi
3DPC
168
5
0
07 Oct 2021
SPEED+: Next-Generation Dataset for Spacecraft Pose Estimation across Domain Gap
T. Park
Marcus Märtens
Gurvan Lécuyer
Dario Izzo
Simone DÁmico
401
136
0
06 Oct 2021
8-bit Optimizers via Block-wise Quantization
Tim Dettmers
M. Lewis
Sam Shleifer
Luke Zettlemoyer
MQ
414
393
0
06 Oct 2021
StairwayGraphNet for Inter- and Intra-modality Multi-resolution Brain Graph Alignment and Synthesis
Islem Mhiri
Mohamed Ali Mahjoub
I. Rekik
MedIm
168
4
0
06 Oct 2021
Recurrent Brain Graph Mapper for Predicting Time-Dependent Brain Graph Evaluation Trajectory
Alpay Tekin
Ahmed Nebli
I. Rekik
106
4
0
06 Oct 2021
One Representative-Shot Learning Using a Population-Driven Template with Application to Brain Connectivity Classification and Evolution Prediction
Umut Guvercin
Mohammed Amine Gharsallaoui
I. Rekik
150
7
0
06 Oct 2021
VTAMIQ: Transformers for Attention Modulated Image Quality Assessment
Andrei Chubarau
James Clark
ViT
245
11
0
04 Oct 2021
ResNet strikes back: An improved training procedure in timm
Ross Wightman
Hugo Touvron
Edouard Grave
AI4TS
562
573
0
01 Oct 2021
MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition
Hanting Li
Ming-Fa Sui
Zhaoqing Zhu
Feng Zhao
ViT
200
4
0
20 Sep 2021
Towards High-Quality Temporal Action Detection with Sparse Proposals
Jiannan Wu
Pei Sun
Shoufa Chen
Jiewen Yang
Zihao Qi
Lan Ma
Ping Luo
ViT
159
11
0
18 Sep 2021
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
Luisa März
Ehsaneddin Asgari
Fabienne Braune
Franziska Zimmermann
Benjamin Roth
GAN
167
2
0
16 Sep 2021
RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization
Chen An
Ming Zhong
Zhichao Geng
Jianqiang Yang
Xipeng Qiu
RALM
204
26
0
16 Sep 2021
ePiC: Employing Proverbs in Context as a Benchmark for Abstract Language Understanding
Sayan Ghosh
Shashank Srivastava
295
16
0
14 Sep 2021
Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction
Xuming Hu
Chenwei Zhang
Yawen Yang
Xiaohe Li
Li Lin
Lijie Wen
Philip S. Yu
164
64
0
14 Sep 2021
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization
Jing Liang
Xiaodong Cun
Chi-Man Pun
Jue Wang
DiffM
254
48
0
13 Sep 2021
End-to-End Conversational Search for Online Shopping with Utterance Transfer
Liqiang Xiao
Jun Ma
Xin Luna Dong
Pascual Martínez-Gómez
Nasser Zalmout
Wei Chen
Tong Zhao
Hao He
Yaohui Jin
109
12
0
12 Sep 2021
D-REX: Dialogue Relation Extraction with Explanations
Alon Albalak
Varun R. Embar
Yi-Lin Tuan
Lise Getoor
Wenjie Wang
169
10
0
10 Sep 2021
Genre as Weak Supervision for Cross-lingual Dependency Parsing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
360
20
0
10 Sep 2021
A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Shilei Liu
Xiaofeng Zhao
Bochao Li
Feiliang Ren
Longhui Zhang
Shujuan Yin
184
35
0
09 Sep 2021
Sequential Attention Module for Natural Language Processing
Mengyuan Zhou
Jian Ma
Haiqing Yang
Lian-Xin Jiang
Yang Mo
AI4TS
89
2
0
07 Sep 2021
FH-SWF SG at GermEval 2021: Using Transformer-Based Language Models to Identify Toxic, Engaging, & Fact-Claiming Comments
Tobias Bornheim
Stephan Bialonski
126
12
0
07 Sep 2021
Towards Improving Adversarial Training of NLP Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Jin Yong Yoo
Yanjun Qi
AAML
508
147
0
01 Sep 2021
CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Hang Li
Yunxing Kang
Tianqiao Liu
Wenbiao Ding
Zitao Liu
173
20
0
01 Sep 2021
Knowledge-Grounded Dialogue with Reward-Driven Knowledge Selection
Natural Language Processing and Chinese Computing (NLPCC), 2021
Shilei Liu
Xiaofeng Zhao
Bochao Li
Feiliang Ren
141
1
0
31 Aug 2021
A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP
Yucheng Zhao
Guangting Wang
Chuanxin Tang
Chong Luo
Wenjun Zeng
Zhengjun Zha
175
90
0
30 Aug 2021
ReGen: Reinforcement Learning for Text and Knowledge Base Generation using Pretrained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Pierre Dognin
Inkit Padhi
Igor Melnyk
Payel Das
OffRL
148
28
0
27 Aug 2021
Guiding Query Position and Performing Similar Attention for Transformer-Based Detection Heads
Xiaohu Jiang
Ze Chen
Zhicheng Wang
Erjin Zhou
Chun Yuan
121
2
0
22 Aug 2021
PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
Xumin Yu
Yongming Rao
Ziyi Wang
Zuyan Liu
Jiwen Lu
Jie Zhou
ViT
266
551
0
19 Aug 2021
Accurate, yet inconsistent? Consistency Analysis on Language Understanding Models
Myeongjun Jang
D. Kwon
Thomas Lukasiewicz
203
14
0
15 Aug 2021
Conditional DETR for Fast Training Convergence
IEEE International Conference on Computer Vision (ICCV), 2021
Depu Meng
Xiaokang Chen
Zejia Fan
Gang Zeng
Houqiang Li
Yuhui Yuan
Lei-huan Sun
Jingdong Wang
ViT
474
862
0
13 Aug 2021
Disentangling Hate in Online Memes
ACM Multimedia (ACM MM), 2021
Rui Cao
Ziqing Fan
Roy Ka-wei Lee
Wen-Haw Chong
Jing Jiang
196
105
0
09 Aug 2021
From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder
ACM Multimedia (ACM MM), 2021
Jiale Li
Hang Dai
Ling Shao
Yong Ding
3DPC
195
58
0
08 Aug 2021
Anchor-free 3D Single Stage Detector with Mask-Guided Attention for Point Cloud
ACM Multimedia (ACM MM), 2021
Jiale Li
Hang Dai
Ling Shao
Yong Ding
SLR
111
35
0
08 Aug 2021
Controllable Summarization with Constrained Markov Decision Process
Transactions of the Association for Computational Linguistics (TACL), 2021
Hou Pong Chan
Lu Wang
Irwin King
398
25
0
07 Aug 2021
Fast Convergence of DETR with Spatially Modulated Co-Attention
IEEE International Conference on Computer Vision (ICCV), 2021
Shiyang Feng
Minghang Zheng
Xiaogang Wang
Jifeng Dai
Jiaming Song
ViT
267
368
0
05 Aug 2021
Large-Scale Differentially Private BERT
Rohan Anil
Badih Ghazi
Vineet Gupta
Ravi Kumar
Pasin Manurangsi
250
148
0
03 Aug 2021
Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation
ACM Multimedia (ACM MM), 2021
Wenkang Shan
Haopeng Lu
Shanshe Wang
Xinfeng Zhang
Wen Gao
3DH
183
70
0
29 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
178
19
0
27 Jul 2021
A Joint and Domain-Adaptive Approach to Spoken Language Understanding
Linhao Zhang
Yu Shi
Linjun Shou
Ming Gong
Houfeng Wang
Michael Zeng
VLM
174
2
0
25 Jul 2021
A Deep Learning-based Quality Assessment and Segmentation System with a Large-scale Benchmark Dataset for Optical Coherence Tomographic Angiography Image
Yu-Fang Wang
Yiqing Shen
Meng Yuan
Jing Xu
B. Yang
Chicheng Liu
Wenjia Cai
Weijing Cheng
Wei Wang
163
22
0
22 Jul 2021
BoningKnife: Joint Entity Mention Detection and Typing for Nested NER via prior Boundary Knowledge
Huiqiang Jiang
Guoxin Wang
Weile Chen
Chengxi Zhang
Börje F. Karlsson
111
5
0
20 Jul 2021
Scene-adaptive Knowledge Distillation for Sequential Recommendation via Differentiable Architecture Search
Lei-tai Chen
Fajie Yuan
Jiaxi Yang
Min Yang
Chengming Li
150
4
0
15 Jul 2021
REX: Revisiting Budgeted Training with an Improved Schedule
Conference on Machine Learning and Systems (MLSys), 2021
John Chen
Cameron R. Wolfe
Anastasios Kyrillidis
160
9
0
09 Jul 2021
Previous
1
2
3
...
19
20
21
...
23
24
25
Next
Page 20 of 25
Page
of 25
Go