Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.05101
Cited By
v1
v2
v3 (latest)
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Katharina Eggensperger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (275★)
Papers citing
"Decoupled Weight Decay Regularization"
50 / 1,216 papers shown
Generalization capabilities of translationally equivariant neural networks
S. S. Krishna Chaitanya Bulusu
Matteo Favoni
A. Ipp
David I. Müller
Daniel Schuh
AI4CE
263
22
0
26 Mar 2021
deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search
ACM Transactions on Software Engineering and Methodology (TOSEM), 2021
Chen Zeng
Yue Yu
Shanshan Li
Xin Xia
Zhiming Wang
Mingyang Geng
Linxiao Bai
Wei Dong
Xiangke Liao
GNN
206
48
0
24 Mar 2021
Prototypical Representation Learning for Relation Extraction
International Conference on Learning Representations (ICLR), 2021
Ning Ding
Xiaobin Wang
Yao Fu
Guangwei Xu
Rui Wang
Pengjun Xie
Ying Shen
Fei Huang
Haitao Zheng
Rui Zhang
145
66
0
22 Mar 2021
Stereo CenterNet based 3D Object Detection for Autonomous Driving
Neurocomputing (Neurocomputing), 2021
Yuguang Shi
Yu Guo
Zhenqiang Mi
Xinjie Li
3DPC
230
48
0
20 Mar 2021
Suppress-and-Refine Framework for End-to-End 3D Object Detection
Social Science Research Network (SSRN), 2021
Zili Liu
Guodong Xu
Honghui Yang
Minghao Chen
Kuoliang Wu
Zheng Yang
Haifeng Liu
Deng Cai
3DPC
149
4
0
18 Mar 2021
R-GSN: The Relation-based Graph Similar Network for Heterogeneous Graph
Xinliang Wu
Mengying Jiang
Guizhong Liu
GNN
172
8
0
14 Mar 2021
Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet Extraction
AAAI Conference on Artificial Intelligence (AAAI), 2021
Shaowei Chen
Yu Wang
Jie Liu
Yuelin Wang
198
204
0
13 Mar 2021
Self-supervised Regularization for Text Classification
Transactions of the Association for Computational Linguistics (TACL), 2021
Meng Zhou
Zechen Li
P. Xie
163
18
0
09 Mar 2021
ZJUKLAB at SemEval-2021 Task 4: Negative Augmentation with Language Model for Reading Comprehension of Abstract Meaning
International Workshop on Semantic Evaluation (SemEval), 2021
Xin Xie
Xiangnan Chen
Xiang Chen
Yong Wang
Ningyu Zhang
Shumin Deng
Huajun Chen
191
2
0
25 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2021
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
430
274
0
20 Feb 2021
Multilingual Answer Sentence Reranking via Automatically Translated Data
Thuy Vu
Alessandro Moschitti
126
5
0
20 Feb 2021
Using Transformer based Ensemble Learning to classify Scientific Articles
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2021
Sohom Ghosh
Ankush Chopra
72
7
0
19 Feb 2021
SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday Acronyms for Scientific Domain
Aadarsh Singh
Priyanshu Kumar
151
9
0
17 Feb 2021
Within-Document Event Coreference with BERT-Based Contextualized Representations
Shafiuddin Rehan Ahmed
James H. Martin
95
0
0
15 Feb 2021
TransReID: Transformer-based Object Re-Identification
IEEE International Conference on Computer Vision (ICCV), 2021
Shuting He
Haowen Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
505
1,066
0
08 Feb 2021
CMS-LSTM: Context Embedding and Multi-Scale Spatiotemporal Expression LSTM for Predictive Learning
IEEE International Conference on Multimedia and Expo (ICME), 2021
Runnan Li
Zhengzhuo Xu
Yunru Bai
Zhihui Lin
Chun Yuan
174
10
0
06 Feb 2021
Meta-Learning for Effective Multi-task and Multilingual Modelling
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Ishan Tarunesh
Sushil Khyalia
Vishwajeet Kumar
Ganesh Ramakrishnan
Preethi Jyothi
244
17
0
25 Jan 2021
EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence Information
Lei Huang
Jiecong Lin
Xiangtao Li
Linqi Song
Ka-Chun Wong
173
31
0
25 Jan 2021
Weakly-Supervised Hierarchical Models for Predicting Persuasive Strategies in Good-faith Textual Requests
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jiaao Chen
Diyi Yang
132
24
0
16 Jan 2021
Training data-efficient image transformers & distillation through attention
International Conference on Machine Learning (ICML), 2020
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Edouard Grave
ViT
647
8,229
0
23 Dec 2020
g2tmn at Constraint@AAAI2021: Exploiting CT-BERT and Ensembling Learning for COVID-19 Fake News Detection
Anna Glazkova
Maksim Glazkov
T. Trifonov
330
66
0
22 Dec 2020
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Weizhou Shen
Junqing Chen
Xiaojun Quan
Zhixiang Xie
281
243
0
16 Dec 2020
Topological Planning with Transformers for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2020
Kevin Chen
Junshen K. Chen
Jo Chuang
Hao-Tien Lewis Chiang
Silvio Savarese
LM&Ro
217
136
0
09 Dec 2020
Cross-Domain Sentiment Classification with In-Domain Contrastive Learning
Tian Li
Xiang Chen
Shanghang Zhang
Zhen Dong
Kurt Keutzer
127
4
0
05 Dec 2020
Coarse-to-Fine Entity Representations for Document-level Relation Extraction
Natural Language Processing and Chinese Computing (NLPCC), 2020
Damai Dai
Jingjing Ren
Shuang Zeng
Baobao Chang
Zhifang Sui
AI4TS
294
3
0
04 Dec 2020
Stochastic Gradient Descent with Nonlinear Conjugate Gradient-Style Adaptive Momentum
Bao Wang
Qiang Ye
ODL
191
16
0
03 Dec 2020
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances
AAAI Conference on Artificial Intelligence (AAAI), 2020
X. Gu
Kang Min Yoo
Jung-Woo Ha
312
79
0
03 Dec 2020
End-to-End Object Detection with Adaptive Clustering Transformer
British Machine Vision Conference (BMVC), 2020
Minghang Zheng
Shiyang Feng
Renrui Zhang
Kunchang Li
Xiaogang Wang
Jiaming Song
Hao Dong
ViT
331
222
0
18 Nov 2020
EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation
Computer Vision and Pattern Recognition (CVPR), 2020
Yang Jiao
T. Tran
Guangming Shi
281
39
0
16 Nov 2020
Mixing ADAM and SGD: a Combined Optimization Method
Nicola Landro
I. Gallo
Riccardo La Grassa
ODL
140
27
0
16 Nov 2020
Real-Time Intermediate Flow Estimation for Video Frame Interpolation
European Conference on Computer Vision (ECCV), 2020
Zhewei Huang
Tianyuan Zhang
Wen Heng
Boxin Shi
Shuchang Zhou
568
271
0
12 Nov 2020
Scaling Hidden Markov Language Models
Justin T. Chiu
Alexander M. Rush
BDL
191
27
0
09 Nov 2020
Reverse engineering learned optimizers reveals known and novel mechanisms
Niru Maheswaranathan
David Sussillo
Luke Metz
Ruoxi Sun
Jascha Narain Sohl-Dickstein
330
26
0
04 Nov 2020
Multi-View Adaptive Fusion Network for 3D Object Detection
Guojun Wang
Bin Tian
Yachen Zhang
Long Chen
Dongpu Cao
Jian Wu
3DPC
200
29
0
02 Nov 2020
CHIME: Cross-passage Hierarchical Memory Network for Generative Review Question Answering
International Conference on Computational Linguistics (COLING), 2020
Junru Lu
Gabriele Pergola
Lin Gui
Binyang Li
Yulan He
126
8
0
01 Nov 2020
EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising
International Conference on the Software Process (ICSP), 2020
Tengfei Liang
Yi Jin
Yidong Li
Tao Wang
Songhe Feng
Congyan Lang
190
135
0
30 Oct 2020
Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Tian Li
Xiang Chen
Shanghang Zhang
Zhen Dong
Kurt Keutzer
237
41
0
30 Oct 2020
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
473
555
0
28 Oct 2020
Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Jianguo Zhang
Kazuma Hashimoto
Wenhao Liu
Chien-Sheng Wu
Yao Wan
Philip S. Yu
R. Socher
Caiming Xiong
215
98
0
25 Oct 2020
Optimal Subarchitecture Extraction For BERT
Adrian de Wynter
Daniel J. Perry
MQ
222
18
0
20 Oct 2020
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
William Merrill
Vivek Ramanujan
Yoav Goldberg
Roy Schwartz
Noah A. Smith
AI4CE
593
42
0
19 Oct 2020
DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation
Yuzhao Mao
Qi Sun
Guang Liu
Xiaojie Wang
Weiguo Gao
Xuan Li
Jianping Shen
139
37
0
15 Oct 2020
Self-Supervised Multi-View Synchronization Learning for 3D Pose Estimation
Simon Jenni
Paolo Favaro
3DH
174
11
0
13 Oct 2020
End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems
Siamak Shakeri
Cicero Nogueira dos Santos
He Zhu
Patrick Ng
Feng Nan
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
OOD
236
116
0
12 Oct 2020
Beyond Language: Learning Commonsense from Images for Reasoning
Findings (Findings), 2020
Wanqing Cui
Yanyan Lan
Liang Pang
Jiafeng Guo
Xueqi Cheng
LRM
140
5
0
10 Oct 2020
NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training
Priyanshu Kumar
Aadarsh Singh
FedML
103
17
0
09 Oct 2020
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Kyle Lo
Aida Amini
Aman Rangapur
Madeleine van Zuylen
Sravanthi Parasa
Eric Horvitz
Daniel S. Weld
Roy Schwartz
Hannaneh Hajishirzi
347
30
0
08 Oct 2020
Like hiking? You probably enjoy nature: Persona-grounded Dialog with Commonsense Expansions
Bodhisattwa Prasad Majumder
Harsh Jhamtani
Taylor Berg-Kirkpatrick
Julian McAuley
170
90
0
07 Oct 2020
Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning
Subhojeet Pramanik
Shashank Mujumdar
Hima Patel
268
33
0
30 Sep 2020
Deep EvoGraphNet Architecture For Time-Dependent Brain Graph Data Synthesis From a Single Timepoint
Ahmed Nebli
Uğur Ali Kaplan
I. Rekik
AI4TS
112
18
0
28 Sep 2020
Previous
1
2
3
...
21
22
23
24
25
Next