arXiv:2104.04692
Not All Attention Is All You Need
10 April 2021
Hongqiu Wu
Hai Zhao
Min Zhang
Papers citing "Not All Attention Is All You Need"
7 / 7 papers shown
Pyramid-BERT: Reducing Complexity via Successive Core-set based Token Selection
Xin Huang
A. Khetan
Rene Bidart
Zohar S. Karnin
17
14
0
27 Mar 2022
R-Drop: Regularized Dropout for Neural Networks
Xiaobo Liang
Lijun Wu
Juntao Li
Yue Wang
Qi Meng
Tao Qin
Wei Chen
M. Zhang
Tie-Yan Liu
31
424
0
28 Jun 2021
AutoDropout: Learning Dropout Patterns to Regularize Deep Networks
Hieu H. Pham
Quoc V. Le
66
56
0
05 Jan 2021
Code Summarization with Structure-induced Transformer
Hongqiu Wu
Hai Zhao
Min Zhang
28
84
0
29 Dec 2020
How Does Selective Mechanism Improve Self-Attention Networks?
Xinwei Geng
Longyue Wang
Xing Wang
Bing Qin
Ting Liu
Zhaopeng Tu
AAML
34
35
0
03 May 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,950
0
20 Apr 2018
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
264
5,326
0
05 Nov 2016