ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.06170
  4. Cited By
Transformer on a Diet

Transformer on a Diet

14 February 2020
Chenguang Wang
Zihao Ye
Aston Zhang
Zheng Zhang
Alex Smola
ArXiv (abs)PDFHTML

Papers citing "Transformer on a Diet"

6 / 6 papers shown
Title
Activator: GLU Activation Function as the Core Component of a Vision
  Transformer
Activator: GLU Activation Function as the Core Component of a Vision Transformer
Abdullah Nazhat Abdullah
Tarkan Aydin
ViT
75
0
0
24 May 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated
  Gating Function
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
79
0
0
04 Mar 2024
Adaptive Multi-Resolution Attention with Linear Complexity
Adaptive Multi-Resolution Attention with Linear Complexity
Yao Zhang
Yunpu Ma
T. Seidl
Volker Tresp
35
1
0
10 Aug 2021
A Practical Survey on Faster and Lighter Transformers
A Practical Survey on Faster and Lighter Transformers
Quentin Fournier
G. Caron
Daniel Aloise
137
104
0
26 Mar 2021
Data-Efficient Pretraining via Contrastive Self-Supervision
Data-Efficient Pretraining via Contrastive Self-Supervision
Nils Rethmeier
Isabelle Augenstein
109
21
0
02 Oct 2020
SqueezeBERT: What can computer vision teach NLP about efficient neural
  networks?
SqueezeBERT: What can computer vision teach NLP about efficient neural networks?
F. Iandola
Albert Eaton Shaw
Ravi Krishna
Kurt Keutzer
VLM
90
128
0
19 Jun 2020
1