Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.16634
Cited By
SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers
29 November 2022
A. Deshpande
Md Arafat Sultan
Anthony Ferritto
A. Kalyan
Karthik Narasimhan
Avirup Sil
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers"
4 / 4 papers shown
Title
Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning
Nusrat Jahan Prottasha
Asif Mahmud
Md. Shohanur Islam Sobuj
Prakash Bhat
Md. Kowsher
Niloofar Yousefi
O. Garibay
22
4
0
11 Oct 2024
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
238
1,898
0
31 Dec 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
Learning Efficient Algorithms with Hierarchical Attentive Memory
Marcin Andrychowicz
Karol Kurach
17
51
0
09 Feb 2016
1