Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.20672
Cited By
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
28 October 2024
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA"
3 / 3 papers shown
Title
KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters
Haiduo Huang
Yadong Zhang
Pengju Ren
42
0
0
30 Mar 2025
On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding
Kevin Xu
Issei Sato
26
3
0
02 Oct 2024
Block Transformer: Global-to-Local Language Modeling for Fast Inference
Namgyu Ho
Sangmin Bae
Taehyeon Kim
Hyunjik Jo
Yireun Kim
Tal Schuster
Adam Fisch
James Thorne
Se-Young Yun
32
6
0
04 Jun 2024
1