2402.02622
DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging
4 February 2024
Matteo Pagliardini
Amirkeivan Mohtashami
F. Fleuret
Martin Jaggi
Papers citing
"DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging"
5 / 5 papers shown
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook
Muyi Bao
Shuchang Lyu
Zhaoyang Xu
Huiyu Zhou
Jinchang Ren
Shiming Xiang
X. Li
Guangliang Cheng
Mamba
72
0
0
01 May 2025
MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections
Da Xiao
Qingye Meng
Shengping Li
Xingyuan Yuan
MoE
AI4CE
54
0
0
13 Feb 2025
Transformer Layers as Painters
Qi Sun
Marc Pickett
Aakash Kumar Nain
Llion Jones
AI4CE
29
13
0
12 Jul 2024
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
242
1,977
0
31 Dec 2020
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
244
35,884
0
25 Aug 2016