2409.05395
Cited By
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
9 September 2024
Georgios Pantazopoulos
Malvina Nikandrou
Alessandro Suglia
Oliver Lemon
Arash Eshghi
Mamba
Papers citing "Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling"
2 / 2 papers shown
Title
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models
Siddharth Karamcheti
Suraj Nair
Ashwin Balakrishna
Percy Liang
Thomas Kollar
Dorsa Sadigh
MLLM
VLM
57
97
0
12 Feb 2024
Repeat After Me: Transformers are Better than State Space Models at Copying
Samy Jelassi
David Brandfonbrener
Sham Kakade
Eran Malach
95
78
0
01 Feb 2024