Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

9 September 2024

Papers citing "Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling"

2 / 2 papers shown

Title
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models Siddharth Karamcheti Suraj Nair Ashwin Balakrishna Percy Liang Thomas Kollar Dorsa Sadigh MLLM VLM 57 97 0 12 Feb 2024
Repeat After Me: Transformers are Better than State Space Models at Copying Samy Jelassi David Brandfonbrener Sham Kakade Eran Malach 95 78 0 01 Feb 2024